Gene Dole_1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1097 
Symbol 
ID5693931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1305011 
End bp1307236 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content59% 
IMG OID641263691 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_001528981 
Protein GI158521111 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCCGT TTCTAAACAC CCTGGCAAAA CCGGTCACCG TTCACACCGA GACACTTTGC 
CTGGACGAAC CGTTTGAAAC CACGGCGGCC CGGTTTGCCG ATCGGCCCCA CACGGTGGTT
CTGCTGTCCG GCGGCGATGC CGACAGCGCC CGGTATCACC TGCTGGCAAC AGACCCATGG
CTGATCTTTA CCGCCAGGAG CCGCCGCCTG GAACTTCAAA CGGAAAACGG TTCCCACACC
TTTACCGGCC ACCCTGTTGA GTCCTTAAAA AAAATCCTTG CCCGGTTTCA CATGGCGGCC
GCTGATCTGC CGGCCCCTGT CGGCGCCGGC CTGTTCGGAT ATCTGGCCTA TGACGTAAAG
GATTGCCTTG AACATCTTCC CCGCACATCA GTGAATGATC TTTGCCTGCC CGATCTTTAC
ATGTTTGCCC CCCGGCTCAT TCTGGTCCAT GACAAAAAAG AAAACACCAC CCGGCTCTGT
GTCCCCGAAT TGTCGGCTGA CGCCACCCGG CAGGCCCTGG ACCGGTTTTA CGCGGTCATG
AACCAGCCGG CCCCGGTTCC CGGACCGGTT TCCATTGGCA ACGGTTTTGC CTCCAACTTC
ACCCGGCCCG ATTACGAGGC GGCCGTGGCC CGCATTCGGG ACTACATTGC CGCCGGTGAT
GTCTACCAGG TCAACATGAG CCAGCGGTTT GAAACCGCCT TTGACGGCGA CCCCTATGCC
CTTTTTGCCC GGCTCTACAA AAAAAACCCG GCTCCGTTTT TTGCCTATGT CCATGCCGGC
GACCATCACC TTCTTTCCAC GTCGCCGGAG CGGTTTCTGC TGCAACAGGG CCGTTTCGTG
GAGACCCGGC CCATCAAGGG AACCCGGCCC AGGGGCAAGA CGCCGGACCA GGACGCGCAA
AACAGAAAAG ACCTGCTGCA AAGCAAAAAG GACGATGCCG AGCTTTCCAT GATCGTGGAC
CTGCTGCGAA ACGACCTGGG CCGGGTCTGC GCCTCCGGCA CCGTAAAAGT GGCCGGCCAT
AAGAGGCTGG AGGCCTATGA CAACGTCTTT CACATGGTGT CGGTGGTCAC CGGCGAACTG
GCCCAAAACC GGGACACGGC AGACCTTGTC GAGGCCGCCT TTCCCGGCGG GTCGATTACC
GGGTGTCCCA AAATCCGGTC CATGGAGATC ATCGACGAGC TGGAGCCCTG CCGCCGCCAC
ATCTACACCG GCTCTATCGG GTACATCAGC TTTCACGACA CCGCCGACCT CTCCATTGCC
ATTCGCACCG CCACCCTGCA CGGGGGCCGA CTCTTTTATT CAGCCGGCGG CGGCATTGTG
TACGACTCGG TGCCATCGGA AGAGTATGAA GAGACCCTGG CAAAGGCCCG GACCATGTTG
GACGCCTTTG CCGGGCCGGA GATTGTGCCG GCAAGCCGGC CTCTTGTCTG GCTCAACGGC
CGCATAATTA AAGAAAACGA GGCCGCTGTT CCTCTTGCAT CGCCGGGGTT TCAATACGGG
GCCGGCCTGT TTGAAACCAT TCGGGCCGAC AATGGAACGC CCCGCCTGCT GGACGCCCAC
ATTCAGCGGT TCAACCACTC CTGGCCCGAA ATTTTTCCCG GACCGGCCCC GGACCTGACA
TGGAGTACCG TTATCGAACA GGTGCTTTCA GCCAACGGTC TGGGCAACGG CCCGGCAGCC
GTCAAAATCA TGGCCTCCCT GGGACAGAAA GAAACCGCGC CCTTTGATTA CACCCTGGCG
GTAACAGCCA GGCCTTACAC CCACCGGCTT GCCCTGATAA ACAAAAAGGG CATTGACCTT
GCCCTTTACC CCGAACCCCG GCAGATTGCC ACGGCCCGGC ACAAAACCAT AAACTACCTG
TTCTATTTTC AGGCCGGAAA GTGGGCCAAA GCACAGGGCG CGGACGAAGC ATTGATTCTC
AACCCGGACA ACACCCTTTC GGAAACCAAC ACCGCCAACC TGATGCTGGC CTGCGGAAAC
ACCATCCTTG TGCCGCAATC CGCCACAGTG CTGCCCGGCA TCATGCAACA GGCCGCGCTT
GAGCTGCTTC AATCATGGGG ATACGAAATC AAAGAAACGC CGGTCACCCT GAACCAGGCC
CTTGAGGCGG ACGGCCTGCT TGTGACCAAC TCCCTGATGG GCGCCGTGCC GGTGCTGAGC
ATTGATAGCC GGAAAGCCTC CCCGGCATTT GAGTTGTGTG AAAGGATTAA TCGGGGAATG
GGATAA
 
Protein sequence
MDPFLNTLAK PVTVHTETLC LDEPFETTAA RFADRPHTVV LLSGGDADSA RYHLLATDPW 
LIFTARSRRL ELQTENGSHT FTGHPVESLK KILARFHMAA ADLPAPVGAG LFGYLAYDVK
DCLEHLPRTS VNDLCLPDLY MFAPRLILVH DKKENTTRLC VPELSADATR QALDRFYAVM
NQPAPVPGPV SIGNGFASNF TRPDYEAAVA RIRDYIAAGD VYQVNMSQRF ETAFDGDPYA
LFARLYKKNP APFFAYVHAG DHHLLSTSPE RFLLQQGRFV ETRPIKGTRP RGKTPDQDAQ
NRKDLLQSKK DDAELSMIVD LLRNDLGRVC ASGTVKVAGH KRLEAYDNVF HMVSVVTGEL
AQNRDTADLV EAAFPGGSIT GCPKIRSMEI IDELEPCRRH IYTGSIGYIS FHDTADLSIA
IRTATLHGGR LFYSAGGGIV YDSVPSEEYE ETLAKARTML DAFAGPEIVP ASRPLVWLNG
RIIKENEAAV PLASPGFQYG AGLFETIRAD NGTPRLLDAH IQRFNHSWPE IFPGPAPDLT
WSTVIEQVLS ANGLGNGPAA VKIMASLGQK ETAPFDYTLA VTARPYTHRL ALINKKGIDL
ALYPEPRQIA TARHKTINYL FYFQAGKWAK AQGADEALIL NPDNTLSETN TANLMLACGN
TILVPQSATV LPGIMQQAAL ELLQSWGYEI KETPVTLNQA LEADGLLVTN SLMGAVPVLS
IDSRKASPAF ELCERINRGM G