Gene Dole_0684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0684 
Symbol 
ID5693514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp809496 
End bp811442 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content62% 
IMG OID641263276 
Productputative molybdopterin biosynthesis protein MoeA/LysR substrate binding-domain-containing protein 
Protein accessionYP_001528571 
Protein GI158520701 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme
[COG1910] Periplasmic molybdate-binding protein/domain 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGACT ATTCAACCAA CATATACTTA AAGAAAAAAA GCCTCACCGA GGCCCGGACC 
CTTCTGTTTG AAAAATTTGC GGCCCTTATG AAAACCGCCG CCGCCGAAAC CGTTCCCACG
CCAAATGCCG TGGGCCGGGT TTTGGCCCGC GGCGTGCCGG CCCTGCGCTC TTCTCCGGCC
CATCACCTGT CCGCCATGGA CGGCATTGCG GTAAAAGCCG AGCAGACCTT CGGCGCCGGC
GAGACCACTC CCAAAACCCT GGCCGTGGGC AGGGAAGCCT TTTTTGTCAA CACCGGCAAC
CTGCTGCCCG CCGGCACCAA TGCCGTGATC ATGATCGAGG ACGTTCACGT GATTGACGAC
GCCACGGTGG AGATCCTGGC CCCGTCCTTT CCCTGGCAGT ATGTGCGAAA GGCCGGGGAA
GACATCGTGG CCACGGAACT GCTCTTTCCC ACCAACCATG TGGTCACCCC CTATTGCGTG
GGCGCTTTGC TGGCCGCCGG CGTGACATCG GTGTCCGTGC GCAAAAAGCC CCGCGTGCTG
ATCCTTCCCA CCGGCAGTGA GCTGGTGGAC TGGGAAGACA AAACAGACCC GGCCGGCCTT
GCGCCGGGAA AGATTCTGGA ATCCAACTCC TACATGCTGG GGGCCCTGGT CCGTGCCTGC
GGCGCCGAAC CGGTGCGCCA CCCCATTTTG CCCGATGATC CGGACACCAT TGCCCAAGCC
CTGAAAAAGG CCGTTGACGA CGGCGATTAC CAGGCGGTGA TGCTGGTCGG CGGCTCATCA
GCCGGGGCCA AAGACTATTC ATGGATCGTG ATCGACCGCC TTGGGGAGGT GTTTGTTCAC
GGCCTGACCA TCATGCCGGG AAAGCCCCTG ATCATCGGTG CTGTTTCAAC GGTGCCGGTG
TTCGGCATGC CCGGCTATCC CGTGTCCGCC GTGGTCTGTT TTGAAGAGCT GGTCCGGCCG
TTGCTGTGCC GGATGCAGGG CCTGCCCGTG CCGGTGCGAA AAACCGTTTC CGCCATCCTG
GCAAAAAAGA CCGCTTCAAA GCTGGGCGTG GAGGAGTTTC TGCGGGTCAA GCTGGGCAAG
GTGGACGACA TTGTTATTGC CACCCAGTTG CCCCGGGGGG CCGGATCGGT CACCTCCCTG
GCCGATGCCG ACGCCTTTGT GAGAGTGCCG GCATCCACCG AGGGCATGGC CGAGGCCCAG
GCCGTCACCG CCGAGCTGCT GCGGCCCCTG TCCGACATCG AGCAGACCGT GGTGGTGGTG
GGCAGCCACG ACAACACCCT GGACGTGCTT TCCGACATGA TCGCGGCCCG GGGCCTGGGC
TTCAAGCTGG CCTCCACCCA CGTGGGCAGC ATGGGCGGGC TCATGGCCGT TGCCAGGGGC
CGTTGCCACG TGGCCGGCAG CCACCTGCTG GACGAAAAGA CCGGTGAATA TAATATTTCC
TACATTCAAA AACACCTGGC CGGCATGCCC GTCAAACTGG TCAAGCTGGT CTCCCGGGAA
CAGGGCCTGA TGGTGGTGCC GGGCAACCCC AAAAACATTT CCGGCATCGC TGACATCGCC
CGAACCGGGG TCACCTTTAT TAACCGCCAG GCCGGTTCCG GCACCCGGAT CCTGCTGGAC
TTCAAGCTCA AAGAGCTGGG CATCGACCCC GGGGCCATTG ACGGTTACGC CAATGACGAA
TATACCCACA TGTCCGTGGC CATTGCCGTG GCCAGCGGTG TGGCCGACAC CGGCCTGGGC
ATTCTGGCCG CGGCCCGCGC CCTGGGCCTT GATTTTATTC CCGTGGTCTC CGAAGAATAC
GACCTGGTCA TTCCGGCCCG GTTCTTTGAC CTGCCCGGCA TCGCCACCCT GCTCAAAGTG
ATTCAAAGCC CGGCCTTTGC CGAACGGGTC AACACCCTGG GCGGCTACGG CACCCACAAC
ACCGGCAAGG TGATTGATCT GGGCTGA
 
Protein sequence
MPDYSTNIYL KKKSLTEART LLFEKFAALM KTAAAETVPT PNAVGRVLAR GVPALRSSPA 
HHLSAMDGIA VKAEQTFGAG ETTPKTLAVG REAFFVNTGN LLPAGTNAVI MIEDVHVIDD
ATVEILAPSF PWQYVRKAGE DIVATELLFP TNHVVTPYCV GALLAAGVTS VSVRKKPRVL
ILPTGSELVD WEDKTDPAGL APGKILESNS YMLGALVRAC GAEPVRHPIL PDDPDTIAQA
LKKAVDDGDY QAVMLVGGSS AGAKDYSWIV IDRLGEVFVH GLTIMPGKPL IIGAVSTVPV
FGMPGYPVSA VVCFEELVRP LLCRMQGLPV PVRKTVSAIL AKKTASKLGV EEFLRVKLGK
VDDIVIATQL PRGAGSVTSL ADADAFVRVP ASTEGMAEAQ AVTAELLRPL SDIEQTVVVV
GSHDNTLDVL SDMIAARGLG FKLASTHVGS MGGLMAVARG RCHVAGSHLL DEKTGEYNIS
YIQKHLAGMP VKLVKLVSRE QGLMVVPGNP KNISGIADIA RTGVTFINRQ AGSGTRILLD
FKLKELGIDP GAIDGYANDE YTHMSVAIAV ASGVADTGLG ILAAARALGL DFIPVVSEEY
DLVIPARFFD LPGIATLLKV IQSPAFAERV NTLGGYGTHN TGKVIDLG