Gene Dole_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2079 
Symbol 
ID5694922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2530358 
End bp2531449 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content62% 
IMG OID641264680 
Productriboflavin biosynthesis protein RibD 
Protein accessionYP_001529960 
Protein GI158522090 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0117] Pyrimidine deaminase
[COG1985] Pyrimidine reductase, riboflavin biosynthesis 
TIGRFAM ID[TIGR00227] riboflavin-specific deaminase C-terminal domain
[TIGR00326] riboflavin biosynthesis protein RibD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0038813 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGATA CCTATTTCAT GAATATGGCC CTGGGCCTGG CCGAACGGGG AACCGGTTTT 
ACTTCTCCCA ACCCCGTGGT GGGTGCCGTG GTGGTGCGGG ACGGCCGGGT GGTGGGCCAG
GGGTTTCACG CGGCAGCGGG AAAACCCCAC GCCGAAGTGG TGGCCATTGA CGATGCCGGT
GAACTGGCCC GTGGCGCCGA TCTTTACGTG ACCCTGGAAC CCTGCAATCA TACGGGCCGG
ACCCCGCCCT GTACGCGGAA GATTTGCGAT GCCGGCATCG CCCGGGTGGT GGTGGCCACG
ATCGACTCCA ACCCCCACGT GGCCGGCGGC GGCATTCAAT ACCTGGAGTC CCGGGGTATT
GCCGTTACTG TGGGGGTCTG CGAGGCCGAG GCCAAAAAGC AGATCGAGTG GTTTGCAAAA
TATGTGACCA CGGGGCGGCC TTTTGTCACG GTCAAGTGCG CCATGACGTT AGACGGCCGC
ATCGCCACAC GCACCGGCGA TTCCAAATGG GTTACCGGAG AGGCGGCCCG TGCCTTTGTG
CACAGGATGC GCCACGCGTC CGACGCCATC CTGGTAGGCG TGGGCACGGC CAACGCCGAT
AATCCCCGGC TCACGGCCCG GGTTGAGGGC ATGCGAACCA GAGATCCCAT GCGGATCGTG
CTGGACACAA AGCTCTCCAT TCGAGAGGAT GCCGCGATGT TTGACCTGGA TTCGTCCGCT
GAAACGCTGA TCGTGGTGGG GCCGGGGCAT GACAAAAAAA AACGGGACCG GCTCAAGGGA
AAGGCCCGGT TTTTTGATGC GGCGCTGGCC ATCGGTCGGA TTGACATGGC CGGGCTCATG
GACGGCCTTG GAAAGATGGG GATTACCAGC CTGCTGATTG AGGGAGGGGG TCAGGTGATC
GGGGCCGCGT TTGCGGCCGG CATCGTGGAC AAGGTATGCT TTTTTTACGC CCCGAAAATT
CTGGGCGGCG ACGACGGTGT TCCCGTGTGC GCGGGAGCGG GACAGGAACA GATGAAAAAT
GCGTTGCCGG TGCGGAATGT GTCCGTGACC CGGTTTGATG ATGACATTCT GATCGAGGGG
TATGTGTCCT GA
 
Protein sequence
MDDTYFMNMA LGLAERGTGF TSPNPVVGAV VVRDGRVVGQ GFHAAAGKPH AEVVAIDDAG 
ELARGADLYV TLEPCNHTGR TPPCTRKICD AGIARVVVAT IDSNPHVAGG GIQYLESRGI
AVTVGVCEAE AKKQIEWFAK YVTTGRPFVT VKCAMTLDGR IATRTGDSKW VTGEAARAFV
HRMRHASDAI LVGVGTANAD NPRLTARVEG MRTRDPMRIV LDTKLSIRED AAMFDLDSSA
ETLIVVGPGH DKKKRDRLKG KARFFDAALA IGRIDMAGLM DGLGKMGITS LLIEGGGQVI
GAAFAAGIVD KVCFFYAPKI LGGDDGVPVC AGAGQEQMKN ALPVRNVSVT RFDDDILIEG
YVS