Gene EcolC_3219 details


Gene Information

Locus tag: EcolC_3219
Symbol: ribD
ID: 6066717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3525089
End bp: 3526192
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 55%
IMG OID: 641602634
Product: bifunctional diaminohydroxyphosphoribosylaminopyrimidine deaminase/5-amino-6-(5-phosphoribosylamino)uracil reductase
Protein accession: YP_001726168
Protein GI: 170021214
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0117] Pyrimidine deaminase
        [COG1985] Pyrimidine reductase, riboflavin biosynthesis
TIGRFAM ID: [TIGR00227] riboflavin-specific deaminase C-terminal domain
            [TIGR00326] riboflavin biosynthesis protein RibD
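The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3525089, 3526192)
print(length)                              # 1104
print(expected_protein_length_aa(length))  # 367
```

Both values agree with the Gene Length (1104 bp) and Protein Length (367 aa) listed in the record.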


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000143297
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCAGGACG AGTATTACAT GGCGCGGGCG CTAAAGCTGG CGCAACGAGG ACGTTTTACC 
ACGCATCCCA ACCCGAATGT CGGGTGCGTC ATTGTCAAAG ATGGCGAAAT TGTCGGTGAA
GGTTACCACC AACGTGCGGG TGAACCACAT GCCGAAGTAC ACGCGTTGCG TATGGCGGGT
GAAAAAGCCA AAGGTGCGAC CGCCTATGTC ACACTCGAAC CCTGTAGCCA TCATGGTCGT
ACGCCACCGT GCTGTGACGC ACTCATCGCC GCTGGCGTAG CGCGCGTGGT TGCCTCGATG
CAAGATCCTA ACCCGCAGGT CGCTGGGCGT GGACTTTACC GTCTGCAACA GGCTGGCATT
GACGTCAGCC ACGGCCTGAT GATGAGTGAA GCCGAGCAAT TGAATAAAGG CTTTCTCAAG
CGGATGCGCA CCGGCTTTCC TTATATTCAG TTAAAACTTG GCGCATCGCT TGATGGTCGC
ACGGCGATGG CGAGCGGCGA AAGCCAGTGG ATCACTTCGC CCCAGGCGCG GCGTGATGTA
CAACTACTGC GCGCGCAAAG TCATGCCATT TTAACCAGCA GCGCCACGGT GCTGGCGGAT
GATCCTGCCT TAACGGTGCG TTGGTCTGAA CTGGATGAAC AAACTCAGGC GCTCTATCCG
CAACAAAATC TCCGTCAGCC GATACGTATT GTGATTGATA GCCAAAATCG CGTGACGCCG
GTACATCGCA TTGTGCAGCA GCCCGGCGAA ACCTGGTTCG CGCGTACGCA GGAAGATTCT
CGTGAGTGGC CGGAAACGGT GCGTACCTTG CTGATTCCAG CGCATAAAGG TCATCTGGAT
CTGGTTGTAC TGATGATGCA ACTGGGTAAA CAGCAAATTA ACAGCATCTG GGTGGAAGCG
GGGCCAACGC TCGCTGGCGC ATTGCTGCAG GCGGGTTTAG TCGATGAGCT GATTGTCTAT
ATCGCACCTA AACTATTAGG CAGCGACGCC CGTGGATTAT GCTCGCTGCC AGGGCTTGAG
AAATTAGCCG ACGCCCCCCA ATTTAAATTC AAAGAGATAC GTCATGTAGG CCCGGATGTT
TGCCTGCATT TAGTAGGTGC ATGA
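
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation. The following minimal sketch shows one way to strip the layout and compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage: paste the full gene sequence block as a multi-line string.
# Here only the first two blocks of the record are used for illustration.
fragment = "GTGCAGGACG AGTATTACAT"
print(len(clean_sequence(fragment)))  # 20
print(f"{gc_fraction(fragment):.0%}")
```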
 
Protein sequence
MQDEYYMARA LKLAQRGRFT THPNPNVGCV IVKDGEIVGE GYHQRAGEPH AEVHALRMAG 
EKAKGATAYV TLEPCSHHGR TPPCCDALIA AGVARVVASM QDPNPQVAGR GLYRLQQAGI
DVSHGLMMSE AEQLNKGFLK RMRTGFPYIQ LKLGASLDGR TAMASGESQW ITSPQARRDV
QLLRAQSHAI LTSSATVLAD DPALTVRWSE LDEQTQALYP QQNLRQPIRI VIDSQNRVTP
VHRIVQQPGE TWFARTQEDS REWPETVRTL LIPAHKGHLD LVVLMMQLGK QQINSIWVEA
GPTLAGALLQ AGLVDELIVY IAPKLLGSDA RGLCSLPGLE KLADAPQFKF KEIRHVGPDV
CLHLVGA
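
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below builds the codon table from the NCBI base ordering (TCAG); table 11 uses the same amino acid assignments as the standard code but accepts additional initiation codons, including the GTG that opens this gene, which is read as methionine at the first position. The function and constant names are hypothetical, and the code assumes an unambiguous A/C/G/T sequence.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(raw: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if pos == 0 and codon in START_CODONS_TABLE_11:
            residue = "M"  # an initiation codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence above.
print(translate_cds("GTGCAGGACG AGTATTACAT G"))  # MQDEYYM
```

The output matches the first seven residues of the protein sequence listed above.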