Gene EcSMS35_0450 details


Gene Information

Locus tag: EcSMS35_0450
Symbol: ribD
ID: 6145022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 459611
End bp: 460714
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 55%
IMG OID: 641615344
Product: bifunctional diaminohydroxyphosphoribosylaminopyrimidine deaminase/5-amino-6-(5-phosphoribosylamino)uracil reductase
Protein accession: YP_001742551
Protein GI: 170680144
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0117] Pyrimidine deaminase; [COG1985] Pyrimidine reductase, riboflavin biosynthesis
TIGRFAM ID: [TIGR00227] riboflavin-specific deaminase C-terminal domain; [TIGR00326] riboflavin biosynthesis protein RibD
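
The coordinate and length fields in this record agree with one another, and the arithmetic can be checked directly. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the stop codon is counted in the gene length but not in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminating stop codon


length = gene_length(459611, 460714)
print(length)                  # 1104
print(protein_length(length))  # 367
```

Both results match the Gene Length and Protein Length fields listed in the record.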


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGGACG AGTATTACAT GGCGCGGGCG CTAAAGCTGG CGCAACGAGG ACGTTTTACC 
ACGCATCCCA ACCCGAATGT CGGGTGCGTC ATTGTCAAAG ATGGCGAAAC TGTCGGTGAA
GGTTACCATC AACGAGCGGG TGAACCGCAT GCCGAGGTAC ACGCGTTGCG CATGGCCGGC
GAAAAAGCCA AAGGCGCAAC GGCGTATGTC ACACTCGAAC CCTGTAGCCA TCATGGTCGT
ACGCCACCGT GTTGTGATGC ATTAATTGCT GCGGGCGTGG CGCGCGTGGT TGCTGCAATG
CAAGACCCCA ATCCGCAGGT CGCTGGGCGT GGACTTTACC GTCTACAACA GGCTGGTATT
GACGTCAGCC ACGGGTTGAT GATGAGTGAA GCCGAGCAAT TGAATAAAGG CTTTCTCAAG
CGGATGCGCA CCGGCTTTCC TTATATTCAG TTAAAACTTG GCGCATCGCT TGATGGCCGC
ACGGCGATGG CGAGCGGCGA AAGCCAGTGG ATCACTTCGC CTCAGGCGCG GCGCGACGTA
CAACGACTGC GCGCACAAAG TCATGCCATT TTAACCAGCA GCGCCACGGT GCTGGCGGAT
GATCCAGCCT TAACAGTGCG TTGGTCTGAA CTGGATGAAC AAACTCAGGC GCTCTATCCG
CAACAAAATC TCCGTCAGCC GATACGTATT GTGATTGATA GCCAAAATCG CGTGACGCCG
GAACATCGCA TTGTGCAGCA GCCCGGCGAA ACTTGGTTCG CGCGTACTCA GGATGATTCT
CGTGAGTGGC CGGAAACGGT GCGTACCTTG CTGATTCCAG AGCATAAAGG TCATCTGGAT
CTGGTTGTAC TGATGATGCA ACTGGGTAAA CAGCAAATTA ACAGCATCTG GGTGGAAGCG
GGGCCAACGC TCGCTGGCGC ATTGCTACAG GCGGGGTTAG TCGATGAGCT GATTGTCTAT
ATCGCACCTA AACTATTAGG CAGCGACGCC CGTGGATTAT GCTCGCTGCC AGGGCTTGAG
AAATTAGCCG ACGCCCCCCA ATTTAAATTC AAAGAGATAC GTCATGTAGG CCCGGATGTT
TGCCTGCATT TAGTGGGTGC ATGA
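
The record does not say how its GC content value was computed. A common definition is the percentage of G and C bases among all bases in the sequence, and the sketch below assumes that definition. The helper name `gc_percent` is hypothetical. Whitespace is stripped, so sequence text in the 10-base-block layout used here can be pasted in as is.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace is ignored)."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = dna.count("G") + dna.count("C")
    return 100.0 * gc_count / len(dna)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("GTGCAGGACG"))             # 70.0
print(gc_percent("GTGCAGGACG AGTATTACAT"))  # 45.0
```

Passing the full gene sequence to the same function gives a whole-gene figure that can be compared with the GC content field in the record.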
 
Protein sequence
MQDEYYMARA LKLAQRGRFT THPNPNVGCV IVKDGETVGE GYHQRAGEPH AEVHALRMAG 
EKAKGATAYV TLEPCSHHGR TPPCCDALIA AGVARVVAAM QDPNPQVAGR GLYRLQQAGI
DVSHGLMMSE AEQLNKGFLK RMRTGFPYIQ LKLGASLDGR TAMASGESQW ITSPQARRDV
QRLRAQSHAI LTSSATVLAD DPALTVRWSE LDEQTQALYP QQNLRQPIRI VIDSQNRVTP
EHRIVQQPGE TWFARTQDDS REWPETVRTL LIPEHKGHLD LVVLMMQLGK QQINSIWVEA
GPTLAGALLQ AGLVDELIVY IAPKLLGSDA RGLCSLPGLE KLADAPQFKF KEIRHVGPDV
CLHLVGA
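
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the alternative initiation codons and is read as methionine at the start of a coding sequence. The following is a minimal standard-library sketch of that rule; the function name `translate_cds` is hypothetical, and the input is assumed to be in frame from its first base.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGCAGGACG AGTATTACAT GGCGCGGGCG"))  # MQDEYYMARA
```

The output matches the first ten residues of the protein sequence. The start-codon rule applies only to the first codon of the input, so a fragment taken from the middle of the gene should not begin with one of the listed initiation codons if a literal translation is wanted.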