Gene EcSMS35_2113 details

Gene Information

Locus tag: EcSMS35_2113
Symbol:
ID: 6144746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2122852
End bp: 2123943
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 55%
IMG OID: 641616989
Product: putative monooxygenase rutA
Protein accession: YP_001744164
Protein GI: 170680078
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03612] pyrimidine utilization protein A
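
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(2122852, 2123943)
print(length)                     # 1092
print(protein_length_aa(length))  # 363
```

Both results agree with the Gene Length and Protein Length fields of this record.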


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTG GCGTATTCGT ACCTATTGGC AACAACGGCT GGCTCATTTC GACCCACGCG 
CCGCAGTACA TGCCGACCTT TGAACTGAAT AAAGCCATTG TGCAAAAAGC GGAGCACTAC
CATTTCGACT TCGCCCTGTC GATGATCAAA CTGCGTGGCT TTGGCGGCAA AACTGAGTTC
TGGGATCACA ACCTTGAGTC GTTCACCTTG ATGGCGGGTC TGGCAGCCGT GACCTCACGC
ATTCAGATTT ACGCGACAGC TGCCACCTTA ACGTTACCTC CGGCAATCGT CGCCCGTATG
GCCGCAACCA TCGACTCCAT CTCTGGCGGG CGTTTTGGCG TCAACCTCGT GACTGGCTGG
CAAAAGCCCG AGTATGAGCA GATGGGGATC TGGCCTGGCG ATGACTATTT CTCCCGTCGT
TACGACTATC TCACCGAATA TGTTCAGGTG CTGCGCGACC TGTGGGGCTC GGGGAAAAGC
GATTTTAAAG GCGATTTTTT CACTATGGAT GATTGTCGCG TCAGTCCGCA ACCGAGTGTC
CCCATGAAAG TGATCTGCGC CGGGCAAAGC GACGCTGGCA TGGCGTTCTC CGCCCGGTAT
GCCGATTTTA ACTTCTGTTT CGGCAAAGGC GTAAATACAC CCACGGCTTT CGCCCCGACC
GCTGCGCGGA TGAAACAGGC CGCAGAGCAA ACCGGACGCG ACGTTGGCTC TTATGTGTTG
TTTATGGTGA TTGCCGACGA AACCGACGAT GCCGCTCGTG CAAAATGGGA ACACTACAAA
GCGGGCGCGG ATGAAGAGGC CTTAAGCTGG CTAACCGAAC AAAGTCAGAA AGATACCCGC
TCCGGTACAG ACACCAACGT CCGTCAGATG GCCGATCCCA CTTCGGCGGT AAACATCAAT
ATGGGGACGT TAGTCGGTTC TTACGCCAGT GTCGCGCGCA TGTTAGATGA AGTCGCAACC
GTGCCTGGTG CCGAAGGCGT GCTGTTGACC TTCGATGATT TTCTGTCGGG AATCGAAAAC
TTCGGCGAGC GCATTCAACC ACTGATGCAG TGCCGCGCCC ATCTCCCTGC GCTGACTCAG
GAGGTGGCAT GA
 
Protein sequence
MKIGVFVPIG NNGWLISTHA PQYMPTFELN KAIVQKAEHY HFDFALSMIK LRGFGGKTEF 
WDHNLESFTL MAGLAAVTSR IQIYATAATL TLPPAIVARM AATIDSISGG RFGVNLVTGW
QKPEYEQMGI WPGDDYFSRR YDYLTEYVQV LRDLWGSGKS DFKGDFFTMD DCRVSPQPSV
PMKVICAGQS DAGMAFSARY ADFNFCFGKG VNTPTAFAPT AARMKQAAEQ TGRDVGSYVL
FMVIADETDD AARAKWEHYK AGADEEALSW LTEQSQKDTR SGTDTNVRQM ADPTSAVNIN
MGTLVGSYAS VARMLDEVAT VPGAEGVLLT FDDFLSGIEN FGERIQPLMQ CRAHLPALTQ
EVA
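
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the tool that generated this record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Whitespace in the 10-base groups shown above is stripped before processing.

```python
# Codon table in the conventional NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not converted to M here;
    this gene starts with ATG, so that special case does not arise.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAAAATTG GCGTATTCGT ACCTATTGGC"))  # MKIGVFVPIG
print(translate("GAGGTGGCAT GA"))                     # EVA
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the protein sequence and the GC content field listed in this record.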