Gene Information |
Locus tag | EcSMS35_2113 |
Symbol | |
ID | 6144746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2122852 |
End bp | 2123943 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616989 |
Product | putative monooxygenase rutA |
Protein accession | YP_001744164 |
Protein GI | 170680078 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03612] pyrimidine utilization protein A |
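The coordinate and length fields above are linked by simple arithmetic. The following is a minimal Python sketch of that check, assuming 1-based inclusive coordinates (consistent with the values listed) and a single terminal stop codon on an unspliced CDS. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length(2122852, 2123943)
print(length, protein_length(length))  # 1092 363
```

Both results agree with the Gene Length and Protein Length fields in the record.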
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTG GCGTATTCGT ACCTATTGGC AACAACGGCT GGCTCATTTC GACCCACGCG CCGCAGTACA TGCCGACCTT TGAACTGAAT AAAGCCATTG TGCAAAAAGC GGAGCACTAC CATTTCGACT TCGCCCTGTC GATGATCAAA CTGCGTGGCT TTGGCGGCAA AACTGAGTTC TGGGATCACA ACCTTGAGTC GTTCACCTTG ATGGCGGGTC TGGCAGCCGT GACCTCACGC ATTCAGATTT ACGCGACAGC TGCCACCTTA ACGTTACCTC CGGCAATCGT CGCCCGTATG GCCGCAACCA TCGACTCCAT CTCTGGCGGG CGTTTTGGCG TCAACCTCGT GACTGGCTGG CAAAAGCCCG AGTATGAGCA GATGGGGATC TGGCCTGGCG ATGACTATTT CTCCCGTCGT TACGACTATC TCACCGAATA TGTTCAGGTG CTGCGCGACC TGTGGGGCTC GGGGAAAAGC GATTTTAAAG GCGATTTTTT CACTATGGAT GATTGTCGCG TCAGTCCGCA ACCGAGTGTC CCCATGAAAG TGATCTGCGC CGGGCAAAGC GACGCTGGCA TGGCGTTCTC CGCCCGGTAT GCCGATTTTA ACTTCTGTTT CGGCAAAGGC GTAAATACAC CCACGGCTTT CGCCCCGACC GCTGCGCGGA TGAAACAGGC CGCAGAGCAA ACCGGACGCG ACGTTGGCTC TTATGTGTTG TTTATGGTGA TTGCCGACGA AACCGACGAT GCCGCTCGTG CAAAATGGGA ACACTACAAA GCGGGCGCGG ATGAAGAGGC CTTAAGCTGG CTAACCGAAC AAAGTCAGAA AGATACCCGC TCCGGTACAG ACACCAACGT CCGTCAGATG GCCGATCCCA CTTCGGCGGT AAACATCAAT ATGGGGACGT TAGTCGGTTC TTACGCCAGT GTCGCGCGCA TGTTAGATGA AGTCGCAACC GTGCCTGGTG CCGAAGGCGT GCTGTTGACC TTCGATGATT TTCTGTCGGG AATCGAAAAC TTCGGCGAGC GCATTCAACC ACTGATGCAG TGCCGCGCCC ATCTCCCTGC GCTGACTCAG GAGGTGGCAT GA
|
Protein sequence | MKIGVFVPIG NNGWLISTHA PQYMPTFELN KAIVQKAEHY HFDFALSMIK LRGFGGKTEF WDHNLESFTL MAGLAAVTSR IQIYATAATL TLPPAIVARM AATIDSISGG RFGVNLVTGW QKPEYEQMGI WPGDDYFSRR YDYLTEYVQV LRDLWGSGKS DFKGDFFTMD DCRVSPQPSV PMKVICAGQS DAGMAFSARY ADFNFCFGKG VNTPTAFAPT AARMKQAAEQ TGRDVGSYVL FMVIADETDD AARAKWEHYK AGADEEALSW LTEQSQKDTR SGTDTNVRQM ADPTSAVNIN MGTLVGSYAS VARMLDEVAT VPGAEGVLLT FDDFLSGIEN FGERIQPLMQ CRAHLPALTQ EVA
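The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below uses only the Python standard library and assumes the sequence is pasted in the 10-base blocks shown in the record. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon-to-amino-acid assignments are those of translation table 11; the sketch does not handle alternative start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (translation table 11
# uses the same codon assignments as the standard code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence from the record.
print(translate("ATGAAAATTG GCGTATTCGT"))  # MKIGVF
```

Passing the full gene sequence to `translate` and `gc_fraction` should allow a comparison with the Protein sequence and GC content fields. The full-length results are not asserted here.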