Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2953 |
Symbol | |
ID | 6145614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3028336 |
End bp | 3029136 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617822 |
Product | ThiF family protein |
Protein accession | YP_001744977 |
Protein GI | 170680315 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1179] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0973303 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTAATTA GTGATGCATG GCGTCAGCGT TTTGGTGGCA CAGCGCGTCT GTATGGTGAA AAAGCATTGC AACTGTTTGC TGACGCGCAT ATTTGTGTGG TTGGTATCGG TGGTGTCGGT TCCTGGGCGG CGGAAGCGCT GGCGCGCACG GGGATTGGCT CAATCACGCT TATCGATATG GATGATGTGT GCGTCACCAA TACCAATCGG CAAATTCATG CCCTGCGCGA TAACGTTGGG CTGGCAAAAG CGGAAGTTAT GGCGGAGCGT ATTCGCCAGA TTAACCCGGA GTGCCGTGTA ACGGTGGTGG ATGATTTCGT GACGCCGGAT AACGTAGCGC AGTATATGAG CGTGGGTTAT TCGTACGTGA TTGATGCCAT TGATAGTGTA CGGCCCAAAG CGGCGCTGAT TGCTTATTGT CGGCGCAATA AAATCCCACT GGTCACAACC GGTGGCGCGG GTGGGCAGAT TGACCCGACG CAGATTCAGG TTACCGATCT GGCGAAAACG ATTCAGGACC CGCTGGCGGC GAAGTTGCGT GAGCGCCTGA AAAGCGATTT TGGCATAGTG AAAAACAGTA AAGGTAAGCT CGGCGTGGAT TGCGTGTTCT CTACCGAAGC GCTGGTGTAC CCGCAGTCAG ATGGCACGGT CTGTGCAATG AAAGCTACGG CGGAAGGGCC AAAGCGGATG GATTGTGCAT CTGGATTTGG TGCCGCGACG ATGGTGACCG CCACCTTTGG TTTTGTTGCG GTTTCTCATG CGCTGAAGAA GATGATGGCG AAAGCGGCGC GTCAGGGGTA A
|
Protein sequence | MVISDAWRQR FGGTARLYGE KALQLFADAH ICVVGIGGVG SWAAEALART GIGSITLIDM DDVCVTNTNR QIHALRDNVG LAKAEVMAER IRQINPECRV TVVDDFVTPD NVAQYMSVGY SYVIDAIDSV RPKAALIAYC RRNKIPLVTT GGAGGQIDPT QIQVTDLAKT IQDPLAAKLR ERLKSDFGIV KNSKGKLGVD CVFSTEALVY PQSDGTVCAM KATAEGPKRM DCASGFGAAT MVTATFGFVA VSHALKKMMA KAARQG
|
| |