Gene Information |
Locus tag | EcHS_A3806 |
Symbol | mtlA |
ID | 5593041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3798811 |
End bp | 3800724 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640922918 |
Product | PTS system, mannitol-specific IIABC component |
Protein accession | YP_001460396 |
Protein GI | 157163078 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2213] Phosphotransferase system, mannitol-specific IIBC component; [COG4668] Mannitol/fructose-specific phosphotransferase system, IIA domain |
TIGRFAM ID | [TIGR00851] PTS system, mannitol-specific IIC component |
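
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention. The function names are hypothetical.

```python
# Sketch: relate the record's coordinates to its reported lengths.
# Assumption: Start bp / End bp are 1-based and inclusive.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end positions."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(3798811, 3800724)
print(length)                     # 1914
print(protein_length_aa(length))  # 637
```

Under these assumptions the results agree with the Gene Length (1914 bp) and Protein Length (637 aa) fields.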
| |
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATCCG ATATTAAGAT CAAAGTGCAA AGCTTTGGTC GTTTCCTCAG CAACATGGTG ATGCCAAATA TCGGCGCGTT TATCGCGTGG GGTATCATCA CCGCATTATT TATTCCAACA GGGTGGTTAC CGAACGAGAC GCTGGCGAAG CTAGTCGGAC CGATGATCAC TTACCTCCTG CCGCTGCTGA TCGGTTATAC CGGTGGTAAG CTGGTAGGCG GCGAACGTGG CGGCGTAGTC GGTGCCATCA CCACCATGGG CGTTATCGTC GGCGCAGACA TGCCGATGTT CCTCGGTTCT ATGATTGCAG GTCCGCTGGG CGGCTGGTGC ATTAAGCACT TCGACCGCTG GGTAGACGGT AAGATCAAAT CCGGTTTTGA GATGCTGGTG AATAACTTCT CCGCTGGCAT CATCGGGATG ATCCTCGCCA TTCTGGCATT CCTCGGCATT GGCCCGATTG TTGAAGCCCT GTCCAAAATG CTGGCTGCGG GCGTTAACTT CATGGTTGTC CATGACATGC TGCCACTGGC ATCTATCTTT GTTGAACCGG CGAAAATCCT GTTCCTCAAC AACGCCATTA ACCACGGTAT CTTCTCGCCG CTGGGTATTC AGCAGTCCCA TGAACTGGGT AAATCCATCT TCTTCCTGAT TGAAGCTAAC CCAGGTCCAG GTATGGGCGT GCTGCTGGCG TACATGTTCT TTGGTCGTGG TAGTGCTAAA CAGTCTGCGG GCGGTGCGGC AATCATCCAC TTCCTGGGTG GTATCCACGA AATCTACTTC CCGTATGTGC TGATGAATCC GCGTCTGATC CTGGCCGTTA TCCTCGGCGG TATGACTGGC GTGTTCACGC TGACTATCCT GGGCGGTGGT CTGGTTTCTC CAGCATCTCC GGGGTCTATC CTTGCGGTAC TGGCGATGAC GCCAAAAGGT GCTTACTTCG CTAACATCGC GGGTGTGTGT GCGGCGATGG CTGTCTCCTT CGTTGTCTCT GCTATTTTGC TGAAAACCAG CAAAGTGAAA GAAGAAGATG ATATTGAAGC AGCAACTCGT CGTATGCAGG ACATGAAAGC TGAGTCTAAA GGCGCATCTC CGCTGTCTGC TGGCGATGTG ACTAACGACC TGAGCCACGT ACGTAAAATC ATCGTTGCCT GTGACGCCGG TATGGGTTCC AGTGCGATGG GCGCAGGCGT TCTGCGTAAG AAAATTCAGG ATGCAGGTCT GTCGCAGATT TCTGTTACTA ACAGCGCGAT CAACAACCTG CCGCCAGATG TGGACCTCGT CATCACTCAC CGTGACCTGA CCGAACGCGC TATGCGCCAG GTTCCGCAGG CACAGCATAT TTCGCTGACC AACTTCCTCG ACAGCGGCCT GTACACCAGC CTGACCGAAC GTCTGGTTGC TGCCCAACGC CACACGGCAA ACGAAGAGAA AGTAAAAGAC AGCCTGAAAG ACAGCTTTGA CGATTCCAGT GCTAACCTGT TCAAGCTAGG CGCGGAGAAC ATCTTCCTCG GTCGCAAAGC GGCAACCAAA GAAGAAGCGA TTCGTTTTGC TGGCGAGCAG CTGGTGAAAG GCGGTTACGT TGAGCCGGAA TACGTTCAGG CGATGCTGGA TCGTGAAAAA CTGACCCCGA CTTATCTGGG TGAGTCTATC GCGGTGCCAC ACGGTACGGT TGAAGCGAAA GATCGCGTAC TGAAAACGGG CGTCGTGTTC TGCCAGTACC CGGAAGGCGT GCGCTTCGGT GAAGAAGAAG ATGACATTGC CCGTCTGGTG ATTGGTATTG CTGCCCGTAA CAACGAGCAC 
ATTCAGGTTA TCACCAGCCT GACCAATGCA CTGGATGATG AGTCCGTCAT CGAGCGTCTG GCACACACCA CCAGCGTGGA TGAAGTGCTG GAACTGCTGG CAGGTCGTAA GTAA
|
Protein sequence | MSSDIKIKVQ SFGRFLSNMV MPNIGAFIAW GIITALFIPT GWLPNETLAK LVGPMITYLL PLLIGYTGGK LVGGERGGVV GAITTMGVIV GADMPMFLGS MIAGPLGGWC IKHFDRWVDG KIKSGFEMLV NNFSAGIIGM ILAILAFLGI GPIVEALSKM LAAGVNFMVV HDMLPLASIF VEPAKILFLN NAINHGIFSP LGIQQSHELG KSIFFLIEAN PGPGMGVLLA YMFFGRGSAK QSAGGAAIIH FLGGIHEIYF PYVLMNPRLI LAVILGGMTG VFTLTILGGG LVSPASPGSI LAVLAMTPKG AYFANIAGVC AAMAVSFVVS AILLKTSKVK EEDDIEAATR RMQDMKAESK GASPLSAGDV TNDLSHVRKI IVACDAGMGS SAMGAGVLRK KIQDAGLSQI SVTNSAINNL PPDVDLVITH RDLTERAMRQ VPQAQHISLT NFLDSGLYTS LTERLVAAQR HTANEEKVKD SLKDSFDDSS ANLFKLGAEN IFLGRKAATK EEAIRFAGEQ LVKGGYVEPE YVQAMLDREK LTPTYLGESI AVPHGTVEAK DRVLKTGVVF CQYPEGVRFG EEEDDIARLV IGIAARNNEH IQVITSLTNA LDDESVIERL AHTTSVDEVL ELLAGRK
|
| |
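
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: it strips the spacing between the 10-base blocks, computes the GC fraction, and translates codon by codon. The codon-to-residue mapping is the standard one, which translation table 11 shares; the sketch does not handle the alternative start codons that table 11 allows. All function names are hypothetical.

```python
# Sketch: clean a blocked sequence, compute GC fraction, translate a CDS.
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (same for tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
prefix = clean("ATGTCATCCG ATATTAAGAT CAAAGTGCAA")
print(translate(prefix))  # MSSDIKIKVQ
```

Passing the full gene sequence through `clean` and `translate` should reproduce the protein sequence listed in the record, and `gc_content` on the cleaned gene sequence can be compared with the GC content field.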