Gene EcSMS35_2543 details

Gene Information

Locus tag: EcSMS35_2543
Symbol: mntH
ID: 6147533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2602202
End bp: 2603440
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 53%
IMG OID: 641617415
Product: manganese transport protein MntH
Protein accession: YP_001744586
Protein GI: 170683992
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1914] Mn2+ and Fe2+ transporters of the NRAMP family
TIGRFAM ID: [TIGR01197] NRAMP (natural resistance-associated macrophage protein) metal ion transporters
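
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single trailing stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(2602202, 2603440)
print(gene_len)                  # 1239
print(protein_length(gene_len))  # 412
```

Both results agree with the Gene Length and Protein Length fields of this record.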


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.630875
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTAACT ATCGCGTTGA GAGTAGCAGC GGACGGGCGG CGCGCAAGAT GAGGCTCGCA 
TTAATGGGAC CTGCGTTCAT TGCGGCGATT GGTTATATCG ATCCTGGTAA CTTTGCGACC
AATATTCAGG CCGGGGCCAG CTTCGGCTAT CAGCTACTGT GGGTTGTCGT TTGGGCCAAC
CTGATGGCGA TGCTGATTCA GATACTGTCT GCCAAACTGG GGATTGCCAC CGGTAAAAAC
CTGGCGGAGC AGATTCGCGA TCACTATCCG CGTCCCGTAG TGTGGTTCTA TTGGGTTCAG
GCAGAAATCA TTGCGATGGC AACCGACCTG GCAGAATTTA TTGGTGCGGC GATCGGTTTT
AAACTCATTC TGGGTGTTTC GCTGTTGCAG GGCGCGGTGC TGACGGGGAT CGCGACTTTC
CTGATTTTAA TGCTGCAACG TCGAGGGCAA AAACCGCTGG AGAAAGTAAT TGGCGGGTTA
CTGCTGTTTG TTGCTGCGGC TTACATTGTC GAGTTGATTT TCTCCCAACC TAACCTGGCG
CAACTGGGTA AAGGAATGGT AATCCCGAGT TTACCCACCT CGGAGGCAGT CTTCCTGGCG
GCAGGTGTGT TGGGGGCGAC GATTATGCCG CACGTGATTT ATTTGCACTC TTCGCTGACT
CAGCATTTAC ATGGTGGTTC GCGTCAACAA CGTTATTCCG CCACCAAATG GGATGTGGCT
ATCGCCATGA CGATTGCCGG TTTTGTCAAT CTGGCGATGA TGGCTACGGC TGCGGCGGCG
TTCCACTTTT CCGGTCATAC TGGCGTTGCC GATCTTGATG AGGCTTATCT GACGCTGCAA
CCGTTGTTAA GCCATGCTGC GGCAACGGTT TTTGGTTTAA GTCTGGTGGC TGCGGGGCTT
TCGTCGACGG TGGTGGGGAC ACTGGCGGGG CAGGTAGTGA TGCAGGGCTT CATTCGTTTT
CATATCCCGC TGTGGGTGCG TCGCACTGTT ACCATGTTGC CGTCATTTAT TGTCATTCTG
ATGGGATTAG ATCCGACACG GATTCTGGTT ATGAGTCAGG TGCTGTTAAG TTTTGGTATC
GCTCTGGCGC TGGTTCCACT ACTAATTTTT ACCAGTGACA GTAAGTTGAT GGGCGATCTG
GTGAACAGCA AACGCGTAAA ACAGACAGGC TGGGTGATTG TGGTGCTGGT AGTGGCGCTG
AATATTTGGT TGTTAGTGGG AACGGCACTG GGATTGTAG
 
Protein sequence
MTNYRVESSS GRAARKMRLA LMGPAFIAAI GYIDPGNFAT NIQAGASFGY QLLWVVVWAN 
LMAMLIQILS AKLGIATGKN LAEQIRDHYP RPVVWFYWVQ AEIIAMATDL AEFIGAAIGF
KLILGVSLLQ GAVLTGIATF LILMLQRRGQ KPLEKVIGGL LLFVAAAYIV ELIFSQPNLA
QLGKGMVIPS LPTSEAVFLA AGVLGATIMP HVIYLHSSLT QHLHGGSRQQ RYSATKWDVA
IAMTIAGFVN LAMMATAAAA FHFSGHTGVA DLDEAYLTLQ PLLSHAAATV FGLSLVAAGL
SSTVVGTLAG QVVMQGFIRF HIPLWVRRTV TMLPSFIVIL MGLDPTRILV MSQVLLSFGI
ALALVPLLIF TSDSKLMGDL VNSKRVKQTG WVIVVLVVAL NIWLLVGTAL GL
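
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It uses only the standard library; the helper names (`clean`, `gc_content`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; table 11 shares these assignments
# with the standard genetic code ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record
fragment = clean("ATGACTAACT ATCGCGTTGA GAGTAGCAGC GGACGGGCGG CGCGCAAGAT GAGGCTCGCA")
print(translate(fragment))  # MTNYRVESSSGRAARKMRLA
```

The printed residues match the first two blocks of the protein sequence. Applying the same functions to the full gene sequence allows the result to be compared against the listed protein sequence and the 53% GC content.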