Gene EcolC_1278 details

Gene Information

Locus tag: EcolC_1278
Symbol:
ID: 6065937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1398379
End bp: 1399617
Gene length: 1239 bp
Protein length: 412 aa
Translation table: 11
GC content: 53%
IMG OID: 641600699
Product: manganese transport protein MntH
Protein accession: YP_001724271
Protein GI: 170019317
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1914] Mn2+ and Fe2+ transporters of the NRAMP family
TIGRFAM ID: [TIGR01197] NRAMP (natural resistance-associated macrophage protein) metal ion transporters
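
The gene and protein lengths in this record can be cross-checked against the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 1398379
END_BP = 1399617


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")
    print(protein_length(length), "aa")
```

Under these assumptions the computed values agree with the 1239 bp and 412 aa listed in the record.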


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.913398
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0884608
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAACT ATCGCGTTGA GAGTAGCAGC GGACGGGCGG CGCGCAAGAT GAGGCTCGCA 
TTAATGGGAC CTGCGTTCAT TGCGGCGATT GGTTATATCG ATCCCGGTAA CTTTGCGACC
AATATTCAGG CGGGTGCTAG CTTCGGCTAT CAGCTACTGT GGGTTGTCGT TTGGGCCAAC
CTGATGGCGA TGCTGATTCA GATCCTCTCT GCCAAACTAG GGATTGCCAC CGGTAAAAAT
CTGGCGGAGC AGATTCGCGA TCACTATCCG CGTCCCGTAG TGTGGTTCTA TTGGGTTCAG
GCAGAAATTA TTGCGATGGC AACCGACCTG GCGGAATTTA TTGGTGCGGC GATCGGTTTT
AAACTCATTC TTGGTGTTTC GTTGTTGCAG GGCGCGGTGC TGACGGGGAT CGCGACTTTC
CTGATTTTAA TGCTGCAACG TCGCGGGCAA AAACCGCTGG AGAAAGTGAT TGGCGGGTTA
CTGTTGTTTG TTGCCGCGGC TTACATTGTC GAGTTGATTT TCTCCCAGCC TAACCTGGCG
CAGCTGGGTA AAGGAATGGT GATCCCGAGT TTACCTACTT CGGAAGCGGT CTTCCTGGCA
GCAGGCGTGT TAGGGGCGAC GATTATGCCG CATGTGATTT ATTTGCACTC CTCGCTCACT
CAGCATTTAC ATGGCGGTTC GCGTCAACAA CGTTATTCCG CCACCAAATG GGATGTGGCT
ATCGCCATGA CTATTGCCGG TTTTGTCAAT CTGGCGATGA TGGCTACAGC TGCGGCGGCG
TTCCACTTTT CCGGTCATAC TGGTGTTGCC GATCTTGATG AGGCTTATCT GACGCTGCAA
CCGCTGTTAA GCCACGCTGC GGCAACGGTC TTTGGATTAA GCCTGGTTGC TGCGGGGCTG
TCTTCAACGG TGGTGGGGAC ACTGGCGGGG CAGGTGGTGA TGCAGGGCTT CATTCGCTTT
CATATCCCGC TGTGGGTGCG TCGTACAGTC ACCATGTTGC CGTCATTTAT TGTCATTCTG
ATGGGATTAG ATCCGACACG GATTCTGGTT ATGAGTCAGG TACTGTTAAG TTTTGGTATC
GCTCTGGCGC TGGTTCCACT GCTGATTTTC ACCAGTGACA GCAAGTTGAT GGGCGATCTG
GTGAACAGCA AACGCGTAAA ACAGACAGGC TGGGTGATTG TGGTGCTGGT CGTGGCGCTG
AATATCTGGT TGTTGGTGGG GACGGCGCTG GGATTGTAG
 
Protein sequence
MTNYRVESSS GRAARKMRLA LMGPAFIAAI GYIDPGNFAT NIQAGASFGY QLLWVVVWAN 
LMAMLIQILS AKLGIATGKN LAEQIRDHYP RPVVWFYWVQ AEIIAMATDL AEFIGAAIGF
KLILGVSLLQ GAVLTGIATF LILMLQRRGQ KPLEKVIGGL LLFVAAAYIV ELIFSQPNLA
QLGKGMVIPS LPTSEAVFLA AGVLGATIMP HVIYLHSSLT QHLHGGSRQQ RYSATKWDVA
IAMTIAGFVN LAMMATAAAA FHFSGHTGVA DLDEAYLTLQ PLLSHAAATV FGLSLVAAGL
SSTVVGTLAG QVVMQGFIRF HIPLWVRRTV TMLPSFIVIL MGLDPTRILV MSQVLLSFGI
ALALVPLLIF TSDSKLMGDL VNSKRVKQTG WVIVVLVVAL NIWLLVGTAL GL
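
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool used to produce the record: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). Only the first 30 bases and the last 9 bases of the gene sequence are used here, copied from the record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Trailing bases that do not fill a complete codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_30 = "ATGACGAACT ATCGCGTTGA GAGTAGCAGC"  # start of the gene sequence
    last_9 = "GGATTGTAG"                            # end of the gene sequence
    print(translate(first_30))
    print(translate(last_9))
```

The two fragments translate to the first ten and last two residues of the protein sequence listed above.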