Gene ECD_02301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02301 
SymbolmntH 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2389354 
End bp2390592 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content53% 
IMG OID 
Productmanganese transport protein MntH 
Protein accessionACT44122 
Protein GI253978452 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAACT ATCGCGTTGA GAGTAGCAGC GGACGGGCGG CGCGCAAGAC GAGGCTCGCA 
TTAATGGGAC CTGCGTTCAT TGCGGCGATT GGTTATATCG ATCCCGGTAA CTTTGCGACC
AATATTCAGG CGGGTGCCAG CTTCGGCTAT CAGCTCCTGT GGGTTGTCGT TTGGGCCAAC
CTGATGGCGA TGCTGATTCA GATCCTCTCT GCCAAACTAG GGATTGCCAC CGGTAAAAAT
CTGGCGGAGC AAATTCGCGA TCACTATCCG CGTCCCGTAG TGTGGTTCTA TTGGGTTCAG
GCAGAAATTA TTGCGATGGC AACCGACCTG GCGGAATTTA TTGGTGCGGC GATCGGTTTT
AAACTCATTC TTGGTGTTTC GTTGTTGCAA GGTGCGGTGC TGACGGGGAT CGCGACTTTC
CTGATTTTAA TGCTGCAACG TCGCGGGCAA AAACCGCTGG AGAAAGTGAT TGGCGGGTTA
CTGTTGTTTG TTGCCGCGGC TTACATTGTC GAGTTGATTT TCTCCCAGCC TAACCTGGCG
CAGCTGGGTA AAGGAATGGT GATCCCGAGT TTACCTACTT CGGAAGCGGT CTTCCTAGCA
GCAGGCGTGT TAGGGGCGAC GATTATGCCG CATGTGATTT ATTTGCACTC TTCGCTCACT
CAGCATTTAC ATGGCGGTTC GCGTCAACAA CGTTATTCCG CCACCAAATG GGATGTGGCT
ATCGCCATGA CTATTGCCGG TTTTGTCAAT CTGGCGATGA TGGCTACAGC TGCGGCGGCG
TTCCACTTTT CTGGTCATAC TGGTGTTGCC GATCTTGATG AGGCATATCT GACGCTGCAA
CCGTTGTTAA GCCATGCTGC GGCAACGGTC TTTGGATTAA GCCTGGTTGC TGCCGGACTG
TCCTCAACGG TGGTGGGGAC ACTGGCGGGG CAGGTGGTGA TGCAGGGGTT CATTCGCTTC
CATATCCCGC TGTGGGTGCG TCGTACAGTC ACCATGTTGC CGTCATTTAT TGTCATTCTG
ATGGGATTAG ATCCGACACG GATTCTGGTT ATGAGTCAGG TGCTGTTAAG TTTTGGTATC
GCCCTGGCGC TGGTTCCACT GCTGATTTTC ACCAGTGACA GCAAGTTGAT GGGCGATCTG
GTGAACAGCA AACGCGTAAA ACAGACAGGC TGGGTGATTG TGGTGCTGGT AGTGGCGCTG
AATATCTGGT TGTTGGTGGG TACGGCACTG GGATTGTAG
 
Protein sequence
MTNYRVESSS GRAARKTRLA LMGPAFIAAI GYIDPGNFAT NIQAGASFGY QLLWVVVWAN 
LMAMLIQILS AKLGIATGKN LAEQIRDHYP RPVVWFYWVQ AEIIAMATDL AEFIGAAIGF
KLILGVSLLQ GAVLTGIATF LILMLQRRGQ KPLEKVIGGL LLFVAAAYIV ELIFSQPNLA
QLGKGMVIPS LPTSEAVFLA AGVLGATIMP HVIYLHSSLT QHLHGGSRQQ RYSATKWDVA
IAMTIAGFVN LAMMATAAAA FHFSGHTGVA DLDEAYLTLQ PLLSHAAATV FGLSLVAAGL
SSTVVGTLAG QVVMQGFIRF HIPLWVRRTV TMLPSFIVIL MGLDPTRILV MSQVLLSFGI
ALALVPLLIF TSDSKLMGDL VNSKRVKQTG WVIVVLVVAL NIWLLVGTAL GL