Gene EcHS_A4041 details


Gene Information

Locus tag: EcHS_A4041
Symbol: corA
ID: 5593301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4031903
End bp: 4032853
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 53%
IMG OID: 640923145
Product: magnesium/nickel/cobalt transporter CorA
Protein accession: YP_001460611
Protein GI: 157163293
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0598] Mg2+ and Co2+ transporters
TIGRFAM ID: [TIGR00383] magnesium Mg(2+) and cobalt Co(2+) transport protein (corA)
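
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon. The function names are illustrative only and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4031903, 4032853)
print(length, protein_length(length))  # 951 316
```

Under these assumptions the computed values agree with the Gene Length (951 bp) and Protein Length (316 aa) fields.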


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAGCG CATTTCAACT GGAAAATAAC CGACTGACCC GGCTGGAAGT CGAAGAGTCA 
CAACCCCTTG TAAATGCAGT ATGGATTGAT CTTGTCGAAC CGGACGACGA CGAGCGACTG
CGCGTACAAT CTGAACTTGG TCAGAGCCTG GCAACCCGCC CGGAACTGGA AGACATCGAA
GCATCGGCAC GTTTCTTTGA AGACGACGAC GGCCTGCATA TTCACTCCTT CTTCTTCTTT
GAAGATGCGG AAGATCACGC CGGTAACTCC ACTGTGGCAT TTACCATCCG TGATGGTCGT
CTGTTTACTC TGCGTGAGCG TGAACTGCCC GCTTTTCGTC TGTATCGTAT GCGTGCCCGT
AGCCAGTCGA TGGTGGACGG TAACGCCTAC GAGCTGCTGC TGGATCTGTT CGAAACCAAA
ATCGAACAGT TGGCAGATGA AATTGAAAAT ATCTATAGCG ACCTGGAGCA GTTGAGCCGG
GTGATTATGG AAGGGCATCA GGGCGATGAG TACGACGAGG CGCTCTCCAC TCTGGCGGAA
CTGGAAGATA TCGGCTGGAA AGTGCGCCTG TGTCTGATGG ATACCCAGCG TGCGCTCAAC
TTCCTGGTGC GTAAAGCGCG TTTACCGGGT GGGCAACTGG AGCAGGCGCG TGAAATCCTG
CGAGATATCG AATCCCTGCT GCCGCATAAC GAATCCCTGT TCCAGAAGGT GAACTTCCTG
ATGCAGGCGG CAATGGGTTT TATCAACATC GAGCAGAACC GCATCATCAA AATTTTCTCG
GTGGTATCCG TCGTGTTCCT GCCGCCAACG CTGGTTGCCT CCAGCTACGG GATGAACTTT
GAGTTTATGC CAGAACTGAA GTGGAGTTTC GGCTACCCAG GCGCGATTAT CTTTATGATC
CTCGCGGGCC TGGCACCGTA TCTGTACTTT AAGCGGAAGA ACTGGTTGTA A
 
Protein sequence
MLSAFQLENN RLTRLEVEES QPLVNAVWID LVEPDDDERL RVQSELGQSL ATRPELEDIE 
ASARFFEDDD GLHIHSFFFF EDAEDHAGNS TVAFTIRDGR LFTLRERELP AFRLYRMRAR
SQSMVDGNAY ELLLDLFETK IEQLADEIEN IYSDLEQLSR VIMEGHQGDE YDEALSTLAE
LEDIGWKVRL CLMDTQRALN FLVRKARLPG GQLEQAREIL RDIESLLPHN ESLFQKVNFL
MQAAMGFINI EQNRIIKIFS VVSVVFLPPT LVASSYGMNF EFMPELKWSF GYPGAIIFMI
LAGLAPYLYF KRKNWL
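
The sequences above are printed in blocks of ten characters. The following is a rough Python sketch of how such a block could be cleaned, translated, and checked for GC content, using only the standard library. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which start codons are permitted) and that the gene begins with ATG, as this one does. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCTGAGCG CATTTCAACT GGAAAATAAC"))  # MLSAFQLENN
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two blocks are consistent.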