Gene EcolC_3138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3138 
Symbol 
ID6066417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3438322 
End bp3439998 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content55% 
IMG OID641602554 
Productputative cation:proton antiport protein 
Protein accessionYP_001726088 
Protein GI170021134 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4651] Kef-type K+ transport system, predicted NAD-binding component 
TIGRFAM ID[TIGR00932] transporter, monovalent cation:proton antiporter-2 (CPA2) family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000511667 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATCACG CCACCCCGCT TATCACCACC ATTGTTGGCG GCCTTGTGCT CGCCTTTATC 
CTCGGCATGC TGGCCAATAA ACTACGTATT TCTCCTCTGG TGGGATATCT GTTAGCGGGT
GTGCTGGCAG GACCATTCAC TCCGGGCTTT GTTGCCGATA CCAAGCTTGC GCCGGAACTG
GCTGAACTGG GCGTCATTCT GTTGATGTTT GGCGTCGGTT TGCACTTTTC GCTGAAGGAT
TTGATGGCGG TAAAGGCCAT CGCCATTCCC GGTGCGATCG CCCAGATAGC CGTGGCGACG
CTGCTGGGTA TGGCGCTCTC TGCCGTGCTG GGCTGGTCGT TAATGACCGG TATCGTGTTC
GGTTTATGTC TTTCCACCGC CAGTACCGTG GTGTTACTGC GCGCACTTGA AGAACGGCAA
TTAATTGACA GTCAGCGTGG GCAAATCGCC ATCGGTTGGT TGATTGTGGA AGACCTGGTA
ATGGTTCTGA CGCTGGTGTT GCTGCCCGCA GTGGCAGGAA TGATGGAACA GGGCGATGTG
GGCTTTGCCA CTCTTGCAGT CGATATGGGG ATCACCATCG GCAAAGTGAT CGCATTTATC
GCCATTATGA TGCTGGTAGG TCGCCGTCTG GTGCCGTGGA TTATGGCACG CAGCGCGGCA
ACCGGTTCTC GCGAGCTGTT TACCCTGTCG GTGCTGGCGC TGGCGTTAGG GGTTGCCTTT
GGTGCGGTAG AGCTGTTTGA TGTCTCCTTT GCACTCGGTG CGTTCTTTGC CGGGATGGTA
CTGAACGAGT CTGAACTGAG TCACCGTGCC GCCCACGATA CGCTGCCATT GCGCGACGCG
TTTGCGGTGC TGTTTTTTGT CTCCGTCGGG ATGTTGTTTG ATCCGTTAAT TCTGATTCAG
CAACCGCTGG CAGTGCTGGC GACGCTGGCG ATTATTCTGT TTGGTAAGTC GTTAGCCGCA
TTTTTCCTGG TGCGACTGTT TGGTCACTCC CAACGTACGG CATTAACCAT CGCCGCCAGC
CTGGCGCAGA TTGGTGAGTT CGCGTTTATC CTAGCGGGAC TGGGAATGGC ATTGAATTTA
CTGCCGCAGG CCGGACAAAA CCTGGTACTG GCAGGGGCGA TCCTGTCGAT TATGCTCAAC
CCGGTACTGT TCGCACTACT GGAGAAATAT CTGGCGAAGA CCGAAACGCT GGAAGAGCAG
ACGCTGGAAG AGGCAATCGA AGAAGAGAAG CAGATCCCAG TGGATATTTG CAACCATGCG
CTACTGGTGG GTTACGGTCG TGTAGGCAGC CTGCTGGGGG AGAAATTGCT CGCCTCTGAT
ATTCCGCTGG TGGTGATTGA GACGTCACGA ACCCGTGTTG ATGAGCTGCG AGAGCGCGGG
GTCCGCGCAG TATTGGGCAA TGCGGCGAAC GAAGAAATTA TGCAACTGGC GCATCTGGAA
TGTGCAAAAT GGCTGATCCT GACGATTCCC AACGGTTATG AAGCGGGTGA GATTGTGGCA
TCTGCCCGCG CGAAAAATCC GGATATTGAG ATTATTGCCC GCGCCCATTA TGACGATGAA
GTGGCGTATA TCACCGAACG TGGTGCGAAT CAGGTAGTGA TGGGCGAGCG TGAAATCGCC
CGTACCATGC TGGAACTGCT GGAAACGCCA CCGGCGGGTG AGGTGGTGAC GGGGTAA
 
Protein sequence
MHHATPLITT IVGGLVLAFI LGMLANKLRI SPLVGYLLAG VLAGPFTPGF VADTKLAPEL 
AELGVILLMF GVGLHFSLKD LMAVKAIAIP GAIAQIAVAT LLGMALSAVL GWSLMTGIVF
GLCLSTASTV VLLRALEERQ LIDSQRGQIA IGWLIVEDLV MVLTLVLLPA VAGMMEQGDV
GFATLAVDMG ITIGKVIAFI AIMMLVGRRL VPWIMARSAA TGSRELFTLS VLALALGVAF
GAVELFDVSF ALGAFFAGMV LNESELSHRA AHDTLPLRDA FAVLFFVSVG MLFDPLILIQ
QPLAVLATLA IILFGKSLAA FFLVRLFGHS QRTALTIAAS LAQIGEFAFI LAGLGMALNL
LPQAGQNLVL AGAILSIMLN PVLFALLEKY LAKTETLEEQ TLEEAIEEEK QIPVDICNHA
LLVGYGRVGS LLGEKLLASD IPLVVIETSR TRVDELRERG VRAVLGNAAN EEIMQLAHLE
CAKWLILTIP NGYEAGEIVA SARAKNPDIE IIARAHYDDE VAYITERGAN QVVMGEREIA
RTMLELLETP PAGEVVTG