Gene EcSMS35_3585 details


Gene Information

Locus tag: EcSMS35_3585
Symbol: trkA
ID: 6143400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3663753
End bp: 3665129
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 50%
IMG OID: 641618412
Product: potassium transporter peripheral membrane component
Protein accession: YP_001745552
Protein GI: 170680013
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0569] K+ transport systems, NAD-binding component
TIGRFAM ID:
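
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3663753
end_bp = 3665129
gene_length_bp = 1377
protein_length_aa = 458


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1377
print(expected_protein_length(gene_length_bp))  # 458
```

Under those assumptions both derived values agree with the Gene Length and Protein Length fields.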


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.931761
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.13042
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTA TCATTCTGGG TGCCGGCCAG GTTGGCGGCA CACTGGCGGA AAACCTGGTT 
GGCGAGAACA ACGATATCAC TGTTGTCGAT ACCAACGGTG AGCGTCTGCG GACCTTGCAG
GATAAATTTG ACCTGCGAGT CGTGCAGGGG CATGGCTCTC ATCCACGCGT ATTGCGGGAG
GCAGGTGCCG ACGACGCCGA TATGCTGGTT GCTGTAACCA GTTCAGATGA AACCAATATG
GTTGCCTGCC AGGTAGCCTA CTCACTTTTC AACACCCCTA ATCGCATCGC TCGTATCCGC
TCACCAGACT ACGTGCGCGA TGCCGATAAG CTATTTCATT CAGATGCTGT ACCGATTGAT
CATCTGATCG CACCAGAGCA GTTGGTTATC GATAATATTT ACCGACTGAT TGAGTATCCC
GGCGCATTGC AGGTGGTAAA CTTCGCTGAG GGTAAAGTCA GCCTGGCTGT GGTTAAAGCC
TATTACGGCG GCCCGCTGAT TGGTAATGCA CTTTCGACCA TGCGCGAACA TATGCCACAT
ATCGATACTC GTGTGGCAGC AATTTTCCGC CATGATCGCC CCATTCGTCC GCAAGGTTCG
ACCATTGTTG AAGCTGGTGA TGAAGTGTTC TTTATTGCCG CTTCACAGCA TATCCGCGCG
GTGATGAGTG AATTACAGCG ACTGGAAAAA CCGTATAAGC GGATCATGCT GGTTGGTGGT
GGTAATATCG GTGCAGGGCT GGCGCGTCGT CTGGAAAAAG ATTACAGCGT CAAACTCATC
GAACGTAATC AGCAGCGCGC TGCCGAACTG GCGGAAAAGT TGCAGAATAC GATCGTCTTT
TTTGGTGATG CGTCGGATCA AGAACTGCTG GCCGAAGAAC ATATCGATCA AGTTGATCTG
TTTATTGCTG TCACCAACGA TGACGAAGCC AATATCATGT CCGCCATGCT TGCCAAACGT
ATGGGTGCGA AAAAGGTGAT GGTATTGATC CAGCGTCGCG CTTATGTGGA TCTGGTTCAG
GGGAGCGTGA TCGATATTGC GATTTCACCG CAACAAGCAA CTATTTCTGC GTTGCTTAGC
CATGTGCGAA AAGCTGATAT TGTTGGTGTT TCCTCACTGC GCCGCGGCGT AGCAGAAGCT
ATTGAAGCCG TTGCTCACGG TGATGAAAGC ACCTCACGCG TTGTCGGCAG AGTCATTGAC
GAAATCAAGC TACCGCCAGG AACGATTATT GGAGCGGTGG TACGTGGAAA CGATGTGATG
ATTGCCAATG ACAATCTGCG CATTGAGCAA GGCGATCACG TAATTATGTT CCTCACAGAT
AAAAAGTTTA TTACCGACGT CGAAAGACTC TTCCAGCCAA GTCCTTTCTT CTTGTAA
 
Protein sequence
MKIIILGAGQ VGGTLAENLV GENNDITVVD TNGERLRTLQ DKFDLRVVQG HGSHPRVLRE 
AGADDADMLV AVTSSDETNM VACQVAYSLF NTPNRIARIR SPDYVRDADK LFHSDAVPID
HLIAPEQLVI DNIYRLIEYP GALQVVNFAE GKVSLAVVKA YYGGPLIGNA LSTMREHMPH
IDTRVAAIFR HDRPIRPQGS TIVEAGDEVF FIAASQHIRA VMSELQRLEK PYKRIMLVGG
GNIGAGLARR LEKDYSVKLI ERNQQRAAEL AEKLQNTIVF FGDASDQELL AEEHIDQVDL
FIAVTNDDEA NIMSAMLAKR MGAKKVMVLI QRRAYVDLVQ GSVIDIAISP QQATISALLS
HVRKADIVGV SSLRRGVAEA IEAVAHGDES TSRVVGRVID EIKLPPGTII GAVVRGNDVM
IANDNLRIEQ GDHVIMFLTD KKFITDVERL FQPSPFFL
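
The gene and protein sequences can be related through the translation table named in the record (table 11). The following is a minimal sketch, not the database's own method. It assumes the amino-acid assignments of table 11 match the standard genetic code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate` and `gc_percent` are hypothetical. Whitespace is stripped first because the sequence is displayed in blocks of ten.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last nine bases of the gene sequence above.
print(translate("ATGAAAATTA TCATTCTGGG TGCCGGCCAG"))  # MKIIILGAGQ
print(translate("TTCTTGTAA"))                         # FL
```

The two printed fragments match the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole record to be checked in the same way.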