Gene EcSMS35_4115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4115 
SymboltrkD 
ID6143844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4211115 
End bp4212983 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content52% 
IMG OID641618939 
Productpotassium transport protein Kup 
Protein accessionYP_001746077 
Protein GI170682987 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3158] K+ transporter 
TIGRFAM ID[TIGR00794] potassium uptake protein 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.133962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTG ATAATAAGCA ATCATTGCCC GCGATTACCC TCGCGGCGAT TGGAGTTGTC 
TACGGCGATA TTGGTACCAG CCCGTTATAT ACACTTCGTG AATGTTTGTC CGGCCAGTTT
GGTTTTGGCG TTGAACGCGA TGCCGTGTTT GGCTTTTTAT CGCTGATCTT CTGGCTGCTA
ATCTTTGTGG TTTCCATTAA ATATCTCACC TTCGTGATGC GGGCAGATAA CGCCGGTGAA
GGGGGGATCC TGACGTTGAT GTCGCTTGCC GGGCGTAATA CGTCGGCGCG AACCACATCA
ATGCTGGTGA TTATGGGGCT AATCGGCGGC AGCTTTTTCT ATGGTGAAGT CGTCATAACA
CCCGCTATTT CGGTGATGTC AGCCATTGAA GGTCTGGAAA TCGTCGCCCC GCAGCTGGAT
ACCTGGATAG TTCCCCTCTC AATTATCGTT CTCACTTTAT TATTTATGAT TCAAAAACAT
GGCACGGCTA TGGTCGGTAA GCTGTTTGCA CCGATCATGC TGACCTGGTT TTTGATTCTG
GCAGGGCTGG GGTTACGTAG CATTATTGCT AACCCGGAAG TGCTGCATGC ACTGAATCCG
ATGTGGGCGG TGCATTTCTT CCTTGAATAC AAAACGGTTT CTTTTATTGC ATTAGGGGCA
GTGGTGCTGT CGATTACGGG GGTCGAGGCG CTGTATGCTG ATATGGGGCA CTTTGGTAAG
TTCCCTATTC GTCTGGCGTG GTTCACCGTC GTATTGCCTT CCTTAACCCT TAATTACTTC
GGCCAGGGAG CGCTGTTGTT AAAGAATCCG GAAGCGATTA AGAACCCGTT CTTCCTGTTG
GCACCGGACT GGGCGCTGAT CCCGCTGCTG ATCATCGCCG CACTGGCGAC GGTAATTGCC
TCGCAGGCGG TTATCTCTGG CGTCTTCTCA TTGACGCGTC AGGCGGTACG TCTGGGATAT
TTGTCGCCGA TGCGCATTAT TCACACCTCC GAAATGGAGT CAGGGCAAAT CTATATTCCC
TTTGTGAACT GGATGCTCTA TGTCGCGGTC GTGATTGTGA TTGTCAGCTT TGAGCACTCC
AGCAACCTGG CGGCGGCGTA CGGGATTGCG GTGACCGGAA CCATGGTGCT GACGTCTATT
CTCTCGACTA CCGTGGCACG TCAGAACTGG CACTGGAATA AGTATTTTGT TGCGCTGATC
CTGATTGCTT TCCTTTGTGT CGATATTCCC TTGTTCACCG CTAACCTCGA TAAACTGCTC
TCCGGCGGCT GGTTGCCATT AAGCCTCGGT ACGGTGATGT TTATCGTGAT GACCACCTGG
AAGAGCGAGC GTTTCCGCTT GCTGCGGCGG ATGCATGAAC ATGGTAACTC TCTGGAAGCA
ATGATTGCTT CGCTGGAGAA ATCACCGCCC GTTCGCGTGC CTGGGACCGC GGTGTATATG
TCGCGTGCAA TCAACGTCAT TCCCTTTGCG CTGATGCATA ACCTGAAACA TAACAAGGTA
TTGCATGAGC GGGTGATTCT GTTAACTCTG CGCACCGAAG ACGCGCCATA TGTCCATAAC
GTCCGTCGGG TACAGATTGA ACAACTGTCG CCCACTTTCT GGCGCGTGGT GGCAAGTTAT
GGTTGGCGAG AAACGCCAAA CGTGGAAGAA GTTTTCCACC GCTGCGGTCT GGAAGGATTA
AGTTGCCGGA TGATGGAAAC CTCCTTCTTT ATGTCGCATG AGTCGTTGAT CCTCGGCAAA
CGCCCGTGGT ATTTGCGTCT GCGCGGCAAG CTGTACTTGC TGCTGCAACG TAATGCGCTG
CGTGCGCCGG ATCAATTTGA AATCCCGCCA AACAGGGTTA TCGAACTGGG TACTCAGGTC
GAAATCTAA
 
Protein sequence
MSTDNKQSLP AITLAAIGVV YGDIGTSPLY TLRECLSGQF GFGVERDAVF GFLSLIFWLL 
IFVVSIKYLT FVMRADNAGE GGILTLMSLA GRNTSARTTS MLVIMGLIGG SFFYGEVVIT
PAISVMSAIE GLEIVAPQLD TWIVPLSIIV LTLLFMIQKH GTAMVGKLFA PIMLTWFLIL
AGLGLRSIIA NPEVLHALNP MWAVHFFLEY KTVSFIALGA VVLSITGVEA LYADMGHFGK
FPIRLAWFTV VLPSLTLNYF GQGALLLKNP EAIKNPFFLL APDWALIPLL IIAALATVIA
SQAVISGVFS LTRQAVRLGY LSPMRIIHTS EMESGQIYIP FVNWMLYVAV VIVIVSFEHS
SNLAAAYGIA VTGTMVLTSI LSTTVARQNW HWNKYFVALI LIAFLCVDIP LFTANLDKLL
SGGWLPLSLG TVMFIVMTTW KSERFRLLRR MHEHGNSLEA MIASLEKSPP VRVPGTAVYM
SRAINVIPFA LMHNLKHNKV LHERVILLTL RTEDAPYVHN VRRVQIEQLS PTFWRVVASY
GWRETPNVEE VFHRCGLEGL SCRMMETSFF MSHESLILGK RPWYLRLRGK LYLLLQRNAL
RAPDQFEIPP NRVIELGTQV EI