Gene EcSMS35_1890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1890 
Symbolkch 
ID6146258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1910255 
End bp1911463 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content43% 
IMG OID641616766 
Productvoltage-gated potassium channel 
Protein accessionYP_001743944 
Protein GI170682593 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0569] K+ transport systems, NAD-binding component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0139159 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCTAC GGCACGACAT TCTCGCGCTG GCCGTCTTTT TAAATGGATT GCTTATTTTT 
AAAACAATCT ATGGTATGTC GGTCAATTTG CTAGATATTT TCCATATCAA AGCATTTTCA
GAGTTGGATC TCTCCTTGCT GGCAAACGCC CCTCTATTTA TGCTCGGCGT CTTTCTTGTC
CTGAACTCCA TTGGCTTACT GTTCCGGGCA AAGCTCGCAT GGGCAATCAG TATCATTTTG
TTGTTGATAG CGCTAATTTA CACCCTGCAT TTTTATCCCT GGCTGAAATT TAGTATTGGA
TTTTGCATTT TTACGCTGGT GTTTTTGCTG ATACTGCGCA AAGACTTCTC CCACAGTAGC
GCCGCAGCCG GGACAATTTT CGCATTTATT AGTTTCACGA CGTTACTGTT TTACTCCACC
TACGGTGCGC TTTATTTAAG CGAAGGTTTT AATCCGCGAA TAGAAAGTTT GATGACCGCG
TTCTATTTTT CGATAGAAAC CATGTCAACC GTCGGCTACG GCGATATTGT CCCTGTTTCT
GAATCAGCAC GATTGTTCAC TATTTCGGTC ATTATTTCCG GCATTACCGT TTTTGCCACC
TCCATGACTT CAATTTTTGG CCCGCTTATC CGCGGGGGAT TCAACAAACT TGTAAAAGGA
AACAATCATA CAATGCATCG TAAAGATCAT TTTATTGTTT GCGGACATTC GATTCTCGCC
ATCAATACGA TTCTGCAACT GAATCAACGC GGACAAAATG TAACGGTTAT CAGCAACTTG
CCTGAAGATG ATATCAAGCA ACTTGAGCAA CGCTTAGGTG ATAACGCTGA TGTTATCCCC
GGTGACAGTA ATGACAGTTC AGTATTAAAG AAAGCAGGAA TCGATCGATG CCGGGCCATT
CTGGCGCTGA GTGATAACGA TGCAGATAAC GCGTTTGTTG TACTCTCGGC AAAAGATATG
AGCAGTGATG TCAAAACAGT TCTCGCCGTC AGTGATAGCA AAAACCTGAA TAAGATTAAG
ATGGTACATC CGGATATCAT TCTCTCGCCG CAACTGTTTG GCAGCGAAAT TCTGGCACGA
GTTTTAAATG GTGAAGAGAT TAATAATGAT ATGCTCGTTT CAATGTTGTT GAACTCCGGT
CATGGTATTT TCAGCGATAA CGATGAACAA GAAACGAAAG CTGACAATAA AGAATCAGCG
CAAAAATAG
 
Protein sequence
MTLRHDILAL AVFLNGLLIF KTIYGMSVNL LDIFHIKAFS ELDLSLLANA PLFMLGVFLV 
LNSIGLLFRA KLAWAISIIL LLIALIYTLH FYPWLKFSIG FCIFTLVFLL ILRKDFSHSS
AAAGTIFAFI SFTTLLFYST YGALYLSEGF NPRIESLMTA FYFSIETMST VGYGDIVPVS
ESARLFTISV IISGITVFAT SMTSIFGPLI RGGFNKLVKG NNHTMHRKDH FIVCGHSILA
INTILQLNQR GQNVTVISNL PEDDIKQLEQ RLGDNADVIP GDSNDSSVLK KAGIDRCRAI
LALSDNDADN AFVVLSAKDM SSDVKTVLAV SDSKNLNKIK MVHPDIILSP QLFGSEILAR
VLNGEEINND MLVSMLLNSG HGIFSDNDEQ ETKADNKESA QK