Gene EcSMS35_4528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4528 
Symbol 
ID6145685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4627566 
End bp4629215 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content56% 
IMG OID641619344 
ProductNa+/H+ antiporter 
Protein accessionYP_001746456 
Protein GI170682054 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0025] NhaP-type Na+/H+ and K+/H+ antiporters 
TIGRFAM ID[TIGR00831] Na+/H+ antiporter, bacterial form 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATCT TCTTCACCAT ACTGATAATG ACCCTCGTGG TCTCGCTGTC CGGGGTGGTC 
ACTCGTGTCA TGCCCTTTCA GATCCCGCTT CCGCTTATGC AAATCGCCAT CGGTGCGCTA
CTGGCGTGGC CGACGTTTGG TTTGCATGTG GAGTTTGATC CTGAACTCTT TTTAGTCTTG
TTTATCCCGC CGTTGCTGTT CGCTGATGGC TGGAAAACGC CGACCCGTGA ATTTCTCGAA
CATGGGCGAG AGATTTTCGG CCTCGCGCTG GCGCTGGTGG TGGTCACCGT GGTCGGCATT
GGCTTCCTTA TTTACTGGGT GGTGCCGGGC ATTCCGCTGA TCCCCGCCTT TGCGCTGGCG
GCGGTGCTTT CTCCGACCGA TGCTGTGGCG CTCTCCGGGA TTGTTGGCGA AGGGCGCATC
CCGAAAAAAA TCATGGGCAT TTTGCAGGGC GAAGCGTTGA TGAATGACGC CTCCGGTCTG
GTGTCGTTGA AGTTTGCCGT GGCAGTGGCG ATGGGGACGA TGATCTTCAC CGTCGGTGGT
GCGACGGTCG AATTTATGAA AGTGGCCATT GGCGGTATTC TCGCCGGTTT TGTGGTGAGC
TGGCTGTACG GTCGCTCGCT GCGATTCCTC AGCCGCTGGG GCGGTGATGA ACCCGCGACG
CAGATTGTTC TACTGTTCTT GCTGCCATTC GCTTCTTATC TGATTGCCGA ACATATTGGC
GTTTCCGGCA TCCTCGCTGC GGTTGCCGCC GGGATGACCA TCACCCGCTC CGGTGTGATG
CGCCGTGCGC CGCTGGCAAT GCGCCTGCGC GCAAACAGTA CCTGGGCGAT GCTGGAATTT
GTCTTTAACG GCATGGTATT CCTGCTGTTA GGTCTGCAGC TGCCGGGTAT TCTGGAGACG
TCGCTGATGG CGGCAGAAAT CGACCCTAAC GTCGAAATCT GGATGCTGTT TACCGATATC
ATTCTGATTT ATGCGGCGCT GATGCTGGTC CGTTTCGGCT GGCTGTGGAC GATGAAAAAG
TTCAGCAACC GCTTCCTGAA GAAGAAGCCG ATGGAGTTTG GTTCGTGGAC TACACGAGAA
ATCCTGATCG CGTCTTTCGC CGGGGTGCGT GGGGCGATCA CTCTGGCCGG TGTGCTCTCT
ATCCCGCTGC TCTTGCCGGA TGGTAACGTC TTCCCGGCGC GCTATGAGCT GGTGTTCCTG
GCGGCTGGTG TCATTCTCTT CTCGCTGTTT GTCGGCGTGG TGATGTTGCC TATTCTGCTA
CAACACATTG AAGTCGCGGA CCATTCGCAA CAATTGAAAG AGGAACGTAT TGCGCGAGCG
GCAACGGCAG AAGTGGCGAT TGTGGCGATC CAGAAAATGG AGGAGCGTCT GGCGGCGGAT
ACCGAAGAGA ATATCGATAA CCAGCTGCTC ACAGAGGTTA GTTCTCGCGT CATTGGTAAC
CTGCGTCGTC GCGCCGATGG ACGTAATGAT GTAGAAAGTT CAATCCTGGA AGAGAACCTG
GAGCGTCGCT TCCGTCTGGC GGCATTGCGT TCTGAACGTG CTGAACTGTA CCACCTGCGC
GCCACGCGGG AGATCAGTAA CGAAACGCTG CAAAAATTAT TGCACGATCT CGATTTGCTT
GAAGCGTTGC TGATTGAAGA GAACCAATAA
 
Protein sequence
MEIFFTILIM TLVVSLSGVV TRVMPFQIPL PLMQIAIGAL LAWPTFGLHV EFDPELFLVL 
FIPPLLFADG WKTPTREFLE HGREIFGLAL ALVVVTVVGI GFLIYWVVPG IPLIPAFALA
AVLSPTDAVA LSGIVGEGRI PKKIMGILQG EALMNDASGL VSLKFAVAVA MGTMIFTVGG
ATVEFMKVAI GGILAGFVVS WLYGRSLRFL SRWGGDEPAT QIVLLFLLPF ASYLIAEHIG
VSGILAAVAA GMTITRSGVM RRAPLAMRLR ANSTWAMLEF VFNGMVFLLL GLQLPGILET
SLMAAEIDPN VEIWMLFTDI ILIYAALMLV RFGWLWTMKK FSNRFLKKKP MEFGSWTTRE
ILIASFAGVR GAITLAGVLS IPLLLPDGNV FPARYELVFL AAGVILFSLF VGVVMLPILL
QHIEVADHSQ QLKEERIARA ATAEVAIVAI QKMEERLAAD TEENIDNQLL TEVSSRVIGN
LRRRADGRND VESSILEENL ERRFRLAALR SERAELYHLR ATREISNETL QKLLHDLDLL
EALLIEENQ