Gene EcolC_3962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3962 
Symbol 
ID6064489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4352222 
End bp4353871 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content56% 
IMG OID641603375 
ProductNa+/H+ antiporter 
Protein accessionYP_001726890 
Protein GI170021936 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0025] NhaP-type Na+/H+ and K+/H+ antiporters 
TIGRFAM ID[TIGR00831] Na+/H+ antiporter, bacterial form 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00930832 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGAAATCT TCTTCACCAT ACTGATAATG ACCCTCGTGG TCTCGCTGTC CGGGGTGGTC 
ACTCGTGTCA TGCCCTTTCA GATCCCGCTT CCGCTTATGC AAATCGCCAT CGGTGCGCTA
CTGGCGTGGC CGACGTTTGG TTTGCATGTG GAGTTTGATC CTGAACTCTT TTTAGTCTTG
TTTATCCCGC CGTTGCTGTT CGCTGATGGC TGGAAAACGC CGACCCGTGA ATTTCTTGAA
CATGGTCGAG AGATTTTCGG CCTCGCACTG GCGCTGGTGG TGGTCACCGT GGTCGGCATT
GGCTTCCTTA TTTACTGGGT GGTGCCGGGC ATTCCGCTGA TCCCCGCCTT TGCGCTGGCG
GCGGTGCTTT CTCCGACCGA TGCTGTGGCG CTCTCCGGGA TTGTTGGCGA AGGGCGCATC
CCGAAAAAAA TCATGGGCAT TTTGCAGGGC GAAGCGTTGA TGAATGACGC CTCCGGCCTG
GTGTCGTTGA AGTTTGCCGT GGCAGTGGCG ATGGGGACGA TGATCTTCAC CGTTGGCGGT
GCAACGGTCG AATTTATGAA AGTAGCCATT GGCGGTATTC TCGCTGGTTT TGTGGTGAGC
TGGCTGTATG GTCGCTCGCT GCGATTCCTC AGCCGCTGGG GCGGTGATGA ACCCGCGACG
CAGATCGTTC TGCTGTTCTT GCTGCCATTC GCTTCTTATC TGATTGCCGA ACATATTGGC
GTTTCGGGCA TCCTCGCTGC GGTTGCCGCC GGGATGACCA TCACCCGCTC CGGTGTGATG
CGCCGTGCGC CGCTGGCAAT GCGCCTGCGT GCAAACAGCA CCTGGGCGAT GCTGGAATTT
GTCTTTAACG GCATGGTGTT CCTGCTGTTA GGTCTGCAGC TGCCGGGTAT TCTGGAGACG
TCGCTGATGG CGGCAGAAAT CGACCCTAAC GTCGAAATCT GGATGCTGTT TACCGATATT
ATTCTGATAT ATGCGGCGCT GATGCTGGTC CGTTTCGGCT GGCTGTGGAC GATGAAAAAG
TTCAGCAACC GCTTCCTGAA GAAGAAGCCG ATGGAGTTTG GTTCGTGGAC CACACGAGAA
ATCCTGATCG CGTCTTTCGC CGGGGTGCGT GGGGCGATCA CTCTGGCCGG TGTGCTCTCT
ATCCCGCTGC TCTTGCCGGA TGGTAACGTC TTCCCGGCGC GCTATGAGCT GGTGTTCCTG
GCGGCTGGCG TCATTCTCTT CTCGCTGTTT GTCGGCGTGG TGATGTTGCC TATTCTGCTA
CAACACATTG AAGTCGCGGA CCATTCGCAA CAATTGAAAG AGGAACGTAT TGCGCGAGCG
GCAACGGCAG AAGTGGCGAT TGTGGCGATC CAGAAAATGG AGGAGCGTCT GGCGGCGGAT
ACCGAAGAGA ATATCGATAA CCAGCTGCTT ACGGAGGTCA GTTCTCGCGT CATTGGTAAC
CTGCGTCGTC GCGCCGATGG ACGTAACGAC GTTGAAAGTT CCGTGCAGGA AGAGAACCTT
GAGCGTCGCT TCCGTCTGGC GGCATTGCGT TCTGAACGTG CTGAACTTTA CCACCTGCGC
GCCACGCGGG AGATCAGCAA CGAAACGCTG CAAAAATTAC TGCACGATCT CGATTTGCTT
GAAGCGTTGC TAATTGAGGA AAATCAGTAA
 
Protein sequence
MEIFFTILIM TLVVSLSGVV TRVMPFQIPL PLMQIAIGAL LAWPTFGLHV EFDPELFLVL 
FIPPLLFADG WKTPTREFLE HGREIFGLAL ALVVVTVVGI GFLIYWVVPG IPLIPAFALA
AVLSPTDAVA LSGIVGEGRI PKKIMGILQG EALMNDASGL VSLKFAVAVA MGTMIFTVGG
ATVEFMKVAI GGILAGFVVS WLYGRSLRFL SRWGGDEPAT QIVLLFLLPF ASYLIAEHIG
VSGILAAVAA GMTITRSGVM RRAPLAMRLR ANSTWAMLEF VFNGMVFLLL GLQLPGILET
SLMAAEIDPN VEIWMLFTDI ILIYAALMLV RFGWLWTMKK FSNRFLKKKP MEFGSWTTRE
ILIASFAGVR GAITLAGVLS IPLLLPDGNV FPARYELVFL AAGVILFSLF VGVVMLPILL
QHIEVADHSQ QLKEERIARA ATAEVAIVAI QKMEERLAAD TEENIDNQLL TEVSSRVIGN
LRRRADGRND VESSVQEENL ERRFRLAALR SERAELYHLR ATREISNETL QKLLHDLDLL
EALLIEENQ