Gene ECH74115_5490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5490 
SymbolpNaS 
ID6968994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5139245 
End bp5140876 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content55% 
IMG OID643389135 
Productinorganic phosphate transporter, sodium-dependent 
Protein accessionYP_002273532 
Protein GI209398736 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter
[TIGR01013] Phosphate:Na+ Symporter (PNaS) Family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTAACGC TGCTTAACCT GCTTTCTGCC GTCGCCCTGC TGGTCTGGGG GACTCATATT 
GTTCGAACCG GCGTAATGCG CGTCTTCGGC GCGCGTTTGC GTACTGTCCT TAGCCGGAGC
GTCGAAAAAA AGCCGCTCGC CTTTTGCGCG GGGATCGGCG TTACCGCACT GGTACAGAGC
AGTAATGCCA CCACCATGCT GGTGACCTCG TTCGTCGCTC AGGATCTGGT AGCCCTCGCA
CCGGCTCTGG TCATTGTGCT GGGGGCAGAT GTCGGGACGG CGCTAATGGC GCGTATTCTC
ACCTTCGACT TATCCTGGCT GTCACCGTTA CTTATTTTTA TCGGCGTGAT TTTTTTCCTC
GGACGCAAAC AGTCACGCGC CGGGCAACTG GGCCGCGTCG GTATTGGTCT TGGGCTGATT
TTGCTGGCGC TGGAGTTGAT TGTGCAGGCC GTAACGCCGA TCACCCAGGC AAACGGCGTT
CAGGTGATCT TTGCCTCGCT GACCGGCGAT ATTCTGCTGG ATGCGCTGAT TGGCGCGATG
TTCGCCATTA TCAGCTACTC CAGCCTTGCT GCTGTACTGC TGACCGCGAC TCTGACCGCC
GCAGGCATTA TCTCCTTCCC CGTGGCGCTC TGTCTGGTGA TTGGTGCTAA CCTCGGTTCC
GGCCTGCTGG CGATGCTCAA CAACAGTGCC GCCAATGCCG CAGCCCGCCG TGTCGCGCTG
GGTAGTCTGC TGTTTAAGCT GGTGGGTAGC CTGATTATCC TGCCGTTTGT CCATTTGCTG
GCAGAGACAA TGGGGAAGTT GTCATTGCCA AAAGCGGAAC TGGTGATCTA TTTCCACGTC
TTCTACAACC TTGTACGTTG CCTGGCCATG CTGCCATTTG TTGACCCGAT GGCACGGTTT
TGCAAAACGA TTATTCGCGA TGAACCGGAA CTGGATACCC AGCTACGGCC TAAACATCTG
GATGTCAGCG CGCTGGATAC GCCCACGCTT GCTCTGGCGA ACGCCGCGCG CGAAACCCTG
CGCATTGGCG ACGCCATGGA ACAGATGATG GAAGGGCTGA ATAAAGTGAT GCACGGCGAG
CCAAGGCAGG AGAAAGAGCT GCGTAAGCTG GCAGATGATA TCAACGTTCT CTATACCGCC
ATTAAGCTGT ATCTGGCGCG GATGCCAAAA GAGGAACTGG CAGAAGAAGA GTCGCGCCGC
TGGGCGGAGA TCATCGAAAT GTCGCTCAAC CTTGAACAGG CCTCCGATAT CGTCGAGCGC
ATGAGTAGTG AAATTGCCGA CAAATCGCTG GCTGCAAGGC GAGCATTTTC GCTTGATGGC
TTGAAGGAAC TGGATGCGCT CTATGAGCAA TTGCTCAGTA ATTTAAAGCT GGCAATGTCG
GTGTTCTTCT CTGGCGATGT CACCAGCGCT CGTCGTTTGC GCCGCAGCAA GCATCGCTTT
CGCATTCTTA ATCGCCGCTA TTCCCACGCC CACGTCGATC GCCTGCATCA GCAAAACGTG
CAAAGCATTG AAACCAGTTC GCTACATTTA GGCTTACTGG GAGATATGCA GCGTCTGAAC
TCGCTGTTTT GTTCGGTGGC TTACAGTGTG CTGGAACAGC CTGATGAAGA TGAGGGACGG
GACGAGTATT AA
 
Protein sequence
MLTLLNLLSA VALLVWGTHI VRTGVMRVFG ARLRTVLSRS VEKKPLAFCA GIGVTALVQS 
SNATTMLVTS FVAQDLVALA PALVIVLGAD VGTALMARIL TFDLSWLSPL LIFIGVIFFL
GRKQSRAGQL GRVGIGLGLI LLALELIVQA VTPITQANGV QVIFASLTGD ILLDALIGAM
FAIISYSSLA AVLLTATLTA AGIISFPVAL CLVIGANLGS GLLAMLNNSA ANAAARRVAL
GSLLFKLVGS LIILPFVHLL AETMGKLSLP KAELVIYFHV FYNLVRCLAM LPFVDPMARF
CKTIIRDEPE LDTQLRPKHL DVSALDTPTL ALANAARETL RIGDAMEQMM EGLNKVMHGE
PRQEKELRKL ADDINVLYTA IKLYLARMPK EELAEEESRR WAEIIEMSLN LEQASDIVER
MSSEIADKSL AARRAFSLDG LKELDALYEQ LLSNLKLAMS VFFSGDVTSA RRLRRSKHRF
RILNRRYSHA HVDRLHQQNV QSIETSSLHL GLLGDMQRLN SLFCSVAYSV LEQPDEDEGR
DEY