Gene EcSMS35_4473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4473 
SymbolpNaS 
ID6145420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4570634 
End bp4572265 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content56% 
IMG OID641619289 
Productinorganic phosphate transporter, sodium-dependent 
Protein accessionYP_001746401 
Protein GI170680550 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter
[TIGR01013] Phosphate:Na+ Symporter (PNaS) Family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.482641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTAACGC TGCTTCACCT GCTTTCTGCC GTCGCCCTGC TGGTCTGGGG GACTCATATT 
GTTCGAACAG GCGTAATGCG CGTCTTCGGC GCGCGTCTGC GTACTGTTCT TAGCCGAAGC
GTCGAAAAGA AGCCGCTCGC CTTTTGCGCG GGGATAGGCG TTACCGCACT GGTGCAGAGC
AGTAATGCCA CCACCATGCT GGTGACCTCG TTTGTCGCTC AGGATCTGGT AGCCCTCGCA
CCGGCTCTGG TGATTGTGCT GGGTGCAGAT GTCGGGACGG CGCTAATGGC GCGTATTCTC
ACCTTCGACT TATCCTGGCT GTCACCGTTA CTTATTTTTA TCGGCGTGAT TTTTTTCCTC
GGACGCAAAC AGTCACGCGC CGGGCAACTG GGCCGCGTCG GTATTGGTCT TGGGCTGATT
TTGCTGGCGC TGGAGTTGAT TGTGCAGGCC GTAACGCCGA TCACCCAGGC AAACGGCGTT
CAGGTGATCT TTGCCTCGCT GACCGGCGAT ATTCTGCTGG ATGCGCTGAT TGGCGCGATG
TTCGCCATTA TCAGCTACTC CAGCCTTGCT GCTGTACTGC TGACCGCGAC TCTGACCGCC
GCAGGCATTA TCTCCTTCCC CGTGGCGCTC TGTCTGGTGA TTGGTGCTAA CCTCGGTTCC
GGTCTGCTGG CGATGCTCAA CAACAGTGCC GCCAATGCCG CAGCCCGCCG TGTCGCGCTG
GGTAGCCTGC TGTTTAAGCT GGTGGGTAGC CTGATTATCC TGCCGTTTGT CCATTTGCTG
GCAGAGACAA TGGGGAAGTT GCCACTGCCA AAAGCGGAAC TGGTGATCTA TTTCCACGTC
TTCTACAACC TTGTACGCTG CCTGGTCATG CTGCCATTTG TTGACCCGAT GGCACGGTTT
TGCAAAACGA TTATTCGCGA TGAACCGGAA CTGGATACCC AGCTACGACC CAAACATCTG
GATGTCAGCG CGCTGGATAC GCCCACGCTT GCTCTGGCGA ACGCCGCGCG CGAAACCCTG
CGCATTGGTG ACGCGATGGA ACAGATGATG GAAGGGCTGA ATAAAGTGAT GCACGGCGAG
CCACGGCAGG AGAAAGAGCT GCGTAAGCTG GCAGATGATA TCAACGTTCT CTATACCGCC
ATTAAGCTGT ATCTGGCGCG GATGCCAAAA GAAGAGCTGG CGGAGGAAGA GTCGCGCCGC
TGGGCGGAGA TCATCGAAAT GTCGCTCAAC CTTGAACAGG CCTCCGATAT CGTCGAGCGC
ATGGGGAGCG AAATTGCCGA TAAATCGCTG GCAGCACGGC GGGCATTTTC GCTTGATGGC
TTGAAGGAAC TGGATGCGCT CTATGAGCAA TTGCTCAGTA ATTTAAAACT GGCAATGTCG
GTTTTCTTCT CTGGCGATGT TACCAGCGCC CGTCGTTTGC GCCGCAGCAA GCATCGCTTT
CGCATTCTTA ATCGCCGTTA TTCCCATGCT CACGTCGATC GCCTGCATCA GCAAAACGTG
CAAAGCATTG AAACCAGTTC GCTACATTTA GGCTTACTGG GAGATATGCA GCGTCTGAAC
TCGCTGTTTT GTTCGGTGGC TTACAGTGTG CTGGAACAGC CGGATGAAGA CGAAGGACGG
GACGAGTATT AA
 
Protein sequence
MLTLLHLLSA VALLVWGTHI VRTGVMRVFG ARLRTVLSRS VEKKPLAFCA GIGVTALVQS 
SNATTMLVTS FVAQDLVALA PALVIVLGAD VGTALMARIL TFDLSWLSPL LIFIGVIFFL
GRKQSRAGQL GRVGIGLGLI LLALELIVQA VTPITQANGV QVIFASLTGD ILLDALIGAM
FAIISYSSLA AVLLTATLTA AGIISFPVAL CLVIGANLGS GLLAMLNNSA ANAAARRVAL
GSLLFKLVGS LIILPFVHLL AETMGKLPLP KAELVIYFHV FYNLVRCLVM LPFVDPMARF
CKTIIRDEPE LDTQLRPKHL DVSALDTPTL ALANAARETL RIGDAMEQMM EGLNKVMHGE
PRQEKELRKL ADDINVLYTA IKLYLARMPK EELAEEESRR WAEIIEMSLN LEQASDIVER
MGSEIADKSL AARRAFSLDG LKELDALYEQ LLSNLKLAMS VFFSGDVTSA RRLRRSKHRF
RILNRRYSHA HVDRLHQQNV QSIETSSLHL GLLGDMQRLN SLFCSVAYSV LEQPDEDEGR
DEY