Gene Information |
Locus tag | EcSMS35_4473 |
Symbol | pNaS |
ID | 6145420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4570634 |
End bp | 4572265 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641619289 |
Product | inorganic phosphate transporter, sodium-dependent |
Protein accession | YP_001746401 |
Protein GI | 170680550 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1283] Na+/phosphate symporter |
TIGRFAM ID | [TIGR00704] Na/Pi-cotransporter; [TIGR01013] Phosphate:Na+ Symporter (PNaS) Family |
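The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive (the record's own numbers are consistent with that reading) and that the final codon of an unspliced CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4570634, 4572265)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

Applied to this record, the two helpers give 1632 bp and 543 aa, matching the Gene Length and Protein Length fields.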
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.482641 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGTTAACGC TGCTTCACCT GCTTTCTGCC GTCGCCCTGC TGGTCTGGGG GACTCATATT GTTCGAACAG GCGTAATGCG CGTCTTCGGC GCGCGTCTGC GTACTGTTCT TAGCCGAAGC GTCGAAAAGA AGCCGCTCGC CTTTTGCGCG GGGATAGGCG TTACCGCACT GGTGCAGAGC AGTAATGCCA CCACCATGCT GGTGACCTCG TTTGTCGCTC AGGATCTGGT AGCCCTCGCA CCGGCTCTGG TGATTGTGCT GGGTGCAGAT GTCGGGACGG CGCTAATGGC GCGTATTCTC ACCTTCGACT TATCCTGGCT GTCACCGTTA CTTATTTTTA TCGGCGTGAT TTTTTTCCTC GGACGCAAAC AGTCACGCGC CGGGCAACTG GGCCGCGTCG GTATTGGTCT TGGGCTGATT TTGCTGGCGC TGGAGTTGAT TGTGCAGGCC GTAACGCCGA TCACCCAGGC AAACGGCGTT CAGGTGATCT TTGCCTCGCT GACCGGCGAT ATTCTGCTGG ATGCGCTGAT TGGCGCGATG TTCGCCATTA TCAGCTACTC CAGCCTTGCT GCTGTACTGC TGACCGCGAC TCTGACCGCC GCAGGCATTA TCTCCTTCCC CGTGGCGCTC TGTCTGGTGA TTGGTGCTAA CCTCGGTTCC GGTCTGCTGG CGATGCTCAA CAACAGTGCC GCCAATGCCG CAGCCCGCCG TGTCGCGCTG GGTAGCCTGC TGTTTAAGCT GGTGGGTAGC CTGATTATCC TGCCGTTTGT CCATTTGCTG GCAGAGACAA TGGGGAAGTT GCCACTGCCA AAAGCGGAAC TGGTGATCTA TTTCCACGTC TTCTACAACC TTGTACGCTG CCTGGTCATG CTGCCATTTG TTGACCCGAT GGCACGGTTT TGCAAAACGA TTATTCGCGA TGAACCGGAA CTGGATACCC AGCTACGACC CAAACATCTG GATGTCAGCG CGCTGGATAC GCCCACGCTT GCTCTGGCGA ACGCCGCGCG CGAAACCCTG CGCATTGGTG ACGCGATGGA ACAGATGATG GAAGGGCTGA ATAAAGTGAT GCACGGCGAG CCACGGCAGG AGAAAGAGCT GCGTAAGCTG GCAGATGATA TCAACGTTCT CTATACCGCC ATTAAGCTGT ATCTGGCGCG GATGCCAAAA GAAGAGCTGG CGGAGGAAGA GTCGCGCCGC TGGGCGGAGA TCATCGAAAT GTCGCTCAAC CTTGAACAGG CCTCCGATAT CGTCGAGCGC ATGGGGAGCG AAATTGCCGA TAAATCGCTG GCAGCACGGC GGGCATTTTC GCTTGATGGC TTGAAGGAAC TGGATGCGCT CTATGAGCAA TTGCTCAGTA ATTTAAAACT GGCAATGTCG GTTTTCTTCT CTGGCGATGT TACCAGCGCC CGTCGTTTGC GCCGCAGCAA GCATCGCTTT CGCATTCTTA ATCGCCGTTA TTCCCATGCT CACGTCGATC GCCTGCATCA GCAAAACGTG CAAAGCATTG AAACCAGTTC GCTACATTTA GGCTTACTGG GAGATATGCA GCGTCTGAAC TCGCTGTTTT GTTCGGTGGC TTACAGTGTG CTGGAACAGC CGGATGAAGA CGAAGGACGG GACGAGTATT AA
Protein sequence | MLTLLHLLSA VALLVWGTHI VRTGVMRVFG ARLRTVLSRS VEKKPLAFCA GIGVTALVQS SNATTMLVTS FVAQDLVALA PALVIVLGAD VGTALMARIL TFDLSWLSPL LIFIGVIFFL GRKQSRAGQL GRVGIGLGLI LLALELIVQA VTPITQANGV QVIFASLTGD ILLDALIGAM FAIISYSSLA AVLLTATLTA AGIISFPVAL CLVIGANLGS GLLAMLNNSA ANAAARRVAL GSLLFKLVGS LIILPFVHLL AETMGKLPLP KAELVIYFHV FYNLVRCLVM LPFVDPMARF CKTIIRDEPE LDTQLRPKHL DVSALDTPTL ALANAARETL RIGDAMEQMM EGLNKVMHGE PRQEKELRKL ADDINVLYTA IKLYLARMPK EELAEEESRR WAEIIEMSLN LEQASDIVER MGSEIADKSL AARRAFSLDG LKELDALYEQ LLSNLKLAMS VFFSGDVTSA RRLRRSKHRF RILNRRYSHA HVDRLHQQNV QSIETSSLHL GLLGDMQRLN SLFCSVAYSV LEQPDEDEGR DEY
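The gene and protein sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of that step together with a GC-content calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and the short example string is only the first two blocks of the gene sequence, not the full gene, so its result is not the 56% reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence above (illustration only).
first_blocks = "GTGTTAACGC TGCTTCACCT"
print(clean_sequence(first_blocks))
print(gc_percent(first_blocks))
```

Passing the full Gene sequence field to `gc_percent` would be the way to compare against the reported GC content value.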