Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4601 |
Symbol | |
ID | 6871368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4442337 |
End bp | 4443968 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642787505 |
Product | sodium-dependent inorganic phosphate |
Protein accession | YP_002218103 |
Protein GI | 198244465 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1283] Na+/phosphate symporter |
TIGRFAM ID | [TIGR00704] Na/Pi-cotransporter [TIGR01013] Phosphate:Na+ Symporter (PNaS) Family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.635464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTAACTT TGCTCCATCT ACTTTCTGCC GTTGCGCTGT TGGTATGGGG AACACATATC GTTCGTACTG GCGTAATGCG CGTGTTTGGC GCGCGTCTAC GCACTGTCCT CAGCCGCAGC GTGGAAAAGA AACCGCTCGC CTTTTGTGCG GGTATCGGTG TTACCGCGCT GGTACAAAGC AGTAACGCCA CCACTTTATT GGTAACGTCG TTTGTCGCCC AGGATCTTGT CGCCCTGACG CCAGCTCTGG TGATTGTGCT GGGCGCTGAT GTGGGTACTG CGCTAATGGC GCGTATTCTC ACCTTTGACT TGTCGTGGCT ATCGCCGCTG CTGATTTTTA TTGGCGTGAT TTTCTTTTTG GGACGTAAGC AGTCCCGCGC CGGGCAGCTG GGTCGCGTCG GTATCGGGCT TGGCCTGATT CTACTGGCGC TGGAGCTGAT TGTGCAGGCA GTGACGCCGA TCACTCAGGC TAATGGCGTG CAGGTCATTT TCGCGTCGCT GACGGGCGAC ATTATGCTGG ATGCGCTGAT TGGCGCGATG TTCGCTATTA TCAGTTATTC CAGCCTGGCG GCGGTGTTGC TGACGGCGAC CCTGACGGCG GCGGGGATTA TATCGTTCCC GGTGGCGTTG TGCCTGGTCA TCGGCGCCAA TCTGGGATCG GGCTTGTTGG CGATGCTCAA CAACAGCGCC GCCAATGCTG CCGCGCGTCG CGTAGCGCTC GGCAGCCTAT TGTTCAAATT GATCGGCAGC CTGGTCATCC TGCCGTTTGT CCATCCGCTG GCGAATCTGA TGGATGAGCT ATCGCTACCG AAGTCAGAGC TGGTGATCTA TTTCCACGTT TTCTATAACC TGGTGCGCTG CCTGGCGATG GTGCCATTTG CCGAGCTGAT GGCGCGTTTT TGTAAACGAA TTATTCGTGA TGAGCCTGAA CTGGATACCC ATCTGAAGCC GAAACATCTG GATGTCAGCG CGCTGGATAC GCCAACGCTG GCGCTGGCTA ATGCTGCCCG TGAGGTGTTG CGCATTGGCG ATGCGATGGA ACAGATGATG GAAGGGCTAA AAAAGGTCAT GCACGGCGAG CCGCGTGAAG AGAAAGCGCT GCGCAAGCTG GCGGATGACG TTAACGTGCT CTACACCGCG ATTAAGCTTT ATCTGGCGCG AATGCCGAAA GACGAGCTGG CGGCGGAGGA GTCCCGTCGG TGGGCGGAGA TTATTGAGAT GGCCCTGAAC CTCGAACAGG CGTCGGATAT TATCGAGCGA ATGGGCAGCG AGATTGCCGA CAAGTCGCTG GCGGCGCGTC GGGCATTTTC AGAAGAAGGA TTGAAAGAGC TGGATGCGCT TTACGATCAA CTGCTCAGTA ATCTGCAACT GGCGATGTCG GTCTTTTTCT CCGGCGATGT CACCAGCGCC CGTCGGTTGC GCCGCAGTAA ACATCGCTTC CGTATACTTA ATCGCCGATA CTCACATGCG CATGTCGACC GCCTGCACCA GCAGAACGTG CAAAGCATTG AAACCAGCTC GCTCCATTTA GGGCTGCTGG GCGATATGAA GCGTCTTAAC TCACTGTTCT GTTCGGTCGC CTATAGCGTA CTGGAACAGC CGGATCAGGA CGAGGAACGG GGCGAGTATT AA
|
Protein sequence | MLTLLHLLSA VALLVWGTHI VRTGVMRVFG ARLRTVLSRS VEKKPLAFCA GIGVTALVQS SNATTLLVTS FVAQDLVALT PALVIVLGAD VGTALMARIL TFDLSWLSPL LIFIGVIFFL GRKQSRAGQL GRVGIGLGLI LLALELIVQA VTPITQANGV QVIFASLTGD IMLDALIGAM FAIISYSSLA AVLLTATLTA AGIISFPVAL CLVIGANLGS GLLAMLNNSA ANAAARRVAL GSLLFKLIGS LVILPFVHPL ANLMDELSLP KSELVIYFHV FYNLVRCLAM VPFAELMARF CKRIIRDEPE LDTHLKPKHL DVSALDTPTL ALANAAREVL RIGDAMEQMM EGLKKVMHGE PREEKALRKL ADDVNVLYTA IKLYLARMPK DELAAEESRR WAEIIEMALN LEQASDIIER MGSEIADKSL AARRAFSEEG LKELDALYDQ LLSNLQLAMS VFFSGDVTSA RRLRRSKHRF RILNRRYSHA HVDRLHQQNV QSIETSSLHL GLLGDMKRLN SLFCSVAYSV LEQPDQDEER GEY
|
| |