Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_04421 |
Symbol | putP |
ID | 4776985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 447714 |
End bp | 449483 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640085946 |
Product | sodium/solute symporter family protein glucose transporter |
Protein accession | YP_001016459 |
Protein GI | 124022152 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCCA TTGATTGGAT AATCCTCGTT GGTTATCTCG CCATCACTCT CGTGATTGGG CTTTGGTTGT CTCGTCGCAA TCGAAGTGAG GATGATTACT TTGTTGCGGG ACGTCGACTC AGTGGTTGGT TGGCTGGGGC TTCGATGGCC GCAACGACAT TTTCGATTGA TACCCCTCTT TATGTAGCCG GTCTGGTTGG CACACGAGGA TTGGCCGGCA ATTGGGAATG GTGGAGCTTC GGCCTGGCGC ATGTTGCCAT GGCAGTTGTC TTTGCTCCGC TCTGGCGACG GAGTGGGGTG CTGACCGATG CTGCATTCAC CGAGCTGCGT TATGGCGGAG CGCCTGCCGC CTGGCTGCGT GGGATTAAAG CATTTTTGCT CGCACTGCCG GTGAATTGCA TCGGTATCGG CTATGCCTTC CTGGCCATGC GCAAGGTCGT AGAAGCCCTG GGCATAGTGT CTGGCAGCCC TGTGGCAGTG CTTGGCGGTA CCCCTGACAC TGTTGTGCTG TTGGCAGTCG TGGCTGTGAT GGTGCTTGTT TACACAGTTG CTGGCGGCCT CTGGGCTGTC GTGATCACCG ATTTCGTGCA GTTGCTGTTG GCTCTTTTGG GAGCGGTGGC GGTGGCGTGG GCCGCCGTTC ATGCAGCCGG AGGGATGGAC GCCATGCTCG ATCAACTGAG GGGGTTAGAG CGACCTGAAC TGCTGGCGAT CGTGCCCTGG CAATGGAATG GTGATGGCTT TGATTGGATT GGTGGCGCCG AGATCAGTGT GGCCACATTC CTTTCTTATC TCACTGTCCA ATGGTGGAGT TTCCGTCGCA GCGACGGTGG TGGTGAGTTC ATTCAGAGAA TGTTGGCCAC GCGGGACGAG CGTCAGGCTC AACTGGCTGG CTGGGTGTTT CTGGTGGTCA ACTATCTGGT GCGCAGTTGG TTGTGGGTTG TGGTTGCTCT TGCCGCTTTA GTGCTGTTGC CAGCTCAGGC CGACTGGGAA CTCAGTTACC CAACCTTGGC TGTGACCTAC CTGCCACCGG TGGTGCTTGG TCTTGTCGTG GTCTCTCTGG TGGCGGCTTT CATGAGTACG GTCAGTACCG CAGTGAACTG GGGTGCTAGC TATCTCACCC ACGATCTTTA CCAGCGTTTT ATAAGGCCGG ATGCTGATCA GAGAGAGCTG TTGCTGGTCG GTCAATTCAC CAGTGTTTTG CTGCTAGTTC TCGGGGTGAT CACCGCATTA ATTAGCGACA GCATCGGCAC AGTGTTTCGT CTGGTGATTG CTATTGGTAC CGGGCCCGGA GTGGTGCTTG TGCTGCGTTG GTTCTGGTGG CGGATCAATG CTGCTGCCGA ATTGGCGGCG ATGGTTTGTG GTTTTGTGGT GGGCCTTTGT ACATCCGTTG TGCCCTTGGT GACAATTCCT GATTACGGCA CCAGGTTGAT GATCACCACA GCGATTACTG CTGTGGTCTG GGTTGTAGTG ATGCTGGTGA CTCCGCCTGA ATCGCCGCAA GTGCTCGAGC GCTTTGTGAA CCAGGTGCAG CCACCAGGCC CAGGCTGGAG TCGCTTGCGT CAACGCCTGG ATATAACACC GGTGGATTCC CTGGTGAGTT TGTGTTTGCG CTTCATTCTT AGTGTCGGTG TTCTGTTTGG CGGTCTGCTC GGCACTGGCG CGTTTCTGTT ACATCAGCAG CTAGGCGGTT GGATCGGTTT GGTGGTCACC GTGGTCTGTG TGCTGCCTTT AACCCGTGGT TGGCTGCGTA GTCGGATGGG CGTCGCATGA
|
Protein sequence | MAAIDWIILV GYLAITLVIG LWLSRRNRSE DDYFVAGRRL SGWLAGASMA ATTFSIDTPL YVAGLVGTRG LAGNWEWWSF GLAHVAMAVV FAPLWRRSGV LTDAAFTELR YGGAPAAWLR GIKAFLLALP VNCIGIGYAF LAMRKVVEAL GIVSGSPVAV LGGTPDTVVL LAVVAVMVLV YTVAGGLWAV VITDFVQLLL ALLGAVAVAW AAVHAAGGMD AMLDQLRGLE RPELLAIVPW QWNGDGFDWI GGAEISVATF LSYLTVQWWS FRRSDGGGEF IQRMLATRDE RQAQLAGWVF LVVNYLVRSW LWVVVALAAL VLLPAQADWE LSYPTLAVTY LPPVVLGLVV VSLVAAFMST VSTAVNWGAS YLTHDLYQRF IRPDADQREL LLVGQFTSVL LLVLGVITAL ISDSIGTVFR LVIAIGTGPG VVLVLRWFWW RINAAAELAA MVCGFVVGLC TSVVPLVTIP DYGTRLMITT AITAVVWVVV MLVTPPESPQ VLERFVNQVQ PPGPGWSRLR QRLDITPVDS LVSLCLRFIL SVGVLFGGLL GTGAFLLHQQ LGGWIGLVVT VVCVLPLTRG WLRSRMGVA
|
| |