Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_18811 |
Symbol | putP |
ID | 4780104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1537222 |
End bp | 1538973 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640085170 |
Product | sodium/solute symporter family protein glucose transporter |
Protein accession | YP_001015701 |
Protein GI | 124026586 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0257597 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTAA TTGATTGGTT TATTTTATTT GTTTATTTGA TCTTTTCCTT GGTATTGGGA ATTTATATAT CTCTAAAAAA TCATAATGAA GCAGACTATT TTGTTGCTGG TCGTCGTCTT AATGGATTGC TTGCAGGTAT GTCTATGGCT GCAACAACGT TTTCGATTGA CACACCTCTT TATGTGGCCG GAGTTATTGG GACTCGAGGC TTGGCAGGAA ACTGGGAGTG GTGGAGTTTT GGCCTTGCTC ATGTTGCTAT GACGGTTATA TTCGCTCCAC TGTGGAGGCG TAGTGGAGTA TTGACTGACG CAGCCTTTAC TGAATTACGT TATGGGGGGC AAGCAGCAGC TTACTTAAGA GGTATAAAAG CTTTTTTACT CTCAGTTCCA ATCAATTGTA TTGGGATTGG TTATGCTTTC TTAGCAATGC GTAAAGTGGC TGAATCATTA GGTGTGGTAG ATGGACATAT TGTTTTCGGA CCCTTTACTG ACACGATACT TTTAATGATT TTAGTTGCTT TGTTTTTGCT TGTTTACACT GTACTAGGCG GGCTTTGGGC AGTAGTCGTT AATGATTTTG TACAACTTGT ATTAGCTCTT TTAGGCGCAT TTGCTGTTTG CTTTGTGGCA CTTGATGCTT CAGGTGGTAT GACGAATTTG TTAACAAAGT TAGAGGCATT AGATCGACCT GAGCTGCTTT CTTTATTTCC ATGGACATTA AGTAAAAATG GTTTCAATTG GTTGGATAGC TCAGGAATAA GTATTACTAC TTTCTTCGCT TTTATTTCTT TGCAGTGGTG GAGTTTCAGA CGTAGTGATG GTGGTGGGGA ATTTATTCAA CGTTTATTGG CTACAAAAGA TGAAAAACAA GCAACTTTGG CAGGAATAGT TTTTTTAATT GTTAATTATC TTGTTCGCAG TTGGCTTTGG ATTTTAGTTG GCTTAGCTGG TTTAGTCCTC TTGCCTCAAC AAACTGACTG GGAATTAAGT TATCCAAACC TTGCTGTTAC ATACTTACCA CCCGTTGGAC TTGGATTAGT CGTGGTTTCA TTGGTCGCTG CATTTATGAG CACCGTAAGC ACCTCAATCA ATTGGGGGGC AAGTTATCTT GCACATGATC TTTATAAAAG ATTTGTGAGG CCATTTGCTG GGCCTAGAGA AATGATTTTT ATGGGCCAGC TAGCAAGTAT TTTGTTGCTT CTAATTGGAG TCCTCACGGC TTTATTTAGT AACAGTATTG GTTCAATGTT TAGACTTGTT ATTGCGATAG GTACAGGACC AGGTGCCGTT CTTGTACTTA GATGGTTTTG GTGGCGAATT AATGCTCTAG TAGAACTTTC GGCAATGTTA AGTGGCTTTT TTATAGGATT AATCACTTCC GTTTCTCCTT CGTTAACAAT TGAAGATTTC GGAAAAAGAC TTATATTCAC AACTTCATTC TCGGCAGTGA TTTGGTTGCT TACTTTATTT TTTACTGAGC CAGAGTCTGA CGAGACACTC AATAACTTTG TAATGCAGGT AAAACCTCCA GGCCCTGGCT GGAGCAAAAT ACGGAAGAAA TTGAATATTG ATCCAGTTGA TTCTTTTTTA GCGCTTGGAT GTCGCTTCAT TTTAGGCTCA GGTATTCTTT ATGGAGGCTT ACTCAGTATT GGAGCTTTCT TATTACATCA AGAAAGGAGC GCTTGGATTG CACTCTCCAT TGCGGTCAGT TCTGTGTTTT TAATGAAGAA AACAAGACTA ATAACTCAAT AA
|
Protein sequence | MTLIDWFILF VYLIFSLVLG IYISLKNHNE ADYFVAGRRL NGLLAGMSMA ATTFSIDTPL YVAGVIGTRG LAGNWEWWSF GLAHVAMTVI FAPLWRRSGV LTDAAFTELR YGGQAAAYLR GIKAFLLSVP INCIGIGYAF LAMRKVAESL GVVDGHIVFG PFTDTILLMI LVALFLLVYT VLGGLWAVVV NDFVQLVLAL LGAFAVCFVA LDASGGMTNL LTKLEALDRP ELLSLFPWTL SKNGFNWLDS SGISITTFFA FISLQWWSFR RSDGGGEFIQ RLLATKDEKQ ATLAGIVFLI VNYLVRSWLW ILVGLAGLVL LPQQTDWELS YPNLAVTYLP PVGLGLVVVS LVAAFMSTVS TSINWGASYL AHDLYKRFVR PFAGPREMIF MGQLASILLL LIGVLTALFS NSIGSMFRLV IAIGTGPGAV LVLRWFWWRI NALVELSAML SGFFIGLITS VSPSLTIEDF GKRLIFTTSF SAVIWLLTLF FTEPESDETL NNFVMQVKPP GPGWSKIRKK LNIDPVDSFL ALGCRFILGS GILYGGLLSI GAFLLHQERS AWIALSIAVS SVFLMKKTRL ITQ
|
| |