Gene NATL1_18811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18811 
SymbolputP 
ID4780104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1537222 
End bp1538973 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content39% 
IMG OID640085170 
Productsodium/solute symporter family protein glucose transporter 
Protein accessionYP_001015701 
Protein GI124026586 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0257597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTAA TTGATTGGTT TATTTTATTT GTTTATTTGA TCTTTTCCTT GGTATTGGGA 
ATTTATATAT CTCTAAAAAA TCATAATGAA GCAGACTATT TTGTTGCTGG TCGTCGTCTT
AATGGATTGC TTGCAGGTAT GTCTATGGCT GCAACAACGT TTTCGATTGA CACACCTCTT
TATGTGGCCG GAGTTATTGG GACTCGAGGC TTGGCAGGAA ACTGGGAGTG GTGGAGTTTT
GGCCTTGCTC ATGTTGCTAT GACGGTTATA TTCGCTCCAC TGTGGAGGCG TAGTGGAGTA
TTGACTGACG CAGCCTTTAC TGAATTACGT TATGGGGGGC AAGCAGCAGC TTACTTAAGA
GGTATAAAAG CTTTTTTACT CTCAGTTCCA ATCAATTGTA TTGGGATTGG TTATGCTTTC
TTAGCAATGC GTAAAGTGGC TGAATCATTA GGTGTGGTAG ATGGACATAT TGTTTTCGGA
CCCTTTACTG ACACGATACT TTTAATGATT TTAGTTGCTT TGTTTTTGCT TGTTTACACT
GTACTAGGCG GGCTTTGGGC AGTAGTCGTT AATGATTTTG TACAACTTGT ATTAGCTCTT
TTAGGCGCAT TTGCTGTTTG CTTTGTGGCA CTTGATGCTT CAGGTGGTAT GACGAATTTG
TTAACAAAGT TAGAGGCATT AGATCGACCT GAGCTGCTTT CTTTATTTCC ATGGACATTA
AGTAAAAATG GTTTCAATTG GTTGGATAGC TCAGGAATAA GTATTACTAC TTTCTTCGCT
TTTATTTCTT TGCAGTGGTG GAGTTTCAGA CGTAGTGATG GTGGTGGGGA ATTTATTCAA
CGTTTATTGG CTACAAAAGA TGAAAAACAA GCAACTTTGG CAGGAATAGT TTTTTTAATT
GTTAATTATC TTGTTCGCAG TTGGCTTTGG ATTTTAGTTG GCTTAGCTGG TTTAGTCCTC
TTGCCTCAAC AAACTGACTG GGAATTAAGT TATCCAAACC TTGCTGTTAC ATACTTACCA
CCCGTTGGAC TTGGATTAGT CGTGGTTTCA TTGGTCGCTG CATTTATGAG CACCGTAAGC
ACCTCAATCA ATTGGGGGGC AAGTTATCTT GCACATGATC TTTATAAAAG ATTTGTGAGG
CCATTTGCTG GGCCTAGAGA AATGATTTTT ATGGGCCAGC TAGCAAGTAT TTTGTTGCTT
CTAATTGGAG TCCTCACGGC TTTATTTAGT AACAGTATTG GTTCAATGTT TAGACTTGTT
ATTGCGATAG GTACAGGACC AGGTGCCGTT CTTGTACTTA GATGGTTTTG GTGGCGAATT
AATGCTCTAG TAGAACTTTC GGCAATGTTA AGTGGCTTTT TTATAGGATT AATCACTTCC
GTTTCTCCTT CGTTAACAAT TGAAGATTTC GGAAAAAGAC TTATATTCAC AACTTCATTC
TCGGCAGTGA TTTGGTTGCT TACTTTATTT TTTACTGAGC CAGAGTCTGA CGAGACACTC
AATAACTTTG TAATGCAGGT AAAACCTCCA GGCCCTGGCT GGAGCAAAAT ACGGAAGAAA
TTGAATATTG ATCCAGTTGA TTCTTTTTTA GCGCTTGGAT GTCGCTTCAT TTTAGGCTCA
GGTATTCTTT ATGGAGGCTT ACTCAGTATT GGAGCTTTCT TATTACATCA AGAAAGGAGC
GCTTGGATTG CACTCTCCAT TGCGGTCAGT TCTGTGTTTT TAATGAAGAA AACAAGACTA
ATAACTCAAT AA
 
Protein sequence
MTLIDWFILF VYLIFSLVLG IYISLKNHNE ADYFVAGRRL NGLLAGMSMA ATTFSIDTPL 
YVAGVIGTRG LAGNWEWWSF GLAHVAMTVI FAPLWRRSGV LTDAAFTELR YGGQAAAYLR
GIKAFLLSVP INCIGIGYAF LAMRKVAESL GVVDGHIVFG PFTDTILLMI LVALFLLVYT
VLGGLWAVVV NDFVQLVLAL LGAFAVCFVA LDASGGMTNL LTKLEALDRP ELLSLFPWTL
SKNGFNWLDS SGISITTFFA FISLQWWSFR RSDGGGEFIQ RLLATKDEKQ ATLAGIVFLI
VNYLVRSWLW ILVGLAGLVL LPQQTDWELS YPNLAVTYLP PVGLGLVVVS LVAAFMSTVS
TSINWGASYL AHDLYKRFVR PFAGPREMIF MGQLASILLL LIGVLTALFS NSIGSMFRLV
IAIGTGPGAV LVLRWFWWRI NALVELSAML SGFFIGLITS VSPSLTIEDF GKRLIFTTSF
SAVIWLLTLF FTEPESDETL NNFVMQVKPP GPGWSKIRKK LNIDPVDSFL ALGCRFILGS
GILYGGLLSI GAFLLHQERS AWIALSIAVS SVFLMKKTRL ITQ