Gene PMN2A_0067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_0067 
Symbol 
ID3605474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp621977 
End bp623362 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content36% 
IMG OID637686922 
Productputative sodium:solute symporter, ESS family 
Protein accessionYP_291262 
Protein GI72381907 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0786] Na+/glutamate symporter 
TIGRFAM ID[TIGR00210] sodium--glutamate symport carrier (gltS) 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTGT CGTTTTTAGG TTTGGAAGAC CTATATAAAA TTAATGCAAT CCCAACGTTA 
GTCCTCTCTT TGGGTTTGCT TGGACTGATA GGAATTCTTT TAACCCTTGG TAGAAGATTG
GATTCCGCGA TGAAGTTGGA AAGATTTGGT ATTCCCATAG CCCTTTTAAT TGGAGCTTTA
GGTTTTTTAA TCGGTCCTTA TGGACCTCTT TCTTTATTAC CAGAAAGGGT TCTGAATACT
TGGATGCAAT TACCAACTCC ATTGCTTACT TTAGTCTTTG CGACCTTAAT GCTAGGAAGA
CCTATTCCAA GAATTAGTGC TTTATGGAAA CCAGTTGCTT CGCAGGCATT GCTTGGACTT
TTATTAGGTT TTGGTCAATA TGTTGTTGGT GGGATAATTG TGCTGTCATT TCTGCTTCCT
TATTTAGGAG TAGATCCACT GATGGGATGC ATTATTGAAG TTGGTTTTGA AGGAGGACAT
GGAGCTGCGG CAATAATGGG AGAAAGTTTT ATGAAGTTAG GTTTTCCTGA GGGATTAGAT
CTGGGCTTTG CAATGGCAAC TGTAGGATTA CTTGCTTCTA CTTTGCTAGG GAGCGGTTTG
GTTGTCCTAG GTAGGTTTTT TGGATGGCTT GTAACTACTG AACAAGAGCT CCCAAATGAT
TTAAATGATA TTGAATTTGC AATCAAACCA ATTGAACAAC TTAAGTCGCT TTTATATAAT
TTTGCTCTAC TAGGATTAGC GGTATTGATT GGAATCTTTT TTCTTTATTG TTTAAGGCTA
TCTTCTACTT TTTCTAGTGA TATAAGTAAG CAGGTGATAT TAGCTTTCCC AGTATTTCCA
TTGGCTTTGA TGGGTTCATT TTTAGTTAGA TTTTTATTGG AAAAAACTGG AAAGACTAAA
TTAGTATCAT CACTTTTTCA ACGAGAGATT GGCATACTTT CAACCGATTT ACTCATAATT
ACCGCGATGG CAGGATTGAA TTTACCTTTA TTAGTTAACT ACTGGGTTCC AATAACCATT
TTAGCCGTTG GTGGATTGAT TTGGAATCTT GTAGGGATGT TGATTTTTTC TAGATTATTT
TTTAGAGAAG AATGGTTTGT AAGAGCAATA GCAGAGTTCG GAAATTCAAC AGGAGTTGCA
GCTAGTGGAC TATTACTTTT GAGATTGGCT GATCCAAGAA ATTCTACTAA TACGTTACCT
GTATTTTCTA TTAAGCAATT ATTTCTTCAA CCACTTCTTT CTGGAGGTCT GATTACTGTA
ATAGCGCCTT TGTTTATTAG TAATTTTGGG CTTAAAGGGT GGACAGAATT TTGTGGATTA
GTTTCATTGT CTTTATGTGT AATAGCAATA TCTCTACAGT CAAGATATAC AAAAGCCTCA
GCATGA
 
Protein sequence
MSLSFLGLED LYKINAIPTL VLSLGLLGLI GILLTLGRRL DSAMKLERFG IPIALLIGAL 
GFLIGPYGPL SLLPERVLNT WMQLPTPLLT LVFATLMLGR PIPRISALWK PVASQALLGL
LLGFGQYVVG GIIVLSFLLP YLGVDPLMGC IIEVGFEGGH GAAAIMGESF MKLGFPEGLD
LGFAMATVGL LASTLLGSGL VVLGRFFGWL VTTEQELPND LNDIEFAIKP IEQLKSLLYN
FALLGLAVLI GIFFLYCLRL SSTFSSDISK QVILAFPVFP LALMGSFLVR FLLEKTGKTK
LVSSLFQREI GILSTDLLII TAMAGLNLPL LVNYWVPITI LAVGGLIWNL VGMLIFSRLF
FREEWFVRAI AEFGNSTGVA ASGLLLLRLA DPRNSTNTLP VFSIKQLFLQ PLLSGGLITV
IAPLFISNFG LKGWTEFCGL VSLSLCVIAI SLQSRYTKAS A