Gene P9303_03801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_03801 
Symbol 
ID4776237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp384158 
End bp385543 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content56% 
IMG OID640085883 
Productputative sodium:solute symporter, ESS family 
Protein accessionYP_001016397 
Protein GI124022090 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0786] Na+/glutamate symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.489029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAACG GCTTGCAAAG CCTGCTCCAT GCCACCCAGC CAACCAATTT GTGGCTGGCC 
CTAAGTCTGC TGACTGTGCT GGCCTTGTTG CTGAGCTTGG GGCGAAGGCT CGAGCCTGCC
CTGCGACTGG AGCGGCTAGG CCTGCCTATT GCCCTGTTGG CCGGCACAGC AGGCCTGCTG
CTGGGGCCCT ATGGCCCCCT ACCCCTGCTG CCACTGAGCG TAACGGACAT CTGGACTGAA
GTTCCAACAG CGTTGCTTAC CCTGGTCTTC GCCACCTTGA TGCTCGGGCG CCCCCTCCCG
AAAGGTCAAG GCTTATGGGA GCCTGTGGCA TCCCAGGCCA TGCTGGGAAT GATGCTGGGT
TTTGGTCAAT ACCTAGTAGG GGGACTAGCA GTCCTACTGG TGTTGATTCC CTTGCTGGGG
GTTGATCCGC TCATGGGCTG CCTGATTGAA GTGGGTTTTG AGGGCGGCCA TGGGGCAGCG
GCCGTTATGG GTGAAAGCTT TCGCGAACTC GGCTTTCCCG GTGGCCAAGA TCTTGGCCTA
GCAATGGCAA CAGTGGGCCT GCTCACATCA ACCCTTCTGG GCAGTGTCCT GGTGATTTTC
GGCCGTTGGC GTGGATGGGT TGCCCCCCAT GGCCCCACAG AAATCGGAGA CACAGGAGCT
GTTGAAGAAG AAACGAGTTT TGGACAACAA CTTCGCCTGC TGGCCGTAAA TCTTGGCCTG
GCTGGAGCCG CTGTTGCTTG CGGCGTGTTG ATGCTCGAAG GCTTGCGATT ATTGGGCCCT
TGGCTGGGGG AGTTCTACCG GCAAGTCATT CACGTCTTCC CAGTCTTTCC TCTTGCACTA
GGGGGTTCAC TACTGATCCG ACTAGCCCTG GAAGTCTCGG GCCAAACTCA ATGGGTATCT
CAGTTATTGC AACGTGAAAT CGGCATCCTG GCCACCGACC TGCTGATCAC CACTGCAATG
GCAAGCCTGA ATCTGCCGCT GCTGCAACAC GACTGGCTTC CGCTAACTGT GTTGTCTGTA
ACAGGGCTGG CCTGGAATCT ATTGATCATG CTCTTTGTGG CCAGGTTCAC ACTGCGCGAG
GAATGGTTCG AGCGCTCGAT CACCGAGTTC GGACAGGCCA CTGGAGTCGC TGCGAGTGGC
CTTTTGCTGC TTCGACTGGC CGATCCCCGC AACCTTACAA AAGCCTTACC TGTGTTTTCG
ATCAAACAAT TGATCCTGCA ACCCATTCTC TCTGGTGGGG TGATCACCGT TGTAGCCCCT
CTAGCAGTGA CTCGGTTGGG ACTGCTTGGC TGGACAGAAC TTTGCGGCAT TCTTACTGTT
ATATGTATTG GATTAGCGGT GATCATCAAC ATCACCTCAT CATCAGAATC AACAGAAGCT
GCGTAA
 
Protein sequence
MVNGLQSLLH ATQPTNLWLA LSLLTVLALL LSLGRRLEPA LRLERLGLPI ALLAGTAGLL 
LGPYGPLPLL PLSVTDIWTE VPTALLTLVF ATLMLGRPLP KGQGLWEPVA SQAMLGMMLG
FGQYLVGGLA VLLVLIPLLG VDPLMGCLIE VGFEGGHGAA AVMGESFREL GFPGGQDLGL
AMATVGLLTS TLLGSVLVIF GRWRGWVAPH GPTEIGDTGA VEEETSFGQQ LRLLAVNLGL
AGAAVACGVL MLEGLRLLGP WLGEFYRQVI HVFPVFPLAL GGSLLIRLAL EVSGQTQWVS
QLLQREIGIL ATDLLITTAM ASLNLPLLQH DWLPLTVLSV TGLAWNLLIM LFVARFTLRE
EWFERSITEF GQATGVAASG LLLLRLADPR NLTKALPVFS IKQLILQPIL SGGVITVVAP
LAVTRLGLLG WTELCGILTV ICIGLAVIIN ITSSSESTEA A