Gene Information |
Locus tag | P9303_03801 |
Symbol | |
ID | 4776237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 384158 |
End bp | 385543 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640085883 |
Product | putative sodium:solute symporter, ESS family |
Protein accession | YP_001016397 |
Protein GI | 124022090 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0786] Na+/glutamate symporter |
TIGRFAM ID | |
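
The length fields in this record follow from the coordinates. Below is a minimal Python sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(384158, 385543)
print(length, protein_length(length))
```

With the values in this record, the result matches the stated Gene Length (1386 bp) and Protein Length (461 aa).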
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.489029 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAACG GCTTGCAAAG CCTGCTCCAT GCCACCCAGC CAACCAATTT GTGGCTGGCC CTAAGTCTGC TGACTGTGCT GGCCTTGTTG CTGAGCTTGG GGCGAAGGCT CGAGCCTGCC CTGCGACTGG AGCGGCTAGG CCTGCCTATT GCCCTGTTGG CCGGCACAGC AGGCCTGCTG CTGGGGCCCT ATGGCCCCCT ACCCCTGCTG CCACTGAGCG TAACGGACAT CTGGACTGAA GTTCCAACAG CGTTGCTTAC CCTGGTCTTC GCCACCTTGA TGCTCGGGCG CCCCCTCCCG AAAGGTCAAG GCTTATGGGA GCCTGTGGCA TCCCAGGCCA TGCTGGGAAT GATGCTGGGT TTTGGTCAAT ACCTAGTAGG GGGACTAGCA GTCCTACTGG TGTTGATTCC CTTGCTGGGG GTTGATCCGC TCATGGGCTG CCTGATTGAA GTGGGTTTTG AGGGCGGCCA TGGGGCAGCG GCCGTTATGG GTGAAAGCTT TCGCGAACTC GGCTTTCCCG GTGGCCAAGA TCTTGGCCTA GCAATGGCAA CAGTGGGCCT GCTCACATCA ACCCTTCTGG GCAGTGTCCT GGTGATTTTC GGCCGTTGGC GTGGATGGGT TGCCCCCCAT GGCCCCACAG AAATCGGAGA CACAGGAGCT GTTGAAGAAG AAACGAGTTT TGGACAACAA CTTCGCCTGC TGGCCGTAAA TCTTGGCCTG GCTGGAGCCG CTGTTGCTTG CGGCGTGTTG ATGCTCGAAG GCTTGCGATT ATTGGGCCCT TGGCTGGGGG AGTTCTACCG GCAAGTCATT CACGTCTTCC CAGTCTTTCC TCTTGCACTA GGGGGTTCAC TACTGATCCG ACTAGCCCTG GAAGTCTCGG GCCAAACTCA ATGGGTATCT CAGTTATTGC AACGTGAAAT CGGCATCCTG GCCACCGACC TGCTGATCAC CACTGCAATG GCAAGCCTGA ATCTGCCGCT GCTGCAACAC GACTGGCTTC CGCTAACTGT GTTGTCTGTA ACAGGGCTGG CCTGGAATCT ATTGATCATG CTCTTTGTGG CCAGGTTCAC ACTGCGCGAG GAATGGTTCG AGCGCTCGAT CACCGAGTTC GGACAGGCCA CTGGAGTCGC TGCGAGTGGC CTTTTGCTGC TTCGACTGGC CGATCCCCGC AACCTTACAA AAGCCTTACC TGTGTTTTCG ATCAAACAAT TGATCCTGCA ACCCATTCTC TCTGGTGGGG TGATCACCGT TGTAGCCCCT CTAGCAGTGA CTCGGTTGGG ACTGCTTGGC TGGACAGAAC TTTGCGGCAT TCTTACTGTT ATATGTATTG GATTAGCGGT GATCATCAAC ATCACCTCAT CATCAGAATC AACAGAAGCT GCGTAA
|
Protein sequence | MVNGLQSLLH ATQPTNLWLA LSLLTVLALL LSLGRRLEPA LRLERLGLPI ALLAGTAGLL LGPYGPLPLL PLSVTDIWTE VPTALLTLVF ATLMLGRPLP KGQGLWEPVA SQAMLGMMLG FGQYLVGGLA VLLVLIPLLG VDPLMGCLIE VGFEGGHGAA AVMGESFREL GFPGGQDLGL AMATVGLLTS TLLGSVLVIF GRWRGWVAPH GPTEIGDTGA VEEETSFGQQ LRLLAVNLGL AGAAVACGVL MLEGLRLLGP WLGEFYRQVI HVFPVFPLAL GGSLLIRLAL EVSGQTQWVS QLLQREIGIL ATDLLITTAM ASLNLPLLQH DWLPLTVLSV TGLAWNLLIM LFVARFTLRE EWFERSITEF GQATGVAASG LLLLRLADPR NLTKALPVFS IKQLILQPIL SGGVITVVAP LAVTRLGLLG WTELCGILTV ICIGLAVIIN ITSSSESTEA A
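
The gene sequence is listed in the coding orientation (it begins with ATG and ends with TAA), so it can be translated directly. Below is a minimal Python sketch for checking the protein sequence and GC content against the gene sequence. It assumes the space-separated 10-base blocks are purely cosmetic. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so a standard codon table is used here. The names `translate` and `gc_content` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the block-separating whitespace and normalise case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence above.
print(translate("ATGGTTAACG GCTTGCAAAG CCTGCTCCAT"))
```

The three blocks used here translate to MVNGLQSLLH, the first ten residues of the protein sequence in this record. Passing the full gene sequence to the same helpers allows the complete protein and the GC content field to be checked.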