Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_10831 |
Symbol | |
ID | 4779040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 968423 |
End bp | 969967 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640086592 |
Product | hypothetical protein |
Protein accession | YP_001017097 |
Protein GI | 124022790 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACTGC GCAAGGAACT AAACCTCACC TCCCTAACCA TGGCAGTGGT GACAGGCACC ATCGGTTCTG GTTGGCTTTT CGCTCCCTAC TTTGCTGCAC AACTGGCTGG AGCAGGCAGC CTGCTGGCAT GGCTGCTGGG CGGTTTTTTA GCCTTACTGC TGGCCTTGGT GTTTGCAGAA CTCGGATCGC TTGTCCCCAA CTCGGGCGCA CTGGCGCAAA TCCCTCTGCT GACTCATGGG CGACTGTCAG GATTCATCGG CGGATGGAGC GTATGGCTTT CTTACGTCAC CATACCGACC ATAGAACTAC TCGCCCTACT GCAATATTTA TCAAGCAGCC TCCCTTGGCT TACGCACGTC CAAGGCAACC GTCAGCTACT CAGTCCCGCG GGTCAGATTG TCGCCGTGAT TCTGCTGGTC TTGCTTTGCT GGATCAACCT GCTTGGAGTG CAAACCCTTT CGCGCTGGAT CAACCTGCTC ACAGCCTGGA AACTGATCGT TCCAGTTTTG GTGTCGATTG TGCTCATGGT TATCAGCAGT CACTGGAGCA ACCTTGCGGT ACCTGTTGGC GGTGATGGTG CTGATGTAGT ACGTGCTGTA GGTAGTGGAG GGATCTTATT CAGCCTACTG GGATTCCGTA CTGCGATGGA TCTTGCTGGC GAAGCACGTA AGCCGGCTAG GGACGTCCCT CTTGCAATGG CCACAGGCCT AGGCATCTGC CTACTGCTCT ATATCACCCT ACAGCTCAGT TTTCTAATCA GCGTGCCACC CACCGAGCTT GGCAACGGTT GGCATGGCCT AATGCTCAGC GCCCATGGCG GGCCGGTGGT GGCTCTTGCA ATGGGTTTCG GCCTTGGATG GATGGTGATT ATTCTTCTGG TGGATGCATT GGTCTCGCCC GGGGCCACAG CTCTTAATTA CATGGGTGTC TCTGCCCGGA TCATCTGGAT GATGGGGAAG TGTGGGCTCT TGCCTAAAGC TCTCGGACGG CTCAATCATC AGGACGTCCC TCATGTAGCC ATAACGCTGA GCATGGTTGT TAGTGCACTG ATGCTCGCGA TTGGACCAGG GTGGCAGACA GTCGTCAACT TCTTAACCAC AACTTTGATT ATCGCCCTAG CAACCGGACC TGTGAGCTTG CTGGCCCTGC GCCGGCAGAT GCCTGATGCG CATCGAGGGT ACCGGCTACC AATGGCGGAT TGGATTTGCC GTCTTGCGTT CGTAACGGCT ACATGGTCAA TCAGCTGGTG CGGGCGAACT GCTCTAGAGG GTTCCGTTGT CTGCATCGCT ATCCCCACAT TAATCTTCGC TGCGGGTCGC TGTTGGCAAG AGAATGGAAT GGAAGTACGT CCAGCACTTT GGTGGGCGCT CTATCTCGGT CTTTTGGTAG GCGATCTGCA ACTTTTCAGT GAAGGGCAGC CTTTGGCACT CCCAACACCT GCAAATATGG CTGTTTTGGC GGTGATGGCA TTGATCGTTC TACCTATAGC GGTTGGAAGC GCCCTACCGG AAAAATCACC TCACGCTTTA CTTGGAACTG AATAA
|
Protein sequence | MGLRKELNLT SLTMAVVTGT IGSGWLFAPY FAAQLAGAGS LLAWLLGGFL ALLLALVFAE LGSLVPNSGA LAQIPLLTHG RLSGFIGGWS VWLSYVTIPT IELLALLQYL SSSLPWLTHV QGNRQLLSPA GQIVAVILLV LLCWINLLGV QTLSRWINLL TAWKLIVPVL VSIVLMVISS HWSNLAVPVG GDGADVVRAV GSGGILFSLL GFRTAMDLAG EARKPARDVP LAMATGLGIC LLLYITLQLS FLISVPPTEL GNGWHGLMLS AHGGPVVALA MGFGLGWMVI ILLVDALVSP GATALNYMGV SARIIWMMGK CGLLPKALGR LNHQDVPHVA ITLSMVVSAL MLAIGPGWQT VVNFLTTTLI IALATGPVSL LALRRQMPDA HRGYRLPMAD WICRLAFVTA TWSISWCGRT ALEGSVVCIA IPTLIFAAGR CWQENGMEVR PALWWALYLG LLVGDLQLFS EGQPLALPTP ANMAVLAVMA LIVLPIAVGS ALPEKSPHAL LGTE
|
| |