Gene Information |
Locus tag | P9211_17071 |
Symbol | |
ID | 5730068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1532858 |
End bp | 1534273 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641286089 |
Product | hypothetical protein |
Protein accession | YP_001551592 |
Protein GI | 159904248 |
COG category | [S] Function unknown |
COG ID | [COG4370] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03492] conserved hypothetical protein |
|
|
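The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(1532858, 1534273)
print(length)                  # 1416
print(protein_length(length))  # 471
```

Both values match the Gene Length (1416 bp) and Protein Length (471 aa) reported in the record.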
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAA GAGATCTAAT AATTGAGTTC TTGCAACTGA TTACAGGAAT CGGTCTCAAG AAAGGTAAGA AGGCGGCACC CAGATTCGAA CTGGGGATAA AGGATTTGCA ATCCTCTGCC TTACCACTTG GCCATGCCGC CGAAGGAAAT AAAGATTTTA CCTCTGGGGA TCGTATCAGT CAGACGACCA AAGATCTTTT AGTTCTTTCT AATGGACACG GTGAAGATCT TATAGCCCTT AGGATTTTAG AGGCTCTACA TCTCTTGGAA CCAAGCTTAA CCTTTGAGGT ACTCCCTTTG GTTGGAGAAG GTAAGGCTTT TGAAAAGGCA GTTTATGAAA AGTGGTTAAT CAAAATAGGA CCTTCTTTTC GCTTGCCTAG TGGAGGATTT AGTAATCAAA GCTTTTCAGG ATTGATTCGA GATATTTCTG CAGGTGTCTT TTGTTTTGCT TATAAGCATT GGCGGTATGT CAGACGATCT GCATTACATG GGAAAGTGAT TCTTGCAGTT GGGGATTTGT TGCCTTTGTT TTTTGCCTGG AGTGGTGGCG GTATGTATGG GTTTATTGGG ACTCCCAAAA GCGATTACAC ATGGACATCA TCTTCAGGGG CTTTGTTGAG TGATTATTAT CATCGCTGCA AAGGCTCAGA ATGGGACCCT TGGGAATGGG TTTTGATGAG ATCTTTAAGA TGTAAATTTG TAGGAGTTAG AGATAAGTTG ACTGCTAGAG GTTTACAGCG GAAGTCAATT AGGGCTTTTG CTCCAGGCAA TCCAATGATG GATGGTTTTC ATAAAGCTGA ATGTCCACAA GACTTATTAA TGTTTAGAAG ATTGTTATTG CTTTGTGGAA GTAGAATGCC AGAGGCATTG ATGAATTTTC GAAGATTAAT ATCTGCAGCT TTGCAAATTA AAAGTCCAAC ACCATTAGCA ATTTTGGTTA CTACTGGGGC AGACCCATCT CTACATGAGC TTGAGCTATG TTTAGAGAAA TTAGGCTTTT CGAAATTTTG TTTGCAAAAC AATTCATTAG GTGTAGATAC CTTTTGGCAG AAGGATCGAT TTAGGGTTTT TATTGGTATT GGAAAATTTC ATGAGTGGGC CACTTATGCT GAGATTGGCC TTGCAAATGC AGGCACTGCT ACTGAGCAAT TAGTAGGACT TGGTACTCCT TGTGTTTCAT TGCCAGGTAA AGGTCCACAA TTTAAAAAAT CATTTGCAAT GCGTCAGGCT CGCCTGCTAG GTGGAGCTGT CTTTCCTTGT AGAAATTCCA AACATTTAGC CGAATCAGTT GAGGTGCTGC TTCGCAATGA CTCATTTCGC GAACAGTTAT CTTTGCAAGG AGTAAAGAGA ATGGGCGCGC ATGGTGGAAG TGCAGCTTTA GCACAATTTG CTTTAGAATT ATTAGTAAGG AGTTAA
|
Protein sequence | MRKRDLIIEF LQLITGIGLK KGKKAAPRFE LGIKDLQSSA LPLGHAAEGN KDFTSGDRIS QTTKDLLVLS NGHGEDLIAL RILEALHLLE PSLTFEVLPL VGEGKAFEKA VYEKWLIKIG PSFRLPSGGF SNQSFSGLIR DISAGVFCFA YKHWRYVRRS ALHGKVILAV GDLLPLFFAW SGGGMYGFIG TPKSDYTWTS SSGALLSDYY HRCKGSEWDP WEWVLMRSLR CKFVGVRDKL TARGLQRKSI RAFAPGNPMM DGFHKAECPQ DLLMFRRLLL LCGSRMPEAL MNFRRLISAA LQIKSPTPLA ILVTTGADPS LHELELCLEK LGFSKFCLQN NSLGVDTFWQ KDRFRVFIGI GKFHEWATYA EIGLANAGTA TEQLVGLGTP CVSLPGKGPQ FKKSFAMRQA RLLGGAVFPC RNSKHLAESV EVLLRNDSFR EQLSLQGVKR MGAHGGSAAL AQFALELLVR S
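As a rough illustration of how the Sequence fields relate to each other, the sketch below strips the 10-base block spacing, computes GC content, and translates a coding sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA, although the gene lies on the minus strand), and it uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid
# assignments of translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAGAAAAA GAGATCTAAT AATTGAGTTC"
print(translate(first_blocks))  # MRKRDLIIEF
```

The translated prefix matches the first block of the protein sequence listed in the record; passing the full gene sequence to `gc_percent` and `translate` would be the way to compare against the reported GC content and protein sequence.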