Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_24571 |
Symbol | |
ID | 4776123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2158503 |
End bp | 2159708 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640087977 |
Product | hypothetical protein |
Protein accession | YP_001018453 |
Protein GI | 124024146 |
COG category | [S] Function unknown |
COG ID | [COG4372] Uncharacterized protein conserved in bacteria with the myosin-like domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGGCT GGCTGCTGAT CCTCTCCCTA CTCGTCCTCG GTGGTGTGCT CTCCACCCTC GGCGATCGAC TTGGCAGCCG TGTAGGCAAA GCAAGGCTGA GCCTGTTCAA TCTCAGACCG CGAAGGACAG CCGTATTCAT CACCGTTCTC ACTGGCAGCC TGATTAGCGC CCTGTCCCTA GGACTGATGC TGCTGGTCAG TCGACAACTG CGAGTAGGAC TTTTTGAACT GGATGACCTT CAGGCCAAAC TCCAACAAAG CCGCGATGCC CTCAATTCAA GTCGGTCGGC CCAGTTCAAT GCCGAGCTTG ATCTCAAGCA GGCCAGGTCG GACAGCAATC AGGTTCAGAA TGAACTGAAG GAAGCCAAAA AGCGAGCTGC CGCCCTTCGC AACGAACTCG CACCACTACA AAAACAAAGA CAACAGCTCG AGGCGGAACG AGCTCGTCTC AGTCGCGACA TTTCCAAAAA AGATGCCGAT ATCCGCCGAA CGGAAGTTGA ACTAGCCAAT GTTCGCAGCA GAATTAGCTC GGCTGAAAAA GAACTGAAGC AACTCGAGAC CAACCTGATT GCCTTACGTC GAGGGGATGT TGTGCTGAGC AGTGGGCAGC AACTTGCCGC AGCCACCCTG CGGCTCGACA ACCCCAGCCA GGCGAAAGCC GTCATCGACC GCCTACTTCA GGAAGCAAAC CTGGAGGCTT TTCGACGCGT ACGTCCCGGT GAAGAAGCCA ATCGACAGAT CCTGCTCGTG CCACGCACCG ATATCAACCG CATCGAACAG ATCATCCGAA AGCCAGGCAC CTGGGTCGTC TATGTGCGCT CTGCCGCAAA CGTGTTGCGT GGGGAGAACG TGGTGTATGC CTTCCCGGAT GCTCGCCAAA ATATCAACAT CGTCCGACAA GGCGAAGTCC TCGCACGAAC GACTCTTGAC CAAAACGAGA AGAGCAGCGA AACCGTGCGC AACCGACTTA GCCTCCTGCT TGCATCAACT CTGGCAGAGG TAAAAAGACG CGGATCCCTC AGTTCAGGAC TGCAGTTCGA TGGCAGTGAA ATGAACCGGC TCGGCAAGGC ATTACTGAAC CGTTCTCAAG AGCGGATTGA GCTAGAAGCC GTGGCACTCC GCAACAGCGA TACAGCCGAT CCGGTAGCAG TTGTCCTGCA GCCCGTGGGT GGTCCTTGGA CAAAGGTTCC CGAAGACAAA CCATGA
|
Protein sequence | MTGWLLILSL LVLGGVLSTL GDRLGSRVGK ARLSLFNLRP RRTAVFITVL TGSLISALSL GLMLLVSRQL RVGLFELDDL QAKLQQSRDA LNSSRSAQFN AELDLKQARS DSNQVQNELK EAKKRAAALR NELAPLQKQR QQLEAERARL SRDISKKDAD IRRTEVELAN VRSRISSAEK ELKQLETNLI ALRRGDVVLS SGQQLAAATL RLDNPSQAKA VIDRLLQEAN LEAFRRVRPG EEANRQILLV PRTDINRIEQ IIRKPGTWVV YVRSAANVLR GENVVYAFPD ARQNINIVRQ GEVLARTTLD QNEKSSETVR NRLSLLLAST LAEVKRRGSL SSGLQFDGSE MNRLGKALLN RSQERIELEA VALRNSDTAD PVAVVLQPVG GPWTKVPEDK P
|
| |