Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_15321 |
Symbol | |
ID | 4777983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1335564 |
End bp | 1336667 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640087041 |
Product | hypothetical protein |
Protein accession | YP_001017541 |
Protein GI | 124023234 |
COG category | [S] Function unknown |
COG ID | [COG2327] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03609] polysaccharide pyruvyl transferase CsaB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.29569 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGTCGCCG TTAATCTTTT GTGCAGCTAT TGCTTGGCTA TGGCAGTTCA GGCAATGCAA TGCAAGGCAG TGCTGTTATG CGGCTACTAC GGCGAGCACA ACCTTGGCGA TGATGCTCTT TTGCAGGTTT TACTTCAGGA GCTTCCCTCA CATTTGCAGC CTTGGATCAC GGCAAATGAT TCAAATGTGA TTAAGGCCTT AGCTCCGAAG GCTCAAGTCA TTAACCGACG CTCATTGCTG GAGACCATTA GGGCACTTTT TCAAGTTCAG GGTTTGATTC TTGGTGGCGG CAGCTTGCTG CAGGACAGCA CGAGTTTTAA GAGCCTGATC TACTACTTGA TCTTGATTGT GATAGCCCAA CAACTTGGTG TTCCTGTTGT GCTTTGGGGC CAGGGCCTGG GGCCATTTCG TCATCACCTG AGTCGATGGA TGGTTAGGAG AGTCCTACGT AGGGTGCAGG CGATCAGCTG GAGAGATCCA GAGTCTTTTC AGCTTGCTCA GCGATGGTGC CTACCCTCTC CGATGCTGAT GGCACCAGAT CCAGTTTGGC AACTTCCCTC TAGGCGCTGG CAGGGTGGCC AAGCAGTTGT GCTTTGTTGG CGTCCCACGT CAATGCTTGA TCACTTTGGC TGGCAGCATT TGCTCCAAGC CCTTGAGATA ATCCTTAAAG ATATTGATGC TCCAGTTCAT TGGCTTGCTT TTCATCAACG TCAGGATGAA CAGCTATTCC GACAACTTGA TAGACAAGGA CTGATCAGTT CCAGCTTGAG ATCAAGAAGT CATTTCTTTG CATTTGATTC ACTAACAGAT GTTATGAATC AGTTCTCGAT GGCCAGATTG GTGCTCCCCA TGCGACTGCA TGCACTCATC CTGGCCCAAT TGGCTGGTAG TCCCACAGTG GCACTCAGCT ACGACCCGAA GGTGTCATCT GCAGCTGTGA TGGCAAATGT GCCCTTTACT GATTTGCAAT CGTTGCCTGA TATTCGTTTG CTTGCTGGAC TATGGAGGCA GGCCTTAGAT GTTTCTCCAG ACCTGGGAAA GATCGAAGCA ATCCGCAAAC AAGCTTCTCA ACACAGCAAG ATATTAGACA AGGCCTTTGA TTAA
|
Protein sequence | MVAVNLLCSY CLAMAVQAMQ CKAVLLCGYY GEHNLGDDAL LQVLLQELPS HLQPWITAND SNVIKALAPK AQVINRRSLL ETIRALFQVQ GLILGGGSLL QDSTSFKSLI YYLILIVIAQ QLGVPVVLWG QGLGPFRHHL SRWMVRRVLR RVQAISWRDP ESFQLAQRWC LPSPMLMAPD PVWQLPSRRW QGGQAVVLCW RPTSMLDHFG WQHLLQALEI ILKDIDAPVH WLAFHQRQDE QLFRQLDRQG LISSSLRSRS HFFAFDSLTD VMNQFSMARL VLPMRLHALI LAQLAGSPTV ALSYDPKVSS AAVMANVPFT DLQSLPDIRL LAGLWRQALD VSPDLGKIEA IRKQASQHSK ILDKAFD
|
| |