Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_04931 |
Symbol | |
ID | 4778942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 487894 |
End bp | 488889 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640085997 |
Product | hypothetical protein |
Protein accession | YP_001016510 |
Protein GI | 124022203 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACG ACCTGCTGGT CTTAACGCTA TTGGTCGTTG TCGTTATCAC CGGTTCTGCC TTGTGCTCAG GTGTGGAAGC TGCACTACTC ACAGTGAATC CTGTGCAAGT GCATGAGCTG GCAGCACGGC CCCAACCAAT CGCTGGCGCT AGACGCTTAG CCCAACTACG TCAACGGCTG GGACGGACTC TGTCTGTACT GGTGATCGCC AACAACGGCT TCAACATTTT CGGCAGCCTG ATGCTTGGCG CCTTCGCGGC TTATGTCTTT GAGAAGCACA ACATCAATGA TGTGGCCTTG CCGTTGTTTT CGGTAGGTCT CACATTGCTC GTGATCGTGC TGGGCGAGAT CCTGCCGAAA TCCCTCGGCA GCCGCCTGGC CCTCTCGGTT TCCCTCTCCA GTGCCCCAGT GCTGCACCTG CTGGGCCTGC TGATGCGCCC TGTGGTGGTG TTGCTCGAGC GGTTATTGCC AGCAATTACC GCCGAGAACG AACTGAGCAC GAATGAGAAT GAAATCAGGC TGCTGGCAAG GCTGGGGTCG CAAAAAGGGC AGATCGAAGC CGATGAAGCT GCCATGATCG CCAAGGTGTT CCAACTCAAC GACCTCACAG CCCGCGACCT GATGATCCCC CGGGTGGCAG CTCCCACCCT CGATGCTGCT GCCAAACTTG AAACACTTCG CCCTGAACTG CTCACCCACA ACGCCGAGTG GTGGGTCGTG CTTGGCAAAG AAGTCGATAA AGTTCTTGGC GTGGCCAGCC GCGAAAGGCT GCTCACCGCC CTTCTGCAGG GTCAGGGTCA CCTCACCCCT GCAGACCTCA GTGAAACGGT GGAATTTGTA CCTGAAATGA TTCGCGCCGA TCGACTACTC ACTGGCTTCC GTCGTGACAA CAGCGGTGTG AGAGTGGTGG TTGATGAGTT CGGCGGTTTC GTGGGCGTGA TCGGAGCCGA AGCGGTCTTG GCCGTACTGG CCGGCTGGTG GAGGAAGTCA AGCTGA
|
Protein sequence | MSNDLLVLTL LVVVVITGSA LCSGVEAALL TVNPVQVHEL AARPQPIAGA RRLAQLRQRL GRTLSVLVIA NNGFNIFGSL MLGAFAAYVF EKHNINDVAL PLFSVGLTLL VIVLGEILPK SLGSRLALSV SLSSAPVLHL LGLLMRPVVV LLERLLPAIT AENELSTNEN EIRLLARLGS QKGQIEADEA AMIAKVFQLN DLTARDLMIP RVAAPTLDAA AKLETLRPEL LTHNAEWWVV LGKEVDKVLG VASRERLLTA LLQGQGHLTP ADLSETVEFV PEMIRADRLL TGFRRDNSGV RVVVDEFGGF VGVIGAEAVL AVLAGWWRKS S
|
| |