Gene Information |
Locus tag | P9303_18881 |
Symbol | |
ID | 4778658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1649443 |
End bp | 1650384 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640087397 |
Product | hypothetical protein |
Protein accession | YP_001017895 |
Protein GI | 124023588 |
COG category | [S] Function unknown |
COG ID | [COG4121] Uncharacterized conserved protein |
TIGRFAM ID | |
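
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length counts the stop codon while the protein length does not. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the record above.
start_bp = 1649443
end_bp = 1650384


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length (942 bp) and Protein Length (313 aa) fields. The strand field does not affect the length calculation, only the orientation in which the sequence is read.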
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.299649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCGGCT GCATCGCTGA GTCGCTTCCT CTCGGAACGC TGTCTGCTTA TGCAACAGCT GACGGCAGTT TCAGCCTGTA CAGCAGTCAT TTTAACGAGG CCTTTCATAG TTCAGCTGGA GCCCTCACAG AAGCCCATGC AAAGTTCGTT GACCCTGCAC AGTTGGAACG CTTCTCTGGC AGCAAACAAC TCAGAGTTCT AGATGTATGT GTGGGGCTTG GTTATAACTC AGCTGCCTTA ATTGATGCTC TGCAGGGAAA TTCAATCAGC CTGCAGTGGT TTGGATTGGA ACTTGACCAC AGACCATTGA CTATGGCGTT GCAACAGTCG AGCTTCAAGA CAATATGGTC TCCTCAAGTG TTGACGATCT TGGAAAGGAT TCGTGATTGC AATAGTTGGC GAGAGGGGAC CAGCAACGGC ACTCTGTTTT GGGGAGACGC CAGGCAAAAA CTAAATTGGC TCCCGAATGA TCTCAAGATC GATCTGATCC TCATGGATGC CTTCTCTCCA AGCTGCTGTC CACAGCTATG GAGCGATGAA TTTCTGACGG CCCTCGCTAG AAAACTGGCG CCAGGAGGAC GGCTGCTCAC TTACTGCCGG GCCGCTGCAG TCAGAGCAAG CCTGCGTAGA GCGGGCTTGC AATTGCGTTC ATTATTGACA GTTCCTGGGG AACGACAAGG TTGGAGCGCT GGCACTTTGG CCATCCTTCC TGATCAACAT GAAGCCACAT CCTCTCAAGG CCCAAATTGG CAACCTTTAA GTCTGATGGA GGAGGAACAT CTACTCACCC GCGCCGCTAT TCCCTATCGA GATCCCAGTG GCCAAGCCAC AGTCGAGGAG ATCATCAAGC GCAGAAGAGA AGAGCAACGA GAGTGCAAGA TGGAAAGCAC CAACGCATGG AAAAGACGGT GGTGCCGGAA CCGAACGGGC GAATGCCAGT AG
|
Protein sequence | MSGCIAESLP LGTLSAYATA DGSFSLYSSH FNEAFHSSAG ALTEAHAKFV DPAQLERFSG SKQLRVLDVC VGLGYNSAAL IDALQGNSIS LQWFGLELDH RPLTMALQQS SFKTIWSPQV LTILERIRDC NSWREGTSNG TLFWGDARQK LNWLPNDLKI DLILMDAFSP SCCPQLWSDE FLTALARKLA PGGRLLTYCR AAAVRASLRR AGLQLRSLLT VPGERQGWSA GTLAILPDQH EATSSQGPNW QPLSLMEEEH LLTRAAIPYR DPSGQATVEE IIKRRREEQR ECKMESTNAW KRRWCRNRTG ECQ
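
The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how such a field might be normalized before further analysis (for example, to recompute a GC fraction), the snippet below strips the grouping whitespace and counts G and C bases. The helpers `clean_sequence` and `gc_fraction` are hypothetical names, and the example input is only the first two blocks of the gene sequence, so its result is not expected to match the GC content reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the block-of-ten spacing and normalize to upper case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence field, used only as a short demo.
example = "TTGAGCGGCT GCATCGCTGA"
cleaned = clean_sequence(example)
print(len(cleaned), gc_fraction(cleaned))
```

Passing the full Gene sequence field through `clean_sequence` should yield a string whose length matches the Gene Length field, which is a quick way to detect truncated copies.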
|
| |