Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_02801 |
Symbol | |
ID | 4778667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 292866 |
End bp | 294101 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640085784 |
Product | hypothetical protein |
Protein accession | YP_001016300 |
Protein GI | 124021993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.484397 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGTC GATACCTCGA TAAACAGTGC CCAGCATGCG GCTACACGAT TGCAAAGATG GTCTTTGATG CTGGAGTTAA ACCACTAGCA ACAATTGCAT GGGCAGAGTC TGAAGAAGAA GCAAAGGACG TCAAGTCATT TAAACAAGAA TATATTCAGT GCCTAAACTG CTCCCATGTG TGGAATCATT TATTTGACTG GGAACATGTA CCTTATGGAA ACAAACCTAA CAAGATGTAC AACAATGGAT CACAATGGAA GAAGCATATT GAGTATTTGC GCGGGTGGTT ATCAGATCGA ATGCCTGCCA AACCCACAAT TGTAGACATT GGCTGTGGTG ATGGTAGTTT TCTCATCTGC ATGGCAAATC ACTATAAACA AAAAGGCAGA TTTCTTGGCT TCGATCCGAG TGGAGATGTT GATGCTCAGC AGTCAGAAAT CCACTTTGAT CGCACATTAT TCTCTCCTTT AAAAGACACC GCCAAACACA AGCCAGACCT CATTGTCATG AGACATGTGA TCGAACATCT CACTGCTCCG TCTTCATTTT TGCATTCGCT GGCATGGGGT GCATCTAGTT ACGAAAAGAC AACATATGTA TATTGTGAAG TCCCTTGCAT CGACCGTGTT TTTCAAACAA GTCGTTTAGC AGACTTTTAC TACGAGCACC CATCCCAATT CACAACCTTG TCTTTCACGA GGATGCTAAA AACAGCTGGT CAAATTATTG ATATTCAACA CTCCTATGAT GGAGAAGTGA TTTGCGGGCT AGTGGAACTA AAACCTTCTT CAGAACAGAC CAAAATAAGC AATGGTTCAG ATGCATATTT CTTTAAGACC TCAACCTCAA TCCATCAGAT TGAGCGACAA ATCGACAATC TTTTAGCGGC TCACAAGCTA ATTGCAATCT GGGGCGGGAC CGGCAAGTGT GCTGCCTTTA TGCATCATTA TAGTGTCTCT TGCGATGATA TATCTACTGT TGTCGACTCA GACGAACGCA AATGGGGGAC GTATGTTCCA GGTGTTGGGC AAGAAATCAA ACCACCATCT TATCTGTTGA ACAAGCTGAT TGATGTTCTG CTTATCCCAA CACAGTGGAG AGCTCAAGAC ATTCTCATAG AAGCGTATTC AATGGGCCTG ACTTTCAAGC AAGTGCTCAT CGAGCATAAT GGCAGACTTG TTGACTTCAG AGATGATGAG CATCCATACG CGAAAGATGA GCTCCAGCAA GAATAG
|
Protein sequence | MSGRYLDKQC PACGYTIAKM VFDAGVKPLA TIAWAESEEE AKDVKSFKQE YIQCLNCSHV WNHLFDWEHV PYGNKPNKMY NNGSQWKKHI EYLRGWLSDR MPAKPTIVDI GCGDGSFLIC MANHYKQKGR FLGFDPSGDV DAQQSEIHFD RTLFSPLKDT AKHKPDLIVM RHVIEHLTAP SSFLHSLAWG ASSYEKTTYV YCEVPCIDRV FQTSRLADFY YEHPSQFTTL SFTRMLKTAG QIIDIQHSYD GEVICGLVEL KPSSEQTKIS NGSDAYFFKT STSIHQIERQ IDNLLAAHKL IAIWGGTGKC AAFMHHYSVS CDDISTVVDS DERKWGTYVP GVGQEIKPPS YLLNKLIDVL LIPTQWRAQD ILIEAYSMGL TFKQVLIEHN GRLVDFRDDE HPYAKDELQQ E
|
| |