Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_10331 |
Symbol | |
ID | 4776506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 937749 |
End bp | 939197 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640086542 |
Product | hypothetical protein |
Protein accession | YP_001017047 |
Protein GI | 124022740 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAATT GGTGGGAAGA GCCTTATTTC AATACTCTTT GGGCTTTTGG AAGCAATCCT AACGATGATA ATGTTGGCGC CATAGATGCC TTCCAGCAAT GGGCTGGGCC GGCAGATGGA ATTACTGACC TCTCCATCGG TCAAAATATC ATTGCCATTA TCGATAGCGG TGTTCACTAT ACGCATGAAG ACCTTGCAGA CAATATCCTG ATAAACAAAG CTGAGATTCC TGATAATCAA ATTGATGATG ATCAAAATGG TTATGTCGAT GACTACTATG GATATAACTT TGTTGATAAC AATGGTAATC CATCTGACGA CTCAAAAAAT GGCCATGGAA CACATGTCGC GGGAATTGCT GCTGCTGCTG CCAATGATCT AGGAATTGTC GGCACAAACC CAGCGGCTAA AATTCTGCCA ATAAAGGTAC ATGACAAAAA CGTTGATGTA AAATATTCCA GCCTTATAGC TAGTATTAAT TATGCTGTAA TTCGAGGGGC AAAAGTAATA AATATGAGCC TTGGAGTTGC ACAACCATAT GAACCTTTAT ATGAAGCTAT CCAATTAGCA GAAGAGAATG GATGCTTATT TATTGTATCG GTAGGGAATG ATGATCGTGA TATTGATAAA CAAGACCCGA TGTACCCCGC ATCCTATAGA ATGGAAAGCG GCATAAAAGT AGCTGCATCA AATAAAAATG GCCAAAGAGT AATGGCGGGT GGTCCATGGT CCGATCCACC ATATAGCCCT AGATGGGGCT CAAACTATGG GAAGCAAAGT GTTGATCTTT TTGCACCAGG CATTGATATT TATAGCACTG TTAATACCTC AGATATTGCC TATGGTTATA TGTCAGGCAC CTCGATGGCA ACCCCACTAG TTGCTGGTAT TGCTAGTTCA TTTTGGGCAA GAAATTCTGA TCTTTCTGCA AGCGAAGTAA AAGCAAGAAT CCTCTCAAGT GTTGATGTAC CTGAAGAGGC CTTCGATGGA GATACAGTAA CAGGTGGTCG TATTAATATG GAGGGATTGA ATCAGGGCAT AATAGGTATT TCAAGCAAAA CATCAAGCCA CGAGTTCACT TCCACAACAA ACTCCTTTTC CCATGTGGAA AGATATCAAA AAGAAATGGA TATCACAAAC TGGGTCACAC CTGCAAACCT TAACTTCCAT GATTCAGATG ATCTCAAAGG GAAAACAGTC ATCGGCCTTC TATCTGATGA AATAAGAGAT AAAGAGAAGG TGGTCAATGA TCTTGCTAAA GATATCAAAA CTGGCCAAAA AGAGCTAAAG CACATTGATT TCTTTAAATC AATGGAAGCG CTTGAGCATT CTATATGTAC AATTAAATTA TCAGATCAAG ACGGGGCTCA GCCCAAAGAT GCCATTGAAA CATTATTCAA TGAATTCGGC TATACCAGGT TCTACTTTGA TAATGAAGTC ATAATTTAA
|
Protein sequence | MSNWWEEPYF NTLWAFGSNP NDDNVGAIDA FQQWAGPADG ITDLSIGQNI IAIIDSGVHY THEDLADNIL INKAEIPDNQ IDDDQNGYVD DYYGYNFVDN NGNPSDDSKN GHGTHVAGIA AAAANDLGIV GTNPAAKILP IKVHDKNVDV KYSSLIASIN YAVIRGAKVI NMSLGVAQPY EPLYEAIQLA EENGCLFIVS VGNDDRDIDK QDPMYPASYR MESGIKVAAS NKNGQRVMAG GPWSDPPYSP RWGSNYGKQS VDLFAPGIDI YSTVNTSDIA YGYMSGTSMA TPLVAGIASS FWARNSDLSA SEVKARILSS VDVPEEAFDG DTVTGGRINM EGLNQGIIGI SSKTSSHEFT STTNSFSHVE RYQKEMDITN WVTPANLNFH DSDDLKGKTV IGLLSDEIRD KEKVVNDLAK DIKTGQKELK HIDFFKSMEA LEHSICTIKL SDQDGAQPKD AIETLFNEFG YTRFYFDNEV II
|
| |