Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_18521 |
Symbol | |
ID | 5730222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1685236 |
End bp | 1687086 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 641286239 |
Product | hypothetical protein |
Protein accession | YP_001551737 |
Protein GI | 159904393 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0520951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0416558 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAG AACTAACAGA TTTCATAGAA GCAGAAGGAC TGCAACAGTA CGATCCAGAA GCAATTTCTG CAATTTATAA AAAACACCCA TTTAGATTAA TCAAAAGATT ATGGGAAACT TTGATACCTA TCAGCTTGTT TTTATTAGGT GTTGGCTGGG AAAAACTAAT AGGACTATTA AAAAATGAAG AAAAAGCTCG AAAAAGAGCA AAAGAATTTA CTGATCTATT GGTAGACCTA GGGCCAGCTT TTATTAAAGC AGGGCAAGCC CTCTCAACAA GACCGGATAT TGTTCCGCGA GTAGTTTTAG AAGAACTAGC ACAGCTTCAG GACCAACTAC CTGGTTTTGA ATCAAATTTG GCTATGGCAT GTATTGAAGA AGACCTAGGA ATTAAAAAAG AAGAAATTTT TTCTAATATT GAAAAAGAAC CTATTTCTGC AGCTTCGTTG GGGCAAGTAC ATAAAGGTAC TTTACTAAAT GGAGATAAAG TAGCTGTTAA AGTTCAAAGA CCAGGCTTAA GAGAGCAAAT AACTTTAGAT TTATACATAG TAAGAAGTAT AGCTATATGG CTAAAAAACA ATATTAAATT AATTAGAAGT GATTTAGTTG CATTAATTGA TGAGCTAGGA AGAAGAGTAT TTGAAGAAAT GGATTACATC AATGAAGCAG AAAATGCACT AAAATTTAGA AAATTACATT CTCATAATAG AAATATTGCT GTACCAAAAA TATACAAAGA GATAACAAGT AAAAGAATCT TAACAATGGA ATGGATAGAT GGGGTGAAAC TAACGGAACT AGCGGCTGTT AAAAAACTAG GAATAGATCC TGATAGAATG ATTGAAATAG GAGTTAACTG TAGTCTTCAG CAGCTTTTAG AGCATGGTTT CTTTCATGCT GACCCTCACC CAGGTAATTT ACTAGCCTTA GAAGATGGGC GATTATGTTA TCTAGATTTT GGTATGATGA GCGATGTCAC TAGAAAATCT AGAACAGGCC TTATACAAGC TGTTGTACAT TTAGTAAATA AGAATTTTGA CAAATTATCT AAAGATTTTG TAGAGCTTGG ATTCTTATCT GAAGAAGTAG ATTTAGAGCC AATAGTGCCA GCATTTGAAA GTGTATTTAG TAATGCATTA GAAATGGGTG TAAATAAAAT GGATTTCAAA AGTGTTACAG ATGACTTGTC AGGTGTTATG TATAAATTTC CTTTTCAACT GCCGCCTTAT TACGCATTAA TCATTAGATC ATTAATTACG CTTGAAGGAA TAGCATTAAG TGTTGATGGT GATTTTAAGA TACTTGGAGC AGCATACCCA TACTTTGCTC GTAGACTTAT GGAAGATCCA GATCCTCAAC TTAGAAAAAG TTTAAAAGAA ATGCTTTTTG ATGGAAATAC TTTCAGATGG AACAGATTAG ATGAATTAAT TTCTAGTGCT TCAAAACAAG CTGATATAGA TATTGAAAAC TTATTAGATA AAGTTTTAGA TTTTCTATTT TCTGAGAAGG GGGGAGTCTT AAGGAATGAA CTTGTAGAGG CTATTATTAA TAAGTTTGAT GCTATTACAT GGAATACTGT TCAAAGCATA AATAAAAAAC TTCCTATACA AATCAGATCA AATTCTATAA ATAGTAATCA AAGTGATTTC ATGATAGAAA TAGAGCCTAT CAAAAAATTA GTTAGTATAT TAGAAAGCTT ACCAGGTTTT AATAGAGAAA TAATTATAAA GAAAGTTCCA AGGATACTAA AAGAAAAAGA TACAAGAAAA ATGGGAATTA AAATTGCAAA AGGAATAACT GAAAAAAGCA TGGTCAGAAT GATTAAATTA GCTGCAGGAG TTAATCAATA A
|
Protein sequence | MNKELTDFIE AEGLQQYDPE AISAIYKKHP FRLIKRLWET LIPISLFLLG VGWEKLIGLL KNEEKARKRA KEFTDLLVDL GPAFIKAGQA LSTRPDIVPR VVLEELAQLQ DQLPGFESNL AMACIEEDLG IKKEEIFSNI EKEPISAASL GQVHKGTLLN GDKVAVKVQR PGLREQITLD LYIVRSIAIW LKNNIKLIRS DLVALIDELG RRVFEEMDYI NEAENALKFR KLHSHNRNIA VPKIYKEITS KRILTMEWID GVKLTELAAV KKLGIDPDRM IEIGVNCSLQ QLLEHGFFHA DPHPGNLLAL EDGRLCYLDF GMMSDVTRKS RTGLIQAVVH LVNKNFDKLS KDFVELGFLS EEVDLEPIVP AFESVFSNAL EMGVNKMDFK SVTDDLSGVM YKFPFQLPPY YALIIRSLIT LEGIALSVDG DFKILGAAYP YFARRLMEDP DPQLRKSLKE MLFDGNTFRW NRLDELISSA SKQADIDIEN LLDKVLDFLF SEKGGVLRNE LVEAIINKFD AITWNTVQSI NKKLPIQIRS NSINSNQSDF MIEIEPIKKL VSILESLPGF NREIIIKKVP RILKEKDTRK MGIKIAKGIT EKSMVRMIKL AAGVNQ
|
| |