Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_11361 |
Symbol | |
ID | 4717848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 956660 |
End bp | 957733 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640078851 |
Product | WD-40 repeat-containing G-protein |
Protein accession | YP_001009527 |
Protein GI | 123968669 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGAAA TAGAACCATT TAGCCCAAGA GGCATGTTCC ATGAAGGATG GACTGCTGAA GTTAACGATT ACGCCATAGC ATGTGGCTGG GCTCTTAAAG GAAAACAATT TATTGTTGGC GACGTAGCAG GAGGTCTTTT TTCGTTCGAA GGAGATACTG GAAAAATTAT TTGGAAAAAA GAAAATACTC ACTCTGGTGG TCTACTAGCA ATGGCAATTC ATCCAGAGGG TGAAATTTTT GCAACATCAG GTCAGGATGG AAATGTTCAA ATTTGTAATT GCCATGAAGG TAAAGTTATT AAAACACTTG ATCTTGGCAA AGGTTGGGTA GAACACCTCA AGTGGTCTAA TGACGGTTTA TTTCTAGCGA TAGCTTCCTC AAAAAAAGTA TATGTTTTTA ATGAAATTGG TGAAGAGAAA TGGATCTCAG AAGACCATCC AAGCACAGTA AGTGCAATAA CGTGGTCAAA TAAAAATGAG CTAGCAACTG CTTGCTATGG AAGAGTGACA TTCTTTGACA TAGTTAATAA TAAAACGAAT CAAAAACTCG AATGGCAAGG ATCATTGGTA TCTATGGAAT TAAGCCCTGA TGGAGATATA GTCGCCTGTG GGAGTCAAGA CAATTCAGTT CATTTTTGGA GAAGATCAAC AGGAATGGAT GCTGAAATGA CTGGATACCC TGGAAAACCA AGTCACCTTT CTTTTGATGA CAGCGGAAAA TTATTAGCGA CTAGCGGCAG TGAAAGAATT ACAGTATGGA GCTTTATAGG AAATGGTCCA GAAGGGACTA TGCCAGGAGA GCTATGTCAC CATACAGAAC CTATTTCGAG CTTAGCCTTT TCAAATAAAG GTATGCTTGT AGCCTCAGGA TCTAGGGATG GTTCTGTTGT CGCAAGTTTC CTAAAAAATG ACGGCAATGG AGACCCAGTT GGGGCTGCAT TTGCCGGAGA TTTGGTAGGG GCAATATCAT GGAGACCTGA TGATTGTGCA CTTGCAGCAG TTAATGCAAA AGGTGTTGTA AATGTATGGA AATTTAAAGT TCGTACTAAC CTTTTTTCAA AGGGATTTAA GTAA
|
Protein sequence | MPEIEPFSPR GMFHEGWTAE VNDYAIACGW ALKGKQFIVG DVAGGLFSFE GDTGKIIWKK ENTHSGGLLA MAIHPEGEIF ATSGQDGNVQ ICNCHEGKVI KTLDLGKGWV EHLKWSNDGL FLAIASSKKV YVFNEIGEEK WISEDHPSTV SAITWSNKNE LATACYGRVT FFDIVNNKTN QKLEWQGSLV SMELSPDGDI VACGSQDNSV HFWRRSTGMD AEMTGYPGKP SHLSFDDSGK LLATSGSERI TVWSFIGNGP EGTMPGELCH HTEPISSLAF SNKGMLVASG SRDGSVVASF LKNDGNGDPV GAAFAGDLVG AISWRPDDCA LAAVNAKGVV NVWKFKVRTN LFSKGFK
|
| |