Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_10221 |
Symbol | |
ID | 5731825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 913582 |
End bp | 915132 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285389 |
Product | GTPase SAR1 and related small G protein |
Protein accession | YP_001550907 |
Protein GI | 159903563 |
COG category | [R] General function prediction only |
COG ID | [COG1100] GTPase SAR1 and related small G proteins |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.524471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.183753 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAACC ACAGTAAACA GCCTCTAATT TTGATAGCAG CAATATGTTT AATCTTGATT TTCTCAGGAC TAATAGCTGC TCTTGTTCGC TTAATTAATA TTCCAGCAAT ACTTCTTACA TTATTAATAT TATGCTATTT CATCTATCAA AAACGTTGGA ATTGGTTAAG GCGTTTCTTT CTTAAAAGAA TTTTAATTCA TTATAAAAAT AATTATCGCC GTTTCTCTCC GAAAAGCAGT AGGCAAGCTG CAAGACGAAG CCTTGAAAGT ATTGATCGAC TAATTGATCG CATTCATAAC AATGTTTCGG CAGAAGCATT AAAACAACGT AGAGCCTCTG TAGAGCAAGA ATTAGTAAGA GGAGACATCA CAGTGGTTCT ATTTGGAACT GGCTCTAGTG GAAAAACTAC TCTCATAAGA GCACTCCTTA AAGAAATTGT TGGGGAGGTA TCAGCAACTA TGGGGACAAC AAAAACAAGT CATACATATA GATTGCGACT AAAGGGGCTT GAAAGAGGTA TTCAGATAAT AGATACACCA GGCATTCTTG AAACTGGCGA AGAAGGTAAT AAAAGAGAAA AAGAATCTTT TTTAAAAGCA AGCCGTGCTG ATCTAATAAT CGTTGTTGTG GATACTGACC TAAGATCCAT CGAAATGAAG CTTATAGCCA CACTTGCTAA AGGGGGAAAA AGGTTATTGC TCGTACTGAA CAAATGCGAC CTTCGTGGTG AAGAAGAAAT TCGTAGACTT TTATTAACTC TAAGAAGACA TACAAAAGAC TTGATCAATC CTGAAGATGT AATAGCCACT TCAGCATCTC CACAGTCGAT ACCAGTTCCA GGTGGTTACC CTCTACAACC ACTCCCCGAG ATTGATGGAT TAATTAGGCA AGTGGCAAGG ATTCTCCATG AAGAAGGAGA GGAGCTTATC GCCAGTAATA TACTTTTGCA ATGTAAAAAT CTTGGGGATT CTGGGAGAAA ACTTCTAACA AATCAACGCA AGATAGCAGC GAAAAATTGT GTAGAACGCT ATGCATGGAT AAGCAGTGGA GTTGTTGCAA TAACACCTCT ACCGGGTGTT GACATGATTG GGGCCGCTGC TGTTAATGGT CAAATGGTTA TGGAAATAGC GCGAAACTAT GGGCTTAAGC TAACCCGAAA AAGGTCTCAA GAACTAGCAC TTTCAGTTGG CAGAACTCTT GCAGGGCTAG GAATAGTAAA AGGTGGGATG TCCATAATAA GTAATTCGCT AAGCCTAACC CTTCCAACAA TAGTTATTGG GAAGGTCGTT CAGGGTATTA CTGCTGCTTG GCTCACAAAA GTAGCTGGCG AGAGCTTTAT TACCTACTTC AGTCAAGATC AAGACTGGGG AGATGGTGGC ATACAAGAAG TTGTCCAACG CCATTATAAT TTATATAGGA GGGAATCTAG CCTAAAAAGT TTTATACAGA CAGCACTAGA TAGAGTAGTC GAACCATTGA AAGAGGAGCG CAGAAGAGAG CTCCCTCCAC ACCTAAAGCT TCGGGAGGAG GAGGAAGTAG AGGACCTCTA A
|
Protein sequence | MHNHSKQPLI LIAAICLILI FSGLIAALVR LINIPAILLT LLILCYFIYQ KRWNWLRRFF LKRILIHYKN NYRRFSPKSS RQAARRSLES IDRLIDRIHN NVSAEALKQR RASVEQELVR GDITVVLFGT GSSGKTTLIR ALLKEIVGEV SATMGTTKTS HTYRLRLKGL ERGIQIIDTP GILETGEEGN KREKESFLKA SRADLIIVVV DTDLRSIEMK LIATLAKGGK RLLLVLNKCD LRGEEEIRRL LLTLRRHTKD LINPEDVIAT SASPQSIPVP GGYPLQPLPE IDGLIRQVAR ILHEEGEELI ASNILLQCKN LGDSGRKLLT NQRKIAAKNC VERYAWISSG VVAITPLPGV DMIGAAAVNG QMVMEIARNY GLKLTRKRSQ ELALSVGRTL AGLGIVKGGM SIISNSLSLT LPTIVIGKVV QGITAAWLTK VAGESFITYF SQDQDWGDGG IQEVVQRHYN LYRRESSLKS FIQTALDRVV EPLKEERRRE LPPHLKLREE EEVEDL
|
| |