Gene P9211_10221 details

Gene Information

Locus tag: P9211_10221
Symbol:
ID: 5731825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 913582
End bp: 915132
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 40%
IMG OID: 641285389
Product: GTPase SAR1 and related small G protein
Protein accession: YP_001550907
Protein GI: 159903563
COG category: [R] General function prediction only
COG ID: [COG1100] GTPase SAR1 and related small G proteins
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
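
The Start bp, End bp, Gene Length and Protein Length values above are consistent with one another, and a short sketch can check this. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(913582, 915132)
print(length_bp, protein_length(length_bp))  # 1551 516
```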


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.524471
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.183753
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATAACC ACAGTAAACA GCCTCTAATT TTGATAGCAG CAATATGTTT AATCTTGATT 
TTCTCAGGAC TAATAGCTGC TCTTGTTCGC TTAATTAATA TTCCAGCAAT ACTTCTTACA
TTATTAATAT TATGCTATTT CATCTATCAA AAACGTTGGA ATTGGTTAAG GCGTTTCTTT
CTTAAAAGAA TTTTAATTCA TTATAAAAAT AATTATCGCC GTTTCTCTCC GAAAAGCAGT
AGGCAAGCTG CAAGACGAAG CCTTGAAAGT ATTGATCGAC TAATTGATCG CATTCATAAC
AATGTTTCGG CAGAAGCATT AAAACAACGT AGAGCCTCTG TAGAGCAAGA ATTAGTAAGA
GGAGACATCA CAGTGGTTCT ATTTGGAACT GGCTCTAGTG GAAAAACTAC TCTCATAAGA
GCACTCCTTA AAGAAATTGT TGGGGAGGTA TCAGCAACTA TGGGGACAAC AAAAACAAGT
CATACATATA GATTGCGACT AAAGGGGCTT GAAAGAGGTA TTCAGATAAT AGATACACCA
GGCATTCTTG AAACTGGCGA AGAAGGTAAT AAAAGAGAAA AAGAATCTTT TTTAAAAGCA
AGCCGTGCTG ATCTAATAAT CGTTGTTGTG GATACTGACC TAAGATCCAT CGAAATGAAG
CTTATAGCCA CACTTGCTAA AGGGGGAAAA AGGTTATTGC TCGTACTGAA CAAATGCGAC
CTTCGTGGTG AAGAAGAAAT TCGTAGACTT TTATTAACTC TAAGAAGACA TACAAAAGAC
TTGATCAATC CTGAAGATGT AATAGCCACT TCAGCATCTC CACAGTCGAT ACCAGTTCCA
GGTGGTTACC CTCTACAACC ACTCCCCGAG ATTGATGGAT TAATTAGGCA AGTGGCAAGG
ATTCTCCATG AAGAAGGAGA GGAGCTTATC GCCAGTAATA TACTTTTGCA ATGTAAAAAT
CTTGGGGATT CTGGGAGAAA ACTTCTAACA AATCAACGCA AGATAGCAGC GAAAAATTGT
GTAGAACGCT ATGCATGGAT AAGCAGTGGA GTTGTTGCAA TAACACCTCT ACCGGGTGTT
GACATGATTG GGGCCGCTGC TGTTAATGGT CAAATGGTTA TGGAAATAGC GCGAAACTAT
GGGCTTAAGC TAACCCGAAA AAGGTCTCAA GAACTAGCAC TTTCAGTTGG CAGAACTCTT
GCAGGGCTAG GAATAGTAAA AGGTGGGATG TCCATAATAA GTAATTCGCT AAGCCTAACC
CTTCCAACAA TAGTTATTGG GAAGGTCGTT CAGGGTATTA CTGCTGCTTG GCTCACAAAA
GTAGCTGGCG AGAGCTTTAT TACCTACTTC AGTCAAGATC AAGACTGGGG AGATGGTGGC
ATACAAGAAG TTGTCCAACG CCATTATAAT TTATATAGGA GGGAATCTAG CCTAAAAAGT
TTTATACAGA CAGCACTAGA TAGAGTAGTC GAACCATTGA AAGAGGAGCG CAGAAGAGAG
CTCCCTCCAC ACCTAAAGCT TCGGGAGGAG GAGGAAGTAG AGGACCTCTA A
 
Protein sequence
MHNHSKQPLI LIAAICLILI FSGLIAALVR LINIPAILLT LLILCYFIYQ KRWNWLRRFF 
LKRILIHYKN NYRRFSPKSS RQAARRSLES IDRLIDRIHN NVSAEALKQR RASVEQELVR
GDITVVLFGT GSSGKTTLIR ALLKEIVGEV SATMGTTKTS HTYRLRLKGL ERGIQIIDTP
GILETGEEGN KREKESFLKA SRADLIIVVV DTDLRSIEMK LIATLAKGGK RLLLVLNKCD
LRGEEEIRRL LLTLRRHTKD LINPEDVIAT SASPQSIPVP GGYPLQPLPE IDGLIRQVAR
ILHEEGEELI ASNILLQCKN LGDSGRKLLT NQRKIAAKNC VERYAWISSG VVAITPLPGV
DMIGAAAVNG QMVMEIARNY GLKLTRKRSQ ELALSVGRTL AGLGIVKGGM SIISNSLSLT
LPTIVIGKVV QGITAAWLTK VAGESFITYF SQDQDWGDGG IQEVVQRHYN LYRRESSLKS
FIQTALDRVV EPLKEERRRE LPPHLKLREE EEVEDL
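
The sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that spacing, compute a GC fraction, and translate a coding sequence. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the fragment used is only the first line of the gene sequence.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 shares
# these codon-to-amino-acid assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence above.
fragment = "ATGCATAACC ACAGTAAACA GCCTCTAATT TTGATAGCAG CAATATGTTT AATCTTGATT"
print(translate(fragment))  # MHNHSKQPLILIAAICLILI
print(round(gc_fraction(fragment), 2))
```

The sketch does not special-case alternative start codons, which table 11 permits. That is sufficient here because this gene begins with ATG. The GC fraction of a single line will generally differ from the whole-gene value reported in the record.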