Gene P9211_03881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_03881 
SymbolengA 
ID5730969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp364795 
End bp366165 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content42% 
IMG OID641284745 
ProductGTP-binding protein EngA 
Protein accessionYP_001550273 
Protein GI159902929 
COG category[R] General function prediction only 
COG ID[COG1160] Predicted GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR03594] ribosome-associated GTPase EngA 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGTC CAATTGTTGC CATAATTGGA CGTCCAAATG TTGGTAAGTC TACTCTTGTC 
AATCGACTCT GTGGCAGTCG AGAGGCGATA GTCGATGATC AGCCTGGAGT GACTCGAGAC
AGAACATATC AAGATGCTTT TTGGGCAGAT AGAGAATTTA AAGTAGTAGA TACTGGGGGA
CTTGTATTTG ACGATGAAAG TGAATTTTTA CCAGAAATAC GCCAACAAGC AAAGCTTGCT
CTTTCAGAGG CTTCAGTAGC TCTGATTGTT GTTGACGGTC AAGAAGGAGT TACCACTGCA
GATAAAGAGA TAGCTTCATG GCTGAGACAT TGTGAATGTC CGACTTTAGT AGCAGTAAAC
AAGTGTGAAT CCCCTGAGCA GGGCCTTGCT ATGGCAGCAG ACTTTTGGAG CCTTGGACTT
GGAGAGCCTT ATCCAGTTTC TGCAATACAT GGTTCAGGTA CTGGAGAGCT GCTTGACCAA
GTGATATTGC TATTGCCATC TAAGGAGTCC AGCGAGGAAG AGGATGAACC TATTCAATTG
GCAATTATTG GCAGGCCAAA TGTAGGCAAA TCAAGTCTAT TGAATTCAAT ATGTGGCGAG
ACCAGGGCAA TTGTTAGCTC TATTAGAGGT ACTACGAGGG ATACAATCGA TACTCTTTTA
AAAAGAGAAC AGCAAGCTTG GAAGTTAATT GATACAGCTG GTATTCGTAG ACGACGCAGT
GTGAGTTATG GTCCAGAGTA TTTTGGAATT AATAGAAGTT TGAAAGCAAT TGAAAGAAGT
GATGTTTGCT TATTAGTTAT AGATGCTTTA GATGGGGTGA CAGAGCAGGA TCAGAGACTC
GCTGGCAGAA TAGAACAAGA GGGAAAAGCC TGTTTAGTTG TAGTTAATAA ATGGGATGCA
GTTGAGAAAG ATACTTACAC AATGCCACTT ATGGAAAAGG AGTTACGTTC AAAGCTTTAT
TTTCTTGATT GGGCTGACAT GTTGTTTACT TCCGCCCTAA CTGGTCAAAG GGTTCAATTG
ATTTTCAACT TGGCATCTTT AGCTGTAGAA CAACATCGCA GAAGAGTTAG TACATCTGTC
GTTAATGAAG TCCTCTCAGA GGCTTTAACT TGGAGGAGCC CACCAACAAC TCGTGGTGGC
AGGCAAGGTC GCCTTTATTA CGGGACACAA GTCTCAACTC AGCCTCCAAG CTTTAGCCTT
TTTGTTAACG AACCTAAGCT TTTTGGTGAT TCATATAGAA GATACATCGA AAGACAACTG
AGAGAAGGCC TTGGCTTTGA AGGCACTCCA TTGAAGTTGT TTTGGAGAGG GAAGCAACAG
CGTGCTGCAC AAAAAGATTT AGCTCGCCAA AAAGAAAATT TATCTAAATA G
 
Protein sequence
MGRPIVAIIG RPNVGKSTLV NRLCGSREAI VDDQPGVTRD RTYQDAFWAD REFKVVDTGG 
LVFDDESEFL PEIRQQAKLA LSEASVALIV VDGQEGVTTA DKEIASWLRH CECPTLVAVN
KCESPEQGLA MAADFWSLGL GEPYPVSAIH GSGTGELLDQ VILLLPSKES SEEEDEPIQL
AIIGRPNVGK SSLLNSICGE TRAIVSSIRG TTRDTIDTLL KREQQAWKLI DTAGIRRRRS
VSYGPEYFGI NRSLKAIERS DVCLLVIDAL DGVTEQDQRL AGRIEQEGKA CLVVVNKWDA
VEKDTYTMPL MEKELRSKLY FLDWADMLFT SALTGQRVQL IFNLASLAVE QHRRRVSTSV
VNEVLSEALT WRSPPTTRGG RQGRLYYGTQ VSTQPPSFSL FVNEPKLFGD SYRRYIERQL
REGLGFEGTP LKLFWRGKQQ RAAQKDLARQ KENLSK