Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_03881 |
Symbol | engA |
ID | 5730969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 364795 |
End bp | 366165 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641284745 |
Product | GTP-binding protein EngA |
Protein accession | YP_001550273 |
Protein GI | 159902929 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR03594] ribosome-associated GTPase EngA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACGTC CAATTGTTGC CATAATTGGA CGTCCAAATG TTGGTAAGTC TACTCTTGTC AATCGACTCT GTGGCAGTCG AGAGGCGATA GTCGATGATC AGCCTGGAGT GACTCGAGAC AGAACATATC AAGATGCTTT TTGGGCAGAT AGAGAATTTA AAGTAGTAGA TACTGGGGGA CTTGTATTTG ACGATGAAAG TGAATTTTTA CCAGAAATAC GCCAACAAGC AAAGCTTGCT CTTTCAGAGG CTTCAGTAGC TCTGATTGTT GTTGACGGTC AAGAAGGAGT TACCACTGCA GATAAAGAGA TAGCTTCATG GCTGAGACAT TGTGAATGTC CGACTTTAGT AGCAGTAAAC AAGTGTGAAT CCCCTGAGCA GGGCCTTGCT ATGGCAGCAG ACTTTTGGAG CCTTGGACTT GGAGAGCCTT ATCCAGTTTC TGCAATACAT GGTTCAGGTA CTGGAGAGCT GCTTGACCAA GTGATATTGC TATTGCCATC TAAGGAGTCC AGCGAGGAAG AGGATGAACC TATTCAATTG GCAATTATTG GCAGGCCAAA TGTAGGCAAA TCAAGTCTAT TGAATTCAAT ATGTGGCGAG ACCAGGGCAA TTGTTAGCTC TATTAGAGGT ACTACGAGGG ATACAATCGA TACTCTTTTA AAAAGAGAAC AGCAAGCTTG GAAGTTAATT GATACAGCTG GTATTCGTAG ACGACGCAGT GTGAGTTATG GTCCAGAGTA TTTTGGAATT AATAGAAGTT TGAAAGCAAT TGAAAGAAGT GATGTTTGCT TATTAGTTAT AGATGCTTTA GATGGGGTGA CAGAGCAGGA TCAGAGACTC GCTGGCAGAA TAGAACAAGA GGGAAAAGCC TGTTTAGTTG TAGTTAATAA ATGGGATGCA GTTGAGAAAG ATACTTACAC AATGCCACTT ATGGAAAAGG AGTTACGTTC AAAGCTTTAT TTTCTTGATT GGGCTGACAT GTTGTTTACT TCCGCCCTAA CTGGTCAAAG GGTTCAATTG ATTTTCAACT TGGCATCTTT AGCTGTAGAA CAACATCGCA GAAGAGTTAG TACATCTGTC GTTAATGAAG TCCTCTCAGA GGCTTTAACT TGGAGGAGCC CACCAACAAC TCGTGGTGGC AGGCAAGGTC GCCTTTATTA CGGGACACAA GTCTCAACTC AGCCTCCAAG CTTTAGCCTT TTTGTTAACG AACCTAAGCT TTTTGGTGAT TCATATAGAA GATACATCGA AAGACAACTG AGAGAAGGCC TTGGCTTTGA AGGCACTCCA TTGAAGTTGT TTTGGAGAGG GAAGCAACAG CGTGCTGCAC AAAAAGATTT AGCTCGCCAA AAAGAAAATT TATCTAAATA G
|
Protein sequence | MGRPIVAIIG RPNVGKSTLV NRLCGSREAI VDDQPGVTRD RTYQDAFWAD REFKVVDTGG LVFDDESEFL PEIRQQAKLA LSEASVALIV VDGQEGVTTA DKEIASWLRH CECPTLVAVN KCESPEQGLA MAADFWSLGL GEPYPVSAIH GSGTGELLDQ VILLLPSKES SEEEDEPIQL AIIGRPNVGK SSLLNSICGE TRAIVSSIRG TTRDTIDTLL KREQQAWKLI DTAGIRRRRS VSYGPEYFGI NRSLKAIERS DVCLLVIDAL DGVTEQDQRL AGRIEQEGKA CLVVVNKWDA VEKDTYTMPL MEKELRSKLY FLDWADMLFT SALTGQRVQL IFNLASLAVE QHRRRVSTSV VNEVLSEALT WRSPPTTRGG RQGRLYYGTQ VSTQPPSFSL FVNEPKLFGD SYRRYIERQL REGLGFEGTP LKLFWRGKQQ RAAQKDLARQ KENLSK
|
| |