Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_04431 |
Symbol | engA |
ID | 4780821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 405593 |
End bp | 406963 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640083720 |
Product | GTP-binding protein EngA |
Protein accession | YP_001014272 |
Protein GI | 124025156 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR03594] ribosome-associated GTPase EngA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0422674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCGCTAC CAGTAGTCGC AATAATTGGA CGCCCAAATG TTGGGAAATC TACATTGGTG AATCGCTTAT GTCAGAGCAG AGAAGCCATT GTTCATGATG AGCCGGGGGT AACGAGAGAT CGAACTTATC AAGATGGATT CTGGAGGGAT AGAGATTTTA AAGTTGTAGA TACTGGAGGG CTGGTTTTTG ATGACGATAG TGAGTTCCTC CCTGAAATTA GAGAGCAAGC TAATCTTGCG CTTGAGGAAG CCGTAGTTGC ATTAGTAATT GTTGATGGCC AGGAGGGAAT TACTACCGCT GATGAATCAA TTGCTGAATT TTTAAGGTCT CGTTCCTGCA AAACCCTCGT GGTGGTTAAT AAATGTGAAT CTCCCGAACA AGGTTTAGCA ATGGCAGCTG AATTTTGGAA GCTTGGTCTT GGTGAGCCCT ATCCAATCTC TGCCATACAT GGAGTAGGTA CAGGCGATCT GCTTGATCAG GTGGTTAATT TGTTTCCCTC TAAAGATTTA GATGAAGTTA GTGATTCTCC TGTTCAATTG GCAATTATTG GGAGACCAAA TGTAGGTAAG TCCAGTCTTC TTAATTCTAT TTGTGGAGAG ACAAGGGCAA TTGTTAGCTC TATTAGGGGT ACAACTCGAG ATACGATTGA TACTCGAATT ACTCATCAGG GTAAGGAATG GAAATTAGTT GATACGGCGG GAATACGTAG ACGTAGAAGT GTTAATTATG GCCCAGAATT TTTTGGTATT AATCGCAGTT TTAAGGCAAT AGAAAGAAGT GATGTCTGTG TGTTGGTTAT AGATGCTTTG GATGGCGTCA CAGAACAAGA TCAAAGGCTT GCAGGTAGAA TTGAGCAGGA AGGAAGAGCT TGTTTGATAG TCATTAATAA ATGGGATGCT GTAGAAAAAG ATAGTCACAC AATGTCTGCA ATGGAAAAAG ACATTCGTTC AAAATTATAT TTTCTCGATT GGGCCCAGAT GATCTTTACA TCAGCAGTTA CGGGTCAAAG AGTAGAAGGT ATTTTTGCAT TAGCTACTTT GGCCGTTGAT CAGAGTAGAA GAAGGGTAAC TACTTCAGTT GTTAATGAGG TGCTGACTGA GGCCTTAAAA TGGAGAAGTC CTCCTACAAC AAGAGGGGGA AAACAAGGGC GTCTTTATTA CGGTACTCAA GTAGCTATTA ATCCTCCCAG TTTTACTCTG TTCGTGAATG AACCTAAATT ATTTGGTGAA ACTTATCGAA GATATATTGA GAGACAAATT AGAGAGGGTC TTGGTTTTGA AGGGACTCCT ATAAAGTTAT TTTGGAGAGG GAAGCAGCAA CGCGATGTCG AAAAAGATAT GGCACGCCAA CAGAAAGGGG TCCAAAATTA G
|
Protein sequence | MALPVVAIIG RPNVGKSTLV NRLCQSREAI VHDEPGVTRD RTYQDGFWRD RDFKVVDTGG LVFDDDSEFL PEIREQANLA LEEAVVALVI VDGQEGITTA DESIAEFLRS RSCKTLVVVN KCESPEQGLA MAAEFWKLGL GEPYPISAIH GVGTGDLLDQ VVNLFPSKDL DEVSDSPVQL AIIGRPNVGK SSLLNSICGE TRAIVSSIRG TTRDTIDTRI THQGKEWKLV DTAGIRRRRS VNYGPEFFGI NRSFKAIERS DVCVLVIDAL DGVTEQDQRL AGRIEQEGRA CLIVINKWDA VEKDSHTMSA MEKDIRSKLY FLDWAQMIFT SAVTGQRVEG IFALATLAVD QSRRRVTTSV VNEVLTEALK WRSPPTTRGG KQGRLYYGTQ VAINPPSFTL FVNEPKLFGE TYRRYIERQI REGLGFEGTP IKLFWRGKQQ RDVEKDMARQ QKGVQN
|
| |