Gene Information |
Locus tag | Cpha266_2013 |
Symbol | engA |
ID | 4569540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2325378 |
End bp | 2326691 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639766594 |
Product | GTP-binding protein EngA |
Protein accession | YP_912449 |
Protein GI | 119357805 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR03594] ribosome-associated GTPase EngA |
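The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes 1-based, inclusive coordinates (the convention under which End bp minus Start bp plus 1 matches the listed Gene Length) and a CDS that ends in exactly one stop codon. The function names are illustrative only and do not belong to any database API.

```python
# Sanity check of the record's coordinate and length fields.
# Assumptions (not stated in the record itself): coordinates are 1-based and
# inclusive, and the CDS ends with a single stop codon that is not translated.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


START_BP = 2325378  # "Start bp" field
END_BP = 2326691    # "End bp" field

length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1314 437
```

Both values agree with the Gene Length (1314 bp) and Protein Length (437 aa) fields above.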
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00580637 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCTC TCATCGCTAT CGTTGGCCGG CCGAATGTCG GAAAGTCGAT GCTTTTCAAC AGAATACTTC GCGAAAAAAG CGCCATTGTA GACAGTACGC CCGGCGTCAC CAGAGACCGC CACATCTCTC CCGGAGAGTG GCAGGGAAAA CAATTTCTTC TTATGGATAC CGGCGGTTAC TGCCCCGAAG GGGATGTGAT CAGCATGGCT ATGCTCGAAC AAACGCTGAT GGCTATCCGC GACGCCGATA TCATCCTCTT TCTTGCCGAT GTACGATCAG GACTAACCTA CGACGACCTT GAAATCAGCA AGCTTCTGCA GCGGACATTC CAGCACAAGC AGATCTTTTT TGCGGTCAAC AAGGTTGAAA CCCCGCAACT TTCCATCGAT GCGGAATCAT TTGTCAGTAC CGGTTTCACG AAACCCTATT TCGTTTCGGC AAGAGACGGC AGCGGCGTCG CCGAACTGCT TGACGATATG CTTGATTCGT TGCCCGTCCA GGAAAAACAA CTCGTCGAAA AAGACCTGCC GACAAATCTC GCCGTTGTCG GACGTCCCAA TGTCGGCAAA TCAAGTTTCG TCAATGCCCT GCTCGGAGCA AACCGCCTGA TCGTGTCAGA CATACCAGGC ACCACGCGTG ATGCTATCGA CAGCCGCTTT ACCCGCAAAA AACAGGATTT CGTCCTGATA GACACGGCCG GACTGAGAAA ACGCACCAAA ATCGACGCCG GCATCGAGTA CTACAGCTCC CTGCGAACCG ATAAAGCCAT CGAGCGATGC GACGTCGCAC TGGTAATGAT CGACGCCAGA ACGGGCATCG AAAACCAGGA TATGAAAATA ATCAATATGG CCGTAGAACG TAAAAGAGGC GTCCTGCTTC TGATAAACAA ATGGGATCTG GTCGAAAAGG ATTCCAAAAC CAGTGCCCAT TACGAAAAAG AGGTTCGCTC GCACATGGGA AACCTTGCAT ATATACCCCT GCTGTTTATT TCTGCCCTGA CTAAAAAAAA TCTCTACAGG GCGATCGATA CAGCTCAGGA GATCAGCGAA AACCGGTCGC GAAAAATCAC CACCAGCGCC CTGAACCGCT TTCTTGAAGT TGCCCTTGCA GAAAAACATC CGTCAACGAA ATCCGGCAAG GAGCTGAAAA TAAAATACAT GACCCAGATC GAAGCACCGT GGCCGGTATT TGCTTTTTTC TGCAACGATC CCGAACTTCT GCAGACCAAT TTCAGAAAGT TTCTTGAAAA CAAGCTGCGC GAACACTTCA AACTCGAAGG AGTAACCGTT TCGCTGCGCT TTTTCAAAAA ATGA
|
Protein sequence | MKPLIAIVGR PNVGKSMLFN RILREKSAIV DSTPGVTRDR HISPGEWQGK QFLLMDTGGY CPEGDVISMA MLEQTLMAIR DADIILFLAD VRSGLTYDDL EISKLLQRTF QHKQIFFAVN KVETPQLSID AESFVSTGFT KPYFVSARDG SGVAELLDDM LDSLPVQEKQ LVEKDLPTNL AVVGRPNVGK SSFVNALLGA NRLIVSDIPG TTRDAIDSRF TRKKQDFVLI DTAGLRKRTK IDAGIEYYSS LRTDKAIERC DVALVMIDAR TGIENQDMKI INMAVERKRG VLLLINKWDL VEKDSKTSAH YEKEVRSHMG NLAYIPLLFI SALTKKNLYR AIDTAQEISE NRSRKITTSA LNRFLEVALA EKHPSTKSGK ELKIKYMTQI EAPWPVFAFF CNDPELLQTN FRKFLENKLR EHFKLEGVTV SLRFFKK
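The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before use. The sketch below is a hedged, standard-library-only example: it strips the spacing, computes GC fraction, and translates DNA codon by codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons); the helper names are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
# Minimal sketch: clean the blocked sequence text and translate it.
# Codon table built in NCBI order (bases T, C, A, G); table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed in this record.
prefix = clean_sequence("ATGAAACCTC TCATCGCTAT CGTTGGCCGG")
print(translate(prefix))  # MKPLIAIVGR
```

The translated prefix matches the first ten residues of the protein sequence in this record. Passing the full gene sequence to `clean_sequence` and `translate` applies the same check to the whole CDS.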