Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2002 |
Symbol | |
ID | 6375695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2154432 |
End bp | 2155523 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642684493 |
Product | GTP-dependent nucleic acid-binding protein EngD |
Protein accession | YP_001960393 |
Protein GI | 189500923 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0012] Predicted GTPase, probable translation factor |
TIGRFAM ID | [TIGR00092] GTP-binding protein YchF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00393198 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0943361 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTTAC GTTGCGGGAT CGTCGGGTTA CCCAATGTCG GTAAATCAAC TCTTTTCAAT GCCATAACGG CCAAACAGGC CGAAGCTGAA AATTATCCCT TCTGCACGAT TGAGCCCAAT GTAGGAATGG TGCTTGTTCC GGATGAAAGA CTTCAAAAAC TGGCTGATAT CGTCAAAACC CAGACGATCA TTCCGGCCAC AATCGAACTT GTGGATATCG CGGGCCTCGT GCGGGGTGCA AGCAAAGGCG AAGGTCTGGG AAACCAGTTC CTGTCGCATA TCAGAGAGGT CGATACCATC GTTCACGTTG TACGCTGTTT TGACGATCCC GATGTAATCC ATGTCCACGG CGTGGTGAAC CCGGCTGACG ATATCGCAAC GATCGAAACA GAACTGATGC TGGCGGACCT CGACAGCATG GAAAAAAGGC TCGAAAAACT GAAGAAGAAC GCAAGAAAAG AAAAGGAGCT CCTCCCGGCG GTCGCCCTTG CTGAAAAAAT CATCTCAGGG CTTGGCCAAG GTATCCCCGC GCGCGTCCTG ATGGAAACGG AAGAGGAGAA ATCTCTGGGA AAACAGTTTT TCCTGCTGTC AGCCAAACCC GTCCTTTACG CTGCGAATGT CTCCGACAAC GATCTGCCCG ACGGCAACAG CTTCACGGCT CAGGTTGCTG AAATCGCCGA AAAGGAAGGC GAAAAAATGC TGATTATCTG TGCAAAAACC GAAGCGGAAA TAGCTGAACT TCCTGAAGAG GAGCGTCCTG AATTTCTCGA AAGCCTCGGT CTTGAAATGT CAGGCCTTGA TCGTATCATA CAAACCGCCT ATGACCTGCT CGGACTGCAC ACATACTTTA CTGCCGGCGA AAAAGAAGTA CGTGCCTGGA CTATCCGAAA AGGCGCTGCC GCACCTGAAG CCGCGGCAGC GATTCATACC GATTTTGAAA AAGGGTTCAT ACGCGCCGAG GTAATCGCCT ACACCGACAT GATCACCTGC GGTTCTGAAC AGAAAGCCAA AGAAGCCGGC AAAATGCGCT CTGAAGGCAA GGAATACATT GTCAAGGACG GAGATGTGAT TGTTTTCCGA TTTAATGTAT AG
|
Protein sequence | MALRCGIVGL PNVGKSTLFN AITAKQAEAE NYPFCTIEPN VGMVLVPDER LQKLADIVKT QTIIPATIEL VDIAGLVRGA SKGEGLGNQF LSHIREVDTI VHVVRCFDDP DVIHVHGVVN PADDIATIET ELMLADLDSM EKRLEKLKKN ARKEKELLPA VALAEKIISG LGQGIPARVL METEEEKSLG KQFFLLSAKP VLYAANVSDN DLPDGNSFTA QVAEIAEKEG EKMLIICAKT EAEIAELPEE ERPEFLESLG LEMSGLDRII QTAYDLLGLH TYFTAGEKEV RAWTIRKGAA APEAAAAIHT DFEKGFIRAE VIAYTDMITC GSEQKAKEAG KMRSEGKEYI VKDGDVIVFR FNV
|
| |