Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4159 |
Symbol | |
ID | 3911967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4731466 |
End bp | 4732563 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637886063 |
Product | GTP-dependent nucleic acid-binding protein EngD |
Protein accession | YP_487762 |
Protein GI | 86751266 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0012] Predicted GTPase, probable translation factor |
TIGRFAM ID | [TIGR00092] GTP-binding protein YchF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.471291 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.369246 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTCA AATGCGGGAT CGTCGGGCTG CCGAATGTCG GCAAGTCGAC GCTGTTCAAT GCGCTGACCG AGACCGCGGC GGCGCAGGCG GCCAACTATC CGTTCTGCAC CATCGAGCCG AATGTCGGCG AGGTCGCGGT CCCGGATCCG CGGCTCGACA AGCTTGCCGA GGTCGGCAAG TCGCAGCAGA TCATTCCGAC GCGGCTGACC TTCGTCGACA TCGCCGGTCT GGTGAAGGGC GCTTCCAAGG GTGAAGGCCT CGGCAATCAG TTCCTCGCCA CCATCCGCGA GGTCGACGCG ATCGCTCATG TGGTGCGCTG TTTCGAGGAC GGTGACATCA CCCATGTCGA GGGAAGGGTC GCGCCGATCG CCGACATCGA CACGATCGAG ACCGAGCTGA TGCTCGCCGA TCTCGACAGC CTCGAGAAGC GCGTCGACAA CCTCACCAAG AAGGCCAAGG GCGGCGACAA GGATTCCAAG GAACAGCTCG AACTGGTCAC CCGCGCGCTG ACGCTGCTGC GCGAGGGGCG GCCGGCGCGC TTCCTCGAAC GCAAGCCGGA GGAAGAGCGC GCGTTCCGGA TGCTCGGGCT GTTGACCTCG AAGCCGGTTC TGTACGTCTG CAACGTCGAG GAAGGCTCCG CCGCCGAGGG CAATGCATTC TCGCAAGCGG TGATGGCGCG CGCCAAGGAC GAAGGCGCGG TCGCGGTGGT GATTTCCGCC AAGATCGAAT CCGAAATCGC GACGCTGTCG AAAGAAGAGC GCGTCGATTT CCTCGATACG CTGGGGCTGC ACGAGGCCGG GCTCGACCGG CTGATCCGCG CCGGCTACGA GCTCTTGCAC CTCATCACCT ATTTCACCGT CGGCCCCAAG GAAGCCCGCG CCTGGACCAT CACCAAAGGC ACCAAGGCGC CGCAGGCGGC GGCCGTGATC CATACCGATT TCGAGAAGGG CTTCATCCGC GCCGAAACCA TCGCCTATGA CGACTACACC ACGCTCGGCG GCGAAGCCGG CGCCCGCGAT GGCGGCAAGC TGCGGCTGGA AGGCAAGGAA TACGTCGTCG CCGACGGCGA CGTGATGCAT TTCCGATTCA ATACGTGA
|
Protein sequence | MGFKCGIVGL PNVGKSTLFN ALTETAAAQA ANYPFCTIEP NVGEVAVPDP RLDKLAEVGK SQQIIPTRLT FVDIAGLVKG ASKGEGLGNQ FLATIREVDA IAHVVRCFED GDITHVEGRV APIADIDTIE TELMLADLDS LEKRVDNLTK KAKGGDKDSK EQLELVTRAL TLLREGRPAR FLERKPEEER AFRMLGLLTS KPVLYVCNVE EGSAAEGNAF SQAVMARAKD EGAVAVVISA KIESEIATLS KEERVDFLDT LGLHEAGLDR LIRAGYELLH LITYFTVGPK EARAWTITKG TKAPQAAAVI HTDFEKGFIR AETIAYDDYT TLGGEAGARD GGKLRLEGKE YVVADGDVMH FRFNT
|
| |