Gene Information |
Locus tag | RPB_4067 |
Symbol | |
ID | 3911874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4640748 |
End bp | 4641959 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885971 |
Product | cobalamin synthesis protein, P47K |
Protein accession | YP_487671 |
Protein GI | 86751175 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
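The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the original record: the function name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # last codon is the stop codon
    return gene_length, protein_length


# Values taken from the record for RPB_4067
print(cds_lengths(4640748, 4641959))  # (1212, 403)
```

Under these assumptions the result matches the Gene Length (1212 bp) and Protein Length (403 aa) fields.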
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTTGAAT CCCGACTTCC GGTCACGGTG CTGTCCGGCT TTCTCGGCGC CGGAAAGACC ACCTTGCTCA ACCGCATGCT GACCAATCGC GAGGGCCGGC GTATCGCGGT GATCGTCAAT GACATGAGCG AGGTGAACAT CGACGCCGAA CTGGTGCGAC AGGACGGCGG GCTGACGCGA TCGGAAGAAT CGCTGGTGGA AATGACCAAT GGCTGCATCT GCTGCACGTT GCGCGAAGAT CTCCTGATCG AGGTCCGCAA GCTCGCCGAA AGCGGCCGGT TCGACGCCCT GGTGATCGAA TCGACGGGCA TCGCCGAGCC GCTGCCGATC GCCACCACCT TCGAGTTCCG TGACGAAGAA GGCGCCAGCC TGTCCGACAT CGCCCGGCTC GACACCATGG TGACCGTCGT TGACGCCGCG AGCCTGCTCT CCAACTATTC GAGCCAGGAG TTCCTGCGCG ACCGTGGCGA GGTTGCCGGC GAGCAGGACG ATCGCACTTT GGTCGACCTG CTGGTCGAAC AGATCGAGTT CGCCGACGTC ATCGTGCTCA ACAAGGTCTC GGCCACCTCG CCGGCCCGAC TCGAAGCGGC GCGGGCGATC GTCCGCTCGC TGAATTCGGA CGCCCGGATC ATCGAGACGG ATTTCGGCGA CGTTCCGCTG CAGTCGGTGC TGCACACCGG GCTGTTCGAT TTCGAGCGGG CGCATCAGCA CCCGCTGTGG TTCAAAGAGT TGAACGGCTT CAAGGATCAC GTACCCGAAA CCGACGAATA CGGCATCAGC TCGTTCGTGT ATCGCGCCCG GAGACCATTT CATCCGGCAC GGTTTCAGGG GTTCTGCAAC TGCAGCTGGC CGGGCGTGAT CCGCGCCAAG GGGTTCTTCT GGCTCGCAAC CCGGCCGCAT TACGTGGGCG AATTGGCCCA GGCCGGCGCC GTGGTGCGCA CCTCGAAGCG TGGCCTGTGG TGGTCGGCGG TGCCCAAGGC GCGATGGCCC GATCAGGAGT CCTGGCGCGA AGCGATGCAG CCCTATTTCG ACCCGGTGTG GGGCGACCGG CGCCAGGAAA TCGTCTTCAT CGGCACCGGC GAGATGGACG AGACCGCCTT GCGCCGGCAG CTCGACGCCT GTCTGGTCGG CGACCCTGCA CAGTTCACGC CGGACGCCTG GCGAGCACTG CCGGATCCGT TCCCGAATTG GGCTCCGGCC GAGGTCTCAT AG
Protein sequence | MLESRLPVTV LSGFLGAGKT TLLNRMLTNR EGRRIAVIVN DMSEVNIDAE LVRQDGGLTR SEESLVEMTN GCICCTLRED LLIEVRKLAE SGRFDALVIE STGIAEPLPI ATTFEFRDEE GASLSDIARL DTMVTVVDAA SLLSNYSSQE FLRDRGEVAG EQDDRTLVDL LVEQIEFADV IVLNKVSATS PARLEAARAI VRSLNSDARI IETDFGDVPL QSVLHTGLFD FERAHQHPLW FKELNGFKDH VPETDEYGIS SFVYRARRPF HPARFQGFCN CSWPGVIRAK GFFWLATRPH YVGELAQAGA VVRTSKRGLW WSAVPKARWP DQESWREAMQ PYFDPVWGDR RQEIVFIGTG EMDETALRRQ LDACLVGDPA QFTPDAWRAL PDPFPNWAPA EVS
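The sequences above are displayed in space-separated blocks of ten residues. Before any downstream use, the spacing has to be removed. The sketch below shows one possible way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of the source database.

```python
def clean_sequence(raw):
    """Strip the whitespace that groups residues into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence, copied from the record
head = "ATGCTTGAAT CCCGACTTCC GGTCACGGTG"
print(len(clean_sequence(head)))  # 30
print(round(gc_percent(head), 1))
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared against the reported GC content. The record does not state how that figure was rounded.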