Gene RPB_4159 details

Gene Information

Locus tag: RPB_4159
Symbol:
ID: 3911967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4731466
End bp: 4732563
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 64%
IMG OID: 637886063
Product: GTP-dependent nucleic acid-binding protein EngD
Protein accession: YP_487762
Protein GI: 86751266
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0012] Predicted GTPase, probable translation factor
TIGRFAM ID: [TIGR00092] GTP-binding protein YchF
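
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for RPB_4159.
length = gene_length_bp(4731466, 4732563)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields listed in the record.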


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.471291
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.369246
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATTCA AATGCGGGAT CGTCGGGCTG CCGAATGTCG GCAAGTCGAC GCTGTTCAAT 
GCGCTGACCG AGACCGCGGC GGCGCAGGCG GCCAACTATC CGTTCTGCAC CATCGAGCCG
AATGTCGGCG AGGTCGCGGT CCCGGATCCG CGGCTCGACA AGCTTGCCGA GGTCGGCAAG
TCGCAGCAGA TCATTCCGAC GCGGCTGACC TTCGTCGACA TCGCCGGTCT GGTGAAGGGC
GCTTCCAAGG GTGAAGGCCT CGGCAATCAG TTCCTCGCCA CCATCCGCGA GGTCGACGCG
ATCGCTCATG TGGTGCGCTG TTTCGAGGAC GGTGACATCA CCCATGTCGA GGGAAGGGTC
GCGCCGATCG CCGACATCGA CACGATCGAG ACCGAGCTGA TGCTCGCCGA TCTCGACAGC
CTCGAGAAGC GCGTCGACAA CCTCACCAAG AAGGCCAAGG GCGGCGACAA GGATTCCAAG
GAACAGCTCG AACTGGTCAC CCGCGCGCTG ACGCTGCTGC GCGAGGGGCG GCCGGCGCGC
TTCCTCGAAC GCAAGCCGGA GGAAGAGCGC GCGTTCCGGA TGCTCGGGCT GTTGACCTCG
AAGCCGGTTC TGTACGTCTG CAACGTCGAG GAAGGCTCCG CCGCCGAGGG CAATGCATTC
TCGCAAGCGG TGATGGCGCG CGCCAAGGAC GAAGGCGCGG TCGCGGTGGT GATTTCCGCC
AAGATCGAAT CCGAAATCGC GACGCTGTCG AAAGAAGAGC GCGTCGATTT CCTCGATACG
CTGGGGCTGC ACGAGGCCGG GCTCGACCGG CTGATCCGCG CCGGCTACGA GCTCTTGCAC
CTCATCACCT ATTTCACCGT CGGCCCCAAG GAAGCCCGCG CCTGGACCAT CACCAAAGGC
ACCAAGGCGC CGCAGGCGGC GGCCGTGATC CATACCGATT TCGAGAAGGG CTTCATCCGC
GCCGAAACCA TCGCCTATGA CGACTACACC ACGCTCGGCG GCGAAGCCGG CGCCCGCGAT
GGCGGCAAGC TGCGGCTGGA AGGCAAGGAA TACGTCGTCG CCGACGGCGA CGTGATGCAT
TTCCGATTCA ATACGTGA
 
Protein sequence
MGFKCGIVGL PNVGKSTLFN ALTETAAAQA ANYPFCTIEP NVGEVAVPDP RLDKLAEVGK 
SQQIIPTRLT FVDIAGLVKG ASKGEGLGNQ FLATIREVDA IAHVVRCFED GDITHVEGRV
APIADIDTIE TELMLADLDS LEKRVDNLTK KAKGGDKDSK EQLELVTRAL TLLREGRPAR
FLERKPEEER AFRMLGLLTS KPVLYVCNVE EGSAAEGNAF SQAVMARAKD EGAVAVVISA
KIESEIATLS KEERVDFLDT LGLHEAGLDR LIRAGYELLH LITYFTVGPK EARAWTITKG
TKAPQAAAVI HTDFEKGFIR AETIAYDDYT TLGGEAGARD GGKLRLEGKE YVVADGDVMH
FRFNT
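
The relationship between the gene sequence and the protein sequence above can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own method: it builds a codon lookup from the amino-acid assignments of NCBI translation table 11 (the table named in the record) and translates the first and last portions of the gene sequence. The helper name `translate` is hypothetical, and the sketch handles only unambiguous A/C/G/T input.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, with codons
# enumerated in T, C, A, G order (third base varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nt and last 18 nt of the gene sequence, copied from the record.
first_block = "ATGGGATTCA AATGCGGGAT CGTCGGGCTG"
last_block = "TTCCGATTCA ATACGTGA"

print(translate(first_block))  # MGFKCGIVGL
print(translate(last_block))   # FRFNT*
```

The two outputs match the first ten and last five residues of the listed protein sequence, with the trailing `*` corresponding to the TGA stop codon that is not part of the 365 aa product.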