Gene RPB_1217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1217 
Symbol 
ID3910152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1391472 
End bp1392851 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content68% 
IMG OID637883111 
Producthypothetical protein 
Protein accessionYP_484838 
Protein GI86748342 
COG category[S] Function unknown 
COG ID[COG3864] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.555758 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGA CCATCGCGCT GTATTGCCAT CGCGGAACGC GCGCGATCCA GCGCATGGTC 
GAGTTCGCGC CCTCCACCGG CGGGCTGGCA TTATGGGTGC GGCATCAGGA CCTGACCGCG
GACAGCGATA CGGCGGCCGT GGTCGTACTC ACCGACGGCA CCACCGTGTA TTACGGCGCC
GCCTTCGACA AGCTGCCGCT GCCCGAACAA GTCGGCCTCG TCGCCCACGA GGTGCTGCAC
ATCGCGCTGC GCCATCCGCA GCGCTTCGTC GAACTGCAGC GCGTGATCGG CGACGTCGAC
CTCGAATTGT TCAACATCTG CGCCGACGCC ATCGTCAATT CGACGCTGGC GCATCTGAGC
TGGCTGACGC TGCCGGAAAA GTCCGTGATG CTGGAGCAGA TCCTCGCCAA GGCGCTGAGG
CGCGAGCAGG ACGCCGAGGC GGCGCTGCTA GAATGGGACG TCGAGAAGCT GTATCGCGCG
ATCGACGATC GCGACAGCGA CAGCAACAAC GGCAAGTCGA AGACCGGCAA CAAATCGCAA
GCGGGCTCGC AGTCCGACGC CTCGGGCGCC GGCGGCGGCG ACCCGTCGCA GTCGCAATCC
GAATCCGCGC AGGAGAGCGC CGAACAGCGC GCCGACGGCG CCCGCGCGTC CAAGGTGCGC
GAGCTCGGCG CAGGCGGCGT CCGCGATCTG GTGCCGAATC CGGAATCACA ATCGGCGCCG
GAACACGAAG CCGAGCACGC CCGCGAATGG AGCGAGCGGA TCCTGCGCGG CCACGCCGGC
GACGGCGCGT TTTCGATGCT GCGGGCGCTG ATCGCAGACT TGCCGCGCAG CCGCACACCG
TGGGCGCAGG TGCTGCGCGT GCAGCTTGCG CGCGGGCTGG CGCGAAAACC GTCGCTGACC
TGGTCGCGGC CGGCGCGCTC CTACATCGCC AATCAGGGCC GCGCCGGCCA ACACCGGATG
CCGTTCGAGC CGGGATTCTC CGCGACCAAG AACGAGCCGC GGCTGGCGCT GATCATCGAC
GTCTCCGGCT CGATCGACGA CCAATTGATG GAGCGCTTCG CGCGCGAGAT CGAGACCATC
ACCCGGCGGC AGGAGGCCGG GCTGGTGCTG ATCATCGGCG ACGAGCGCGT GCGGCAGGTC
GAATTCTTCG AGCCGGGCCG GCGTTTCGTG CTAAGCGAGA TCGAATTCAC CGGCGGCGGC
GGCACCGATT TCACCCCACT CCTCGCGGAG GCGGACCGGC ACAGGCCGGA TATCGCGGTG
GTGCTGACCG ATCTCGAAGG TCCGGCGGAT TTCAAGCCGC GCTGGCCGGT GATCTGGGCG
GTTCCGGAGA ACTACTCACA TGCGGTGCAG CCGTTCGGCC GGCTGCTGAC GTTGAACTAA
 
Protein sequence
MPETIALYCH RGTRAIQRMV EFAPSTGGLA LWVRHQDLTA DSDTAAVVVL TDGTTVYYGA 
AFDKLPLPEQ VGLVAHEVLH IALRHPQRFV ELQRVIGDVD LELFNICADA IVNSTLAHLS
WLTLPEKSVM LEQILAKALR REQDAEAALL EWDVEKLYRA IDDRDSDSNN GKSKTGNKSQ
AGSQSDASGA GGGDPSQSQS ESAQESAEQR ADGARASKVR ELGAGGVRDL VPNPESQSAP
EHEAEHAREW SERILRGHAG DGAFSMLRAL IADLPRSRTP WAQVLRVQLA RGLARKPSLT
WSRPARSYIA NQGRAGQHRM PFEPGFSATK NEPRLALIID VSGSIDDQLM ERFAREIETI
TRRQEAGLVL IIGDERVRQV EFFEPGRRFV LSEIEFTGGG GTDFTPLLAE ADRHRPDIAV
VLTDLEGPAD FKPRWPVIWA VPENYSHAVQ PFGRLLTLN