Gene RPB_0215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0215 
Symbol 
ID3909457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp242500 
End bp243780 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content68% 
IMG OID637882097 
Producthypothetical protein 
Protein accessionYP_483837 
Protein GI86747341 
COG category[S] Function unknown 
COG ID[COG4487] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGC TGACCATCAT TTGCCCGAAC TGCGCCACCA GCGTCCCGCT GACGGAATCG 
CTGGCGGCGC CGCTGCTCAA GGATACGCAG AGCAAATACG AGCGGCTGAT CGCGCAGAAG
GACAAGGACA TCGCCGGCCG CGAGGCCGCG CTGGAGGCGC AGCGGGCCGA TCTCGACAAC
GCGAAAGCCG CCGTCGGCCA GCAGGTCGCC GAGCGGATCG CGGTCGAGCG CACCCGGATC
GCCGCCGAGG AAGCCGCCAA AGCGAAGCGG CTCGCGGCCG ACGACCTCGA CGCCAAGGCG
CGGCAACTCG CCGAACTCAC CGAGGCGATG CAGCAGAAGG ACGTCAAGCT CGCCGAAGCG
CAGCGGGCGC AGGCGGCGTT CCTGCAGAAG CAGCGCCAGC TCGAAGACGA GAAGCGCGAG
CTCGACCTCA CCATCGAAAA GCGCGTGCAG GCGTCGCTGG AGAGCGTGCG CAGCAAGGCC
AAACAGGATG CCGAAGAGGG GCTGCGGCTC AAGGTCGCCG AGAAGGAAGA AACCATCGCG
ACGATGCAGC GGCAGATCGA CAAGCTGAAA TCCGAGCAGG GCTCGCAGCA ATTGCAGGGC
GAGGTGATGG AGCTCGAGCT CGAAGCCTCG CTGCGCGCAC GCTTCCCGCA GGATTCGATC
GAGCCGGTGC CGAAGGGCGA GTTCGGCGGC GACGTGCTGC ACCGCGTGGT CAACGCCGCC
AATCAGCCCT GCGGCACGAT CCTGTGGGAA TCCAAGCGCA CCAAGAACTG GACCGACGGC
TGGCTGACCA AGCTGCGCGA CGACCAGCGC AAGGCCAAGG CCGAGCTGGC GCTGATCGTC
TCCAACGCGC TGCCGAAGGG CGTGCACAGC TTCGATCACA TCGACGGCGT CTGGGTCGCC
GAGGCGCGCT GCGCGATCCC GGTCGCGATC GCGCTGCGAC AGTCGCTGAT CGAGCTCGCC
GCCGCGCGCC AGGCCGGCGA AGGCCAGCAG ACCAAGACCG AGCTGGTGTA TCACTATCTC
ACCGGGCCGC GGTTCCGGCA GCGGGTCGAG GCGATCGTCG AGAAATTCAC CGAGATGCAG
TCCGACCTCG ACAAGGAACG CCGCTCGATG ATGCGGATGT GGGCGAAGCG CGAGGCGCAG
ATCCGCGGCG TGCTGGAAGC GACCGCCGGG ATGTATGGCG ACCTGCAGGG CATCGCCGGC
AAGGCGCTGG GCGAGATCGA CGGCATGGCG CTGCCGATGC TGGAGGATTT CAGCGACGAC
GAGGCCGATC AGGCGGCGTG A
 
Protein sequence
MTELTIICPN CATSVPLTES LAAPLLKDTQ SKYERLIAQK DKDIAGREAA LEAQRADLDN 
AKAAVGQQVA ERIAVERTRI AAEEAAKAKR LAADDLDAKA RQLAELTEAM QQKDVKLAEA
QRAQAAFLQK QRQLEDEKRE LDLTIEKRVQ ASLESVRSKA KQDAEEGLRL KVAEKEETIA
TMQRQIDKLK SEQGSQQLQG EVMELELEAS LRARFPQDSI EPVPKGEFGG DVLHRVVNAA
NQPCGTILWE SKRTKNWTDG WLTKLRDDQR KAKAELALIV SNALPKGVHS FDHIDGVWVA
EARCAIPVAI ALRQSLIELA AARQAGEGQQ TKTELVYHYL TGPRFRQRVE AIVEKFTEMQ
SDLDKERRSM MRMWAKREAQ IRGVLEATAG MYGDLQGIAG KALGEIDGMA LPMLEDFSDD
EADQAA