Gene RPB_2542 details

Gene Information

Locus tag: RPB_2542
Symbol:
ID: 3910331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2911187
End bp: 2912518
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 65%
IMG OID: 637884440
Product: hypothetical protein
Protein accession: YP_486157
Protein GI: 86749661
COG category:
COG ID:
TIGRFAM ID:
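
The coordinate and length fields listed above can be cross-checked with a short calculation. The sketch below is illustrative only: it assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2911187, 2912518)
print(length)                  # 1332
print(protein_length(length))  # 443
```

Under these assumptions the computed values agree with the Gene Length (1332 bp) and Protein Length (443 aa) fields of this record.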


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.15406
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGCA CCACGAAGAA GATCAAGCCC AAGGAGCGGG ACACGATCAT TCAGGCGCTA 
TCCGCCGGTG TCGTGCCGAG GCTCGGGTTG CCGTACATCC AGGTCGGGCG CGCCGCCGAG
ATCGCCGCGC TGCTGCGGGA CGTCGATCGT ATCGGCGACG GCGGGGCCTG CGTACGCTTC
GTGATTGGCG AGTACGGCGC CGGCAAAACC TTCTTCGCCA ATCTGATCCG GCTGATCGCA
CTTGAACGGA AGTGCGTCAC GATCCATGCG GATCTGGCGC CTGACCGCCG CATCCACGCG
ACCGGTGGCC AGGCGCGCGC GCTGTATTCG GAGGCTGTCC GAAACATGGC GACCCGGACG
AAGCCCGAGG GAGGCGCGCT GGCCGGCGTC GTGGAGCGCC TTGTCACCGA TTCCGTGAAG
GAGGCGGCGG AGCGCAGCAT CCCGGTCGAG ACCGTGATCG ACCAGAAGCT GGCGCCGATC
CAGGAGTTCG TCGGCGGCTA CGACTTCGCG GTCGTGCTGA AGGCGTACTG GAAGGGAAGC
GAGGAGGGCG ATGAGGAGCT GAAAGCGGCG GCGCTGCGCT GGCTCCGCGG AGAGTTTTCC
ACCAAAACGG AGGCGCGCCA AGCGCTTGGG GTTCGCACCA TCATCGACGA CGGCGACATC
TACGACAGCC TGAAGTCTCT GGCTTGTCTG ACCCGCATCG CTGGCTACGC CGGACTGGTG
GTGATGTTCG ATGAGATGGT CAATCTCTAC AAGCTTCAGA GTTCGCAGGC GCGCAACCAA
AACTTTGAGG AGATCCTCCG GATCGTGAAC GACGCGCTCC AGGGCAACAC GTCGGGCATC
GGCTTCGTGA TGTGCGGGAC CCCGGAATTC CTGATGGATA CGCGGCGGGG TCTATACAGC
TACGAAGCCC TCCAGTCCCG TCTGGCCGAG AACCGCTTCG CCACCGGCGG TCTCGTGGAC
TACAGCGGCC CGGTCCTCCG GCTTCAGAAT CTGACGCCGG AGGACATGAT GGTCCTGCTG
ACCAACATCC GCGCGGTCTT CGCCGGCGGC GATCCGGAGA GATTCCTGGT GCCGGACGAG
GCGCTGCACG CCTTCATGGA TCACTGCAAC AAGCGCATCG GAGAAGCCTA CTTTCGGGCG
CCCCGGACGA CCGTGAAGGC GTTCGTGCAG ATGTTGTCGG TGCTGGAGCA GAACCCGACC
GCGAAGTGGC AGGATCTCCT GGGGCAGGTC GAGGTCGCCC CGGACGCTCC TGATACCCAG
CCGACCACGG AAGGCGAGAC GTCCAACTCG GGCGAGGAGG GTGATGAGCT CACCAAGCTT
CGCCTCCCTT GA
 
Protein sequence
MTSTTKKIKP KERDTIIQAL SAGVVPRLGL PYIQVGRAAE IAALLRDVDR IGDGGACVRF 
VIGEYGAGKT FFANLIRLIA LERKCVTIHA DLAPDRRIHA TGGQARALYS EAVRNMATRT
KPEGGALAGV VERLVTDSVK EAAERSIPVE TVIDQKLAPI QEFVGGYDFA VVLKAYWKGS
EEGDEELKAA ALRWLRGEFS TKTEARQALG VRTIIDDGDI YDSLKSLACL TRIAGYAGLV
VMFDEMVNLY KLQSSQARNQ NFEEILRIVN DALQGNTSGI GFVMCGTPEF LMDTRRGLYS
YEALQSRLAE NRFATGGLVD YSGPVLRLQN LTPEDMMVLL TNIRAVFAGG DPERFLVPDE
ALHAFMDHCN KRIGEAYFRA PRTTVKAFVQ MLSVLEQNPT AKWQDLLGQV EVAPDAPDTQ
PTTEGETSNS GEEGDELTKL RLP
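
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of translation table 11 (the bacterial, archaeal and plant plastid code, whose codon-to-residue mapping matches the standard code), ignores alternative start codons, and strips the spaces that separate the 10-base blocks in this record. The `translate` helper is a hypothetical name written for this example, not a library function.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (translation table 11 assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block separators and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        residue = AMINO[16 * b1 + 4 * b2 + b3]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGACGAGCA CCACGAAGAA GATCAAGCCC"))  # MTSTTKKIKP
```

The output matches the first block of the protein sequence listed above. Passing the full gene sequence to the same function would be the natural next check.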