Gene RPB_2000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2000 
Symbol 
ID3909506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2272021 
End bp2273340 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content69% 
IMG OID637883894 
Producthypothetical protein 
Protein accessionYP_485619 
Protein GI86749123 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.846413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAACGAC AATTCGCGCC GCGATTGCGG CAATTGCACT GGTACGAGAT CGCGCTGATC 
GCCGCCGTCG CGCTGGCGCT CTGCATCTAT CTGCCGATCG TGGTGAAGCG CTCGGCGGTG
CACGGGTTCG GCGACGTCCA GGTGTTCTTC CGCGCCGGCT GGGCGGTGTG GACAGGCTAT
CCCCTGTATG AAGTCGCCGA TCATCACGGC TGGACCTATC ACTACCCGCC GTTCTTCGCG
TTGCTGATGG GGCCGTTCGC CTATCCGCTC GACGGCTATC CCAAACCGGC CTGGGCGCTG
CCGCTGCCGA TGTCGGTCGC GGTGTGGTAC GCGCTCAGCG CCGCCGCCGT GCTGATCGCG
ATCGACTGGT GGGCGAGGGC GCTCGAACGC GGCGCTCCGC CTCTGCTCCA GGAGGCCGGC
TGGAACGGGT GGTGGATGCT GCGGATCGGG CCGCTGGCGA CGCTGCTGCC GTTCATCGGC
GACGGTTTCG GCCGCGGGCA GCCGAGCGCG CTGGTGCTGC TGACGATGGT GGCGTTCCTC
GTGCTTTACG TGCAGGGCCG GATCTATGCG GCGGCGTGCG CGCTGGCGAT CGGCTTCACC
ATCAAGCTGT TTCCGATCGC GCTGCTGCTG TTTCCGATAC TGCGGCGGGA CGTGAAAACG
GTGCTGGCCA CCGCCGGCTT CAGTCTCGTC TTCCTGTTCG TCGTGCCGAC GCTGTGCCTC
GGGCCGGCCG AAGTGATCAA ACTCTACACC GCGATGTGGA CCGAGCATCT CAACGGCATC
CTCACCGGCG TGCCGAACGC CAAGATTGCC GCCGAAATCA CCTTCACCTC GTACGACATG
CTGAGCATCG CCGCGATGCT GGCGCGGATC GGCGCTGGCG GCCTGCCCGC GGCCGACACG
CTGCCGGGAT TCGCCACCGC GGGACAGATG GCCTTCAACG CCGCGTTCGC GGCCGCGCTT
CTGTGGATCG GCCACGGCCG GTTCTGGCGC CTGACCGGGC CGCAACCGCC GCCGTCCGAA
GCGCTGCTGA TCGGCGGCGC CATCCTGTGC GCCGCGCTGC CGGTGATGCT GTCGGTGTCG
CAGCCGAACT ACGTGGCGTT CGTCGCGCCG TTGGTCGCAG TGGTGCAGGT CGACGCGTGG
CGACGCAGCG GCGAGGTCAG GGTGGTGCCG CTGCTGATCG CCTGGGCCGG CGTGATGTGG
CTCGGCATGG TCGCGACCGA AGGCGGGGTC TGGCAGCCGC TGCGCGTGAT CGGGCTGGCG
ACGCCGGCGA TCGTGGCGCT GCTCGGCTGG GGCCTCGTCT GCCTGCGGCG GGCGCGATAG
 
Protein sequence
MQRQFAPRLR QLHWYEIALI AAVALALCIY LPIVVKRSAV HGFGDVQVFF RAGWAVWTGY 
PLYEVADHHG WTYHYPPFFA LLMGPFAYPL DGYPKPAWAL PLPMSVAVWY ALSAAAVLIA
IDWWARALER GAPPLLQEAG WNGWWMLRIG PLATLLPFIG DGFGRGQPSA LVLLTMVAFL
VLYVQGRIYA AACALAIGFT IKLFPIALLL FPILRRDVKT VLATAGFSLV FLFVVPTLCL
GPAEVIKLYT AMWTEHLNGI LTGVPNAKIA AEITFTSYDM LSIAAMLARI GAGGLPAADT
LPGFATAGQM AFNAAFAAAL LWIGHGRFWR LTGPQPPPSE ALLIGGAILC AALPVMLSVS
QPNYVAFVAP LVAVVQVDAW RRSGEVRVVP LLIAWAGVMW LGMVATEGGV WQPLRVIGLA
TPAIVALLGW GLVCLRRAR