Gene RPB_1940 details


Gene Information

Locus tag: RPB_1940
Symbol:
ID: 3908019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2208359
End bp: 2209441
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 67%
IMG OID: 637883834
Product: hypothetical protein
Protein accession: YP_485559
Protein GI: 86749063
COG category:
COG ID:
TIGRFAM ID:
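
The start and end coordinates, the gene length, and the protein length in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2208359, 2209441)
print(length, protein_length(length))  # prints: 1083 360
```

Under these assumptions the computed values agree with the listed gene length of 1083 bp and protein length of 360 aa.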


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.310986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCGCC ACAGGTCCGG CTTTCGGCGG CGGTTGTTTG GCGGTAGGCT GGCAACCATG 
ATGTGGGGAT TAGCGGTGCT GCGCAAAGCT TTCATGAGAG TGTTCTCGGC CCATCCGGTC
GAGACCATCG CCGCCTGCAC CGGTCTGTTC ATGATCGTGC TCGGCTTCGC GCTCGGCACC
CAGACCGGCT GGGTCTGGTT CTGGGGCCAG GTCTCGGCGA TCTGCGGCGC CAGCCTCACC
ACCCTCGCCA TCGTGTGGAC GGTGCGCCAG CGCGCCGCCG AAGCGCGCAC CGCCGCCCAT
CTCAACGTGC TGGCGCGCCA GCTCGAGACC TCGATCAGCC AGATCCACAA ATCGGTCAAG
GGTGGCCGCG ACGAGTCGCT GCCCTCCGCG GTGTATCTCG CCCAGATCGA ACAGTGCATC
GACAATCTGA TCGGCGTCAC CCACGAGATC GGTGAGATCA CCGGCCGGCA ATTGCCGTTC
GACGTGATCG AGATGCTGAC CGCACGCGAC ACGCTGCATA TCGAGGCGCA TATCACCGAC
GCCCGCGCCC CGGAGCGCAA GCTCGCCACC GAGACCGAGA ACTGCCCGGC CTGCAAGAAG
CCGGTCGAAT TCCAGATCGG CAGCCTCCCC GGCGACAGCG CCAAGCCGAC CTGCCAGCAT
TGCGGCCAGC GCTTCCACGC CCACCGCTCG ACCACCGGCG GCATTTTCCT GCGGATGCCC
GGCGCCTCGC AATTCACCCG CGCGGTGTCG GTGAGCTGCC CGGTCTGCGC CAGCAAGATC
CCGGCGAATA TCGACCAGGG CAAGCAGCAT TCGGAGACGC GGTTCTGCTT CTCTTGCGGC
GCCAAGGTCT CGATCGATCC GCTCGGGCAG GAATGCACGC TGATCAGCAA GCAGGACCGC
CTGCCCGGCC ATTTCAACGC CGCCGGCAGC CTGGTGTGCG AGACCTGCAG CGAGCCGGCG
ATCGTGCTGA CCTCGAACAG CGCCGGCACC TACGGCATCT GCCGCAAGGA CGACGGCCTG
GTGTTCCTGC CCGCAAGCAG CGCTCCGGCC GCGGCACCGT CGCGCAGCGA CGCGGCCGAG
TAG
 
Protein sequence
MLRHRSGFRR RLFGGRLATM MWGLAVLRKA FMRVFSAHPV ETIAACTGLF MIVLGFALGT 
QTGWVWFWGQ VSAICGASLT TLAIVWTVRQ RAAEARTAAH LNVLARQLET SISQIHKSVK
GGRDESLPSA VYLAQIEQCI DNLIGVTHEI GEITGRQLPF DVIEMLTARD TLHIEAHITD
ARAPERKLAT ETENCPACKK PVEFQIGSLP GDSAKPTCQH CGQRFHAHRS TTGGIFLRMP
GASQFTRAVS VSCPVCASKI PANIDQGKQH SETRFCFSCG AKVSIDPLGQ ECTLISKQDR
LPGHFNAAGS LVCETCSEPA IVLTSNSAGT YGICRKDDGL VFLPASSAPA AAPSRSDAAE
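
The gene and protein sequences can be cross-checked against each other and against the reported GC content. The following is a minimal sketch, not the method used to produce this record: the helper names are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG).

```python
# Standard codon-to-amino-acid assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCTCCGCC ACAGGTCCGG CTTTCGGCGG"))  # prints: MLRHRSGFRR
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence, or passing it to `gc_content` and comparing with the reported 67%, gives a quick consistency check on the record.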