Gene RPB_3666 details


Gene Information

Locus tag: RPB_3666
Symbol:
ID: 3911468
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4207840
End bp: 4209012
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 65%
IMG OID: 637885568
Product: hypothetical protein
Protein accession: YP_487272
Protein GI: 86750776
COG category: [V] Defense mechanisms
COG ID: [COG0842] ABC-type multidrug transport system, permease component
TIGRFAM ID:
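
The start and end coordinates, gene length, and protein length listed in this record are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4207840, 4209012)
print(length)                     # 1173
print(protein_length_aa(length))  # 390
```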


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.335217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.695679
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCCG CGGACGACAG CGCCACCGGC CGCATCCAGC CGATCAAGCC TCCGCGTTTC 
GGCTTCCTGC GCCGCACCTA TGCGATGCTG GTGAAGGAAC TGATCCAGCT CCGCCGCGAC
CGTCTCACCT TCGCGATGAT CGTGGTGATC CCGGTGATGC AGCTCCTGCT GTTCGGCTAC
GCCATCAACA CCACGCCCCG GCATCTGCCG ACCGCGGTGC TGCTCCAGGA GGACAGCGAT
CTCGGGCGTT CGATCCTGAA GGCGCTGGAG AACACAGCGT ATTTCGATTT CGTCCAGGAG
GTGCAGAGCG TCGAGCAGTT CGACAATCTG CTGTTGTCCG GCAAAGTGCT GTTCGGCGTC
GAGATCCCGC GCGGCTTCGA GCGCGCGGTG CGGCGCGGCG AGCGGCCGGC GCTGCTGGTC
GCCGCCGACG CCACTGATCC GGTCGCGGCC GGCTCTGCCC TGTCGGCTCT CGGCCGGATC
GTGCAGACCG CGCTGGAGCA CGACCGCGTC GCCGGCGATC CCGGCAATCC GCCGTTCGAG
ATCCGCGCGC ATGCGCGCTA CAATCCGGCG GCGTCGTCGC GGCTCAACAT CGTGCCCGGC
CTGGTCGGCA CCATCCTGAC GATGACCATG CTGATCTTCA CCGCGCTGTC GGTGACGCGC
GAGATCGAAC GCGGCACCAT GGAAAACCTG CTGTCGATGC CGATCACACC GGTCGAGGTG
ATGCTCGGCA AGATCCTGCC TTATGTCGGC GTCGGCTTCA TTCAGGCGTC GCTGATCATC
GGCATCGGCG TGTTCTTGTT CGGCGTGCCG CTGCGCGGCA GCCTGTTGCT GCTGGCGCTG
CTGTCGACGT TGTTCATCAC CACCAATCTG GCGATCGGCT ACACTTTCTC GACGCTGGTG
CAGAACCAGT TGCAGGCGAT GCAGGCCTCG ATGATGTTCT TCCTGCCGTC GATCCTGCTG
TCCGGCTTCA TGTTCCCGTT CGCCGGGATG CCGCAATGGG CGCAGTATCT CGGCGAATGC
CTGCCGCTGA CGCATTATGT GCGGATCGTC CGGGCGATCA TGCTGAAGGG ATCGACGCTG
GAGAATCTGC GCTACGACAC GCTGGCGCTG GCGGCCCTGA TGCTGATAGC GATGACGATC
GCGGTGACCC GATTCCGCAG GACGCTGGAC TGA
 
Protein sequence
MSAADDSATG RIQPIKPPRF GFLRRTYAML VKELIQLRRD RLTFAMIVVI PVMQLLLFGY 
AINTTPRHLP TAVLLQEDSD LGRSILKALE NTAYFDFVQE VQSVEQFDNL LLSGKVLFGV
EIPRGFERAV RRGERPALLV AADATDPVAA GSALSALGRI VQTALEHDRV AGDPGNPPFE
IRAHARYNPA ASSRLNIVPG LVGTILTMTM LIFTALSVTR EIERGTMENL LSMPITPVEV
MLGKILPYVG VGFIQASLII GIGVFLFGVP LRGSLLLLAL LSTLFITTNL AIGYTFSTLV
QNQLQAMQAS MMFFLPSILL SGFMFPFAGM PQWAQYLGEC LPLTHYVRIV RAIMLKGSTL
ENLRYDTLAL AALMLIAMTI AVTRFRRTLD
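
The sequences above are displayed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that step plus a GC-percentage calculation; the helper names `join_blocks` and `gc_percent` are hypothetical and chosen only for illustration.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(text.split())


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Final displayed line of the gene sequence, copied from the record.
tail = join_blocks("GCGGTGACCC GATTCCGCAG GACGCTGGAC TGA")
print(len(tail))             # 33
print(tail.endswith("TGA"))  # True
print(gc_percent("ATGC"))    # 50.0
```

Passing the complete gene sequence through `join_blocks` and then `gc_percent` gives a value that can be compared with the GC content listed in the record.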