Gene RPB_4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4003 
Symbol 
ID3911810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4569695 
End bp4570837 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content67% 
IMG OID637885907 
ProductO-methyltransferase, family 2 
Protein accessionYP_487607 
Protein GI86751111 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.414327 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000643794 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAACTCCT TGAGTCTGCG TGACCGGCTG CTCGGCTGGC GCGATTCAGT CTTGTCGAAC 
CCGCGGTTTC AGCGCTTCGC CGCGGTGTTT CCGCTGATGC GGCCTGTTGC GCGCCGCCGC
GCCGCCGCGA TGTTCGACCT CGTCGCCGGC TTCGTCTATT CCCAGATCCT GCTCGCCTGC
GTGCAGTTGC GGCTGTTCGA CCTGATCGCC GAACGGCCTG CCACCGTCGA CGAATTGTCG
GTGCGATGCG AGCTGCCGCG CGAATCGATG CAGATGCTGC TCGATGCGGC GATCGCGCTG
AAGCTGGTGC AGCCGCGCAG CGAAGGCCGC TATGGCCTCG GCCAGCTCGG CGCCGAACTG
TGCGGCAATC GCGGCGTGCT GGCGATGGTC GAGCATCACG CGATGCTGTA TCGCGACCTC
GCCGATCCGG TGGCGCTGCT GCGCGGCCCG CGCGGCGGCG GAGAACTCGC TGCTTACTGG
GCCTATGTCC GCGGCGAGCG GCCGGCCGAG CTCGGTGCGG AGCACGTCGC GTCCTACACC
GCGCTGATGG CCGCGTCGCA GCCGATGATC GCGCGCGAAG TGCTGCACGT GTTCTCGTTC
GGCGCTCATC GTTGCCTGCT CGACGTCGGC GGCGGCGACG GCTCGTTCCT GTCGGCAGTC
GCCGCGCAGA CCCCGGAGCT GCGCTGCATC CTGTTCGATC TTCCGGCCGT GGCCGCCAAG
GCGGCCGACC GCTTCCGTAC CAATGGCCTG GCCGAGCGCG CGACCGCGAT CGGCGGCAGT
TTCCGGACCG ACCCGCTGCC CGAAGGCGCC GATATCGTCT CGCTGGTGCG AGTCATCCAC
GACCATGACG ACGAGGTCGT CGCCGCGCTG TTGCGAGCGG TCCACAGCGC CCTTCCCGAG
CGGGGGACAC TGCTGATCGC CGAGCCGATC GCCGGCCTGT CGCGTACCGC GTCGATCTCG
GACGGCTATT TTGCCTTTTA TTTGAGGGCA ATGGGAACCG GTAAAGCCAG GACCTTCGAA
CATCTCCGAT CGCTGCTGGA GGCCGCCGGA TTCGCTGAGA TCAAGCTCCA CCTGGTTCCG
ATGCCACTGG TCGCCTCCGT AATTACCGCA ACCAAGACCT CCAAATGTGT TAATCTGGCT
TGA
 
Protein sequence
MNSLSLRDRL LGWRDSVLSN PRFQRFAAVF PLMRPVARRR AAAMFDLVAG FVYSQILLAC 
VQLRLFDLIA ERPATVDELS VRCELPRESM QMLLDAAIAL KLVQPRSEGR YGLGQLGAEL
CGNRGVLAMV EHHAMLYRDL ADPVALLRGP RGGGELAAYW AYVRGERPAE LGAEHVASYT
ALMAASQPMI AREVLHVFSF GAHRCLLDVG GGDGSFLSAV AAQTPELRCI LFDLPAVAAK
AADRFRTNGL AERATAIGGS FRTDPLPEGA DIVSLVRVIH DHDDEVVAAL LRAVHSALPE
RGTLLIAEPI AGLSRTASIS DGYFAFYLRA MGTGKARTFE HLRSLLEAAG FAEIKLHLVP
MPLVASVITA TKTSKCVNLA