Gene RPC_3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3039 
Symbol 
ID3973492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3363189 
End bp3364337 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content59% 
IMG OID637926150 
Producthypothetical protein 
Protein accessionYP_532903 
Protein GI90424533 
COG category[V] Defense mechanisms 
COG ID[COG0842] ABC-type multidrug transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.320364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG CGCGCGGGAG AATGCAAGCA CGCGCAACGC AAAAGCTGCG ACGCATTCGG 
GCGCTGGTCC GAAAGGAATT CTACCAGATC ATCCGCGACC CGAGCAGCAT GACGATCGGC
GTGGTGATGC CGATCTTGAT GGTGCTCCTG TTCGGCTATG GATTGTCGCT CGACGTCCGG
AGCGTCCCGA TCGCGGTCGT GGTCGAAGAT TTTTCGCCGG AGACCACCGA ACTGGTATCG
TCGTTCCGGT CATCGGCCTA CTTCGACGTC CGGGTCGTCA CATCCCTGCA CCGCGCCACC
CAGCTGATAC AGGAACGCAA ACTCGACGGC ATCGTTCGGA TACGCGCTGA CTTCTCACGG
CAACTCGCGA TGGGCAACAC CGACTTGCAG GTTCTGGTTC GCGGGACCGA TGCGAATACA
GCCCGCATCG TCCGGAACTA TGCCATGGGT GCGATCCGCC GTTGGCAGCT TCACCGCCCT
GACTCCGGCG GCGCCCTCCT CCCGCCGGCC GCGACGATCG CAAGCCGAAT GTGGTTCAAC
GCGAACAACG ACAGTCGTTA CTATCTCGTT CCAGGCGTGA TCGTATTGAT CGTGACGCTG
ATCGGGGCTT TTCTCACCGC CCTGGTCATG GCGCGCGAAT GGGAGCGTGG CACTCTCGAG
TCGCTGTTCG TAACGCCGGT GCAACCGGGC GAAATTCTTC TTGGAAAGAC CATTCCATAC
TTCATCCTCG GAATGACCGG GCTGTTTCTG TGCATCGGCC TGGCAACCCT ACTATTCGAC
GTTCCCCTCC GAGGTTCCTT CTGGATACTG ACACTGGTGT CGATGCTGTA TCTTCTCGTC
GCTCTTGGAA TAGGTCTTTG GGTGTCGTCA GCGACCCGAA GCCAGTTCGT CGCGAGCCAG
GCGACGCTCC TGCTGACCTT TCTGCCCGCC CTCATGCTCT CCGGATTCTT GTTCGATCTT
CGCAGCATGC CGACCGCCGT TCGATGGATC ACCTATCTGT TTCCAGCGCG ATACTATGTG
AGTACGCTTC AGACACTCTT CCTTGCGGGA AACGTGTGGA GCATCATTCT GCCAAATGCA
CTAGTCCTCG GATCCATCGC GACGGTTCTC CTGATCTTCT CGGCCCGCGC AACACGAAAG
ATCCTGTGA
 
Protein sequence
MTTARGRMQA RATQKLRRIR ALVRKEFYQI IRDPSSMTIG VVMPILMVLL FGYGLSLDVR 
SVPIAVVVED FSPETTELVS SFRSSAYFDV RVVTSLHRAT QLIQERKLDG IVRIRADFSR
QLAMGNTDLQ VLVRGTDANT ARIVRNYAMG AIRRWQLHRP DSGGALLPPA ATIASRMWFN
ANNDSRYYLV PGVIVLIVTL IGAFLTALVM AREWERGTLE SLFVTPVQPG EILLGKTIPY
FILGMTGLFL CIGLATLLFD VPLRGSFWIL TLVSMLYLLV ALGIGLWVSS ATRSQFVASQ
ATLLLTFLPA LMLSGFLFDL RSMPTAVRWI TYLFPARYYV STLQTLFLAG NVWSIILPNA
LVLGSIATVL LIFSARATRK IL