Gene RPB_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2010 
Symbol 
ID3909516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2287428 
End bp2289257 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content69% 
IMG OID637883904 
Productpeptidase M24 
Protein accessionYP_485629 
Protein GI86749133 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.623866 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.492859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGAAG CGCATTTCCA GACGTTCGAG GAGCCGGAAA GCGGCGTTGC CCTCACTGCG 
CGGCTCGCGG CGTTTCGCGA GGAGATGGTC CGGCGCCAGC TCACCGGCTT TGTGATTCCA
CGCGCCGATC AGCAGCAAAA CGAATACGTG CCGGCCTGCG ACGAGCGGCT GGCCTGGCTC
ACCGGCTTCA CCGGCTCGGC CGGCATGGCC GTGGTGCTGG TGCATCGGGC CGCATTGTTC
GTCGATGGCC GCTACACGCT GCAGGCCGCC CAGCAGGTCG ACGGCAAGGC CTGGACGATC
GAGTCGCTGG TGGAACCGCC GCCGGAGCGC TGGCTGGAAG CGCATCTGAA AGACGGCGAC
CGCCTCGGAT TTGATCCGTG GCTGCACACT TCTTCGGCAG TCGAACGGAT GCAGGCGGCC
TGCGCCAAGG CCAGCGCGGA GCTGGTCGCG GTCGAGAGCA ATCCGGTGGA TGGGGTGTGG
ACCGAACGAC CCGCGCCGCC GCTGGGCCAG GTCAGCATCC ACGGGCTCGA ATTCTCCGGC
GAGAGCGAGG CGGCCAAGCT CGAGCGCATC CGGGGCGAGC TGACACGGCT GAAGGCCGAC
GCGCTGGTGC TGTCGGACTC GCACGCGGTG GCCTGGACCT TCAACATCCG CGGCGCCGAC
GTGTCGCATA CGCCGCTGCC GCTGTCCTAC GCGGTTGTGC CGAAGGACGG CCGCCCGACC
ATCTTCATCG ACGGCCGCAA GCTGTCGAAC GCGGCGCGCG ACCATCTCGA ACAGACTGCC
CAGGTCGCCG AGCCCGCGGA GCTGGCGCCC ACGCTGCAGG CGCTGGCCGG CTCGGGCGCG
TCGATCGCGC TCGACAGCGC CACCGCCGCC GATGCGCTGA CCCGGCTGGT CAGGGATGCC
GGCGGCAAGC CGCTGCGCGG CGCCGATCCG GTCGCGCTAC TGAAGGCCGT CAAGAACGCC
ACCGAGATCG AAGGCACCAA GACCGCGCAT CGCCGCGACG CCGTGGCGCT GGCGCGCTTC
CTCGCCTTCA TCGATCGCGA GGCGCCGAAC GGATCGCTGA CCGAGATCGA CGCCGTCGAG
GCGCTGGAGA GCTTCCGCCG CGACACGGGC GCGCTCAAGG ACGTCTCCTT CCCCACCATC
TCCGGCACCG GCCCGAACGG CGCGATCGTG CATTATCGCG TCACCCGCAA GAGCAACCGC
CGCATCCAGC CCGGCGACCT GCTGCTGATC GATTCCGGCG CGCAATATCA GGACGGCACT
ACCGACGTCA CCCGCACCAT CGCGATCGGC GAGCCGACCG CCGAGATGTG CGACCGCTTC
ACCCGGGTGC TGCGCGGCCA TATCGCCATC GCCCGCGCGG TATTTCCCGA CGGCACCACC
GGCGCACAGC TCGACACACT GGCGCGGCAG TTCCTGTGGC AGGCCGGGAT CGATTTCGAG
CACGGCACCG GCCACGGCGT CGGCAGCTAT TTGTCGGTGC ACGAAGGCCC GGCGCGGATC
TCCAAGCTCG GCACAACGCC CTTGAAGCGC GGCATGATCC TGTCCAACGA GCCCGGCTAC
TACAAGGCCG ACGGCTTCGG CATCCGGATC GAGAATCTCG AACTGGTTGT TGAGAAGTTG
GTTGAAGGCG CCGAGAAGCC GATGAACGGA TTCGAGACGC TGACGCTGGC GCCGATCGAT
CGCCGGTTGA TCGACACGGA CATGCTGAGC CGGAAGGAAC TGGCCTGGCT GAACGCCTAC
CACGCCCGCG TCCGCGCCGA AGTGAGGCCG CATCTCGACG GCCCGACCCA AGCCTGGCTC
GACTCCGCGA CCGCGCCGCT GGAGCGCTGA
 
Protein sequence
MFEAHFQTFE EPESGVALTA RLAAFREEMV RRQLTGFVIP RADQQQNEYV PACDERLAWL 
TGFTGSAGMA VVLVHRAALF VDGRYTLQAA QQVDGKAWTI ESLVEPPPER WLEAHLKDGD
RLGFDPWLHT SSAVERMQAA CAKASAELVA VESNPVDGVW TERPAPPLGQ VSIHGLEFSG
ESEAAKLERI RGELTRLKAD ALVLSDSHAV AWTFNIRGAD VSHTPLPLSY AVVPKDGRPT
IFIDGRKLSN AARDHLEQTA QVAEPAELAP TLQALAGSGA SIALDSATAA DALTRLVRDA
GGKPLRGADP VALLKAVKNA TEIEGTKTAH RRDAVALARF LAFIDREAPN GSLTEIDAVE
ALESFRRDTG ALKDVSFPTI SGTGPNGAIV HYRVTRKSNR RIQPGDLLLI DSGAQYQDGT
TDVTRTIAIG EPTAEMCDRF TRVLRGHIAI ARAVFPDGTT GAQLDTLARQ FLWQAGIDFE
HGTGHGVGSY LSVHEGPARI SKLGTTPLKR GMILSNEPGY YKADGFGIRI ENLELVVEKL
VEGAEKPMNG FETLTLAPID RRLIDTDMLS RKELAWLNAY HARVRAEVRP HLDGPTQAWL
DSATAPLER