Gene RPD_3379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3379 
Symbol 
ID4023891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3749123 
End bp3750952 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content68% 
IMG OID637963584 
Productpeptidase M24 
Protein accessionYP_570504 
Protein GI91977845 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.379289 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00212976 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTCGAAG CGCATTTCCA GACCTTCGAT GAGCCGGAAC ATGGCGTCGC CCTGAGCGCT 
CGGCTGGCCG CGTTCCGCGA GGAGCTGGCG CGCCGGACGC TCAATGGATT CATCGTTCCG
CGCGCCGATC AGCAGCAAAA TGAGTATGTT CCGCCGTCTG AAGAGCGGCT GGCCTGGCTC
ACCGGCTTCA CCGGATCGGC TGGTCTCGCA GTGGTGTTGA CCCATCAGGC CGCTGTGTTC
GTCGATGGAC GCTACACGCT GCAGGCGGCC AAGCAGGTCG ACGGCGAGGC GTGGACGATC
GAGTCGCTGG TCGAACCGCC GCCGGAGCGC TGGCTGGAGC AGCATCTGAA GCCGGGCGAC
CGCCTCGGAT TTGATCCGTG GCTGCACACT TCTTCGGCAG TGGAGCGAAT GCAGGCGGCC
TGCGCCAAGG CCGGCGCGGA GCTGATCGCA GTCGACGGCA ATCCTGTCGA CGCGGTGTGG
ACCGAACGCC CGGCCCCGCC GCTCGGCCAG GTCAGCGTCC ACGGCGTGGA GTTTTCCGGC
GAGAGCGAGG CCGCCAAGCT CGACCGGATC CGCAGCGAAC TCGACCGGCT GAAGGCCGAC
GCGCTGGTTC TGTCGGATTC GCACGCGGTG GCCTGGACCT TCAACATCCG CGGCGCCGAC
GTCGCCCATA CGCCGCTGCC GCTGTCCTAC GCGCTGGTGC CGACGCAGGG ACGGCCGACC
ATCTTCATCG ATGCGCGCAA GCTGTCGAAC AGCGCCCGCG ACCATCTCGA ACAGACCGCG
CAGGTGGCGG AGCCCTCGGC GCTGGCGCCG GCGCTGCAGG CGCTCGCCGC CAGCGGCGGA
GCGATCGCGC TCGACAGCGC CACCGCTGCA GACGCGCTGA CGCGGCTGAT TACCGAGGCC
GGCGGCAAGC CGCTGCGCGG CGCCGATCCG GTCGCGCTGC TGAAAGCGGT GAAGAACGTC
ACCGAAATCG AGGGCACGCG CACCGCGCAT CGCCGCGACG CCGTGGCGCT GGCGCGCTTC
CTGGCTTTCA TCGACCGCGA GGCGCCGAAA GGCACCCTCA CCGAAATCGA CGCCGTCGAA
GCGCTGGAGA CCTTCCGCCG CGACACCGGG GCGCTGAAGG ACGTCTCCTT CCCGACCATC
TCGGGCACCG GGCCGAACGG CGCGATCGTG CACTATCGCG TCACCCGCGC CAGCAACCGC
CGCATCCATC CCGGTGATCT GCTGCTGATC GATTCCGGCG CGCAGTATCA GGACGGCACC
ACCGACGTCA CCCGCACCAT CGCGATCGGC GAGCCGAGCG ATGAGATGCG CGACCGCTTC
ACCCGGGTGC TGCGCGGCCA TATCGCGATC GCGCGTGCGG TGTTTCCCGA CGGCGCCACC
GGCGCGCAAC TCGACACGCT GGCGCGGCAG TTCCTGTGGC AGGCCGGAAT CGATTTCGAC
CACGGCACCG GCCACGGCGT CGGCAGCTAT CTGTCGGTGC ACGAAGGCCC GGCGCGAATC
TCCAAGCTCG GCACGACGCC GCTGAAGCGC GGCATGATCC TGTCGAACGA ACCCGGCTAC
TACAAGACCG ACGGCTTCGG CATCCGGATC GAGAATCTCG AGCTGGTGGT CGAGGCGACG
ATCGACGGCG CCGAAAAGCC GATGAACGCA TTCGAGACGT TGACGCTGGC GCCGATCGAC
CGCCGCCTGA TCGACATCGA ACTGATCAGC GCCAAGGAAT TGGCCTGGCT GAACGACTAC
CACGCCCGCG TCCGGCGCGA GGTTCGCCCG CATCTCGATG GCCCGACGCA GCTATGGCTC
GACGAAGCGA CGGCGCCGCT GGAGCGGTGA
 
Protein sequence
MFEAHFQTFD EPEHGVALSA RLAAFREELA RRTLNGFIVP RADQQQNEYV PPSEERLAWL 
TGFTGSAGLA VVLTHQAAVF VDGRYTLQAA KQVDGEAWTI ESLVEPPPER WLEQHLKPGD
RLGFDPWLHT SSAVERMQAA CAKAGAELIA VDGNPVDAVW TERPAPPLGQ VSVHGVEFSG
ESEAAKLDRI RSELDRLKAD ALVLSDSHAV AWTFNIRGAD VAHTPLPLSY ALVPTQGRPT
IFIDARKLSN SARDHLEQTA QVAEPSALAP ALQALAASGG AIALDSATAA DALTRLITEA
GGKPLRGADP VALLKAVKNV TEIEGTRTAH RRDAVALARF LAFIDREAPK GTLTEIDAVE
ALETFRRDTG ALKDVSFPTI SGTGPNGAIV HYRVTRASNR RIHPGDLLLI DSGAQYQDGT
TDVTRTIAIG EPSDEMRDRF TRVLRGHIAI ARAVFPDGAT GAQLDTLARQ FLWQAGIDFD
HGTGHGVGSY LSVHEGPARI SKLGTTPLKR GMILSNEPGY YKTDGFGIRI ENLELVVEAT
IDGAEKPMNA FETLTLAPID RRLIDIELIS AKELAWLNDY HARVRREVRP HLDGPTQLWL
DEATAPLER