Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2010 |
Symbol | |
ID | 3909516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2287428 |
End bp | 2289257 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637883904 |
Product | peptidase M24 |
Protein accession | YP_485629 |
Protein GI | 86749133 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.623866 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.492859 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGAAG CGCATTTCCA GACGTTCGAG GAGCCGGAAA GCGGCGTTGC CCTCACTGCG CGGCTCGCGG CGTTTCGCGA GGAGATGGTC CGGCGCCAGC TCACCGGCTT TGTGATTCCA CGCGCCGATC AGCAGCAAAA CGAATACGTG CCGGCCTGCG ACGAGCGGCT GGCCTGGCTC ACCGGCTTCA CCGGCTCGGC CGGCATGGCC GTGGTGCTGG TGCATCGGGC CGCATTGTTC GTCGATGGCC GCTACACGCT GCAGGCCGCC CAGCAGGTCG ACGGCAAGGC CTGGACGATC GAGTCGCTGG TGGAACCGCC GCCGGAGCGC TGGCTGGAAG CGCATCTGAA AGACGGCGAC CGCCTCGGAT TTGATCCGTG GCTGCACACT TCTTCGGCAG TCGAACGGAT GCAGGCGGCC TGCGCCAAGG CCAGCGCGGA GCTGGTCGCG GTCGAGAGCA ATCCGGTGGA TGGGGTGTGG ACCGAACGAC CCGCGCCGCC GCTGGGCCAG GTCAGCATCC ACGGGCTCGA ATTCTCCGGC GAGAGCGAGG CGGCCAAGCT CGAGCGCATC CGGGGCGAGC TGACACGGCT GAAGGCCGAC GCGCTGGTGC TGTCGGACTC GCACGCGGTG GCCTGGACCT TCAACATCCG CGGCGCCGAC GTGTCGCATA CGCCGCTGCC GCTGTCCTAC GCGGTTGTGC CGAAGGACGG CCGCCCGACC ATCTTCATCG ACGGCCGCAA GCTGTCGAAC GCGGCGCGCG ACCATCTCGA ACAGACTGCC CAGGTCGCCG AGCCCGCGGA GCTGGCGCCC ACGCTGCAGG CGCTGGCCGG CTCGGGCGCG TCGATCGCGC TCGACAGCGC CACCGCCGCC GATGCGCTGA CCCGGCTGGT CAGGGATGCC GGCGGCAAGC CGCTGCGCGG CGCCGATCCG GTCGCGCTAC TGAAGGCCGT CAAGAACGCC ACCGAGATCG AAGGCACCAA GACCGCGCAT CGCCGCGACG CCGTGGCGCT GGCGCGCTTC CTCGCCTTCA TCGATCGCGA GGCGCCGAAC GGATCGCTGA CCGAGATCGA CGCCGTCGAG GCGCTGGAGA GCTTCCGCCG CGACACGGGC GCGCTCAAGG ACGTCTCCTT CCCCACCATC TCCGGCACCG GCCCGAACGG CGCGATCGTG CATTATCGCG TCACCCGCAA GAGCAACCGC CGCATCCAGC CCGGCGACCT GCTGCTGATC GATTCCGGCG CGCAATATCA GGACGGCACT ACCGACGTCA CCCGCACCAT CGCGATCGGC GAGCCGACCG CCGAGATGTG CGACCGCTTC ACCCGGGTGC TGCGCGGCCA TATCGCCATC GCCCGCGCGG TATTTCCCGA CGGCACCACC GGCGCACAGC TCGACACACT GGCGCGGCAG TTCCTGTGGC AGGCCGGGAT CGATTTCGAG CACGGCACCG GCCACGGCGT CGGCAGCTAT TTGTCGGTGC ACGAAGGCCC GGCGCGGATC TCCAAGCTCG GCACAACGCC CTTGAAGCGC GGCATGATCC TGTCCAACGA GCCCGGCTAC TACAAGGCCG ACGGCTTCGG CATCCGGATC GAGAATCTCG AACTGGTTGT TGAGAAGTTG GTTGAAGGCG CCGAGAAGCC GATGAACGGA TTCGAGACGC TGACGCTGGC GCCGATCGAT CGCCGGTTGA TCGACACGGA CATGCTGAGC CGGAAGGAAC TGGCCTGGCT GAACGCCTAC CACGCCCGCG TCCGCGCCGA AGTGAGGCCG CATCTCGACG GCCCGACCCA AGCCTGGCTC GACTCCGCGA CCGCGCCGCT GGAGCGCTGA
|
Protein sequence | MFEAHFQTFE EPESGVALTA RLAAFREEMV RRQLTGFVIP RADQQQNEYV PACDERLAWL TGFTGSAGMA VVLVHRAALF VDGRYTLQAA QQVDGKAWTI ESLVEPPPER WLEAHLKDGD RLGFDPWLHT SSAVERMQAA CAKASAELVA VESNPVDGVW TERPAPPLGQ VSIHGLEFSG ESEAAKLERI RGELTRLKAD ALVLSDSHAV AWTFNIRGAD VSHTPLPLSY AVVPKDGRPT IFIDGRKLSN AARDHLEQTA QVAEPAELAP TLQALAGSGA SIALDSATAA DALTRLVRDA GGKPLRGADP VALLKAVKNA TEIEGTKTAH RRDAVALARF LAFIDREAPN GSLTEIDAVE ALESFRRDTG ALKDVSFPTI SGTGPNGAIV HYRVTRKSNR RIQPGDLLLI DSGAQYQDGT TDVTRTIAIG EPTAEMCDRF TRVLRGHIAI ARAVFPDGTT GAQLDTLARQ FLWQAGIDFE HGTGHGVGSY LSVHEGPARI SKLGTTPLKR GMILSNEPGY YKADGFGIRI ENLELVVEKL VEGAEKPMNG FETLTLAPID RRLIDTDMLS RKELAWLNAY HARVRAEVRP HLDGPTQAWL DSATAPLER
|
| |