Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4033 |
Symbol | |
ID | 6411716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4322736 |
End bp | 4324565 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642713915 |
Product | peptidase M24 |
Protein accession | YP_001993004 |
Protein GI | 192292399 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0371285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGAGT CGCATTTCCA GAGCTTCGAG GAGCCCGAGA GCGGCGTCGC ACTCACCGCG CGGCTCTCGG CGTTTCGCGA AGAGCTGCTG CGGCGCAAGC TCACCGGATT TGCGATTCCC CGCGCCGATC AGCAACAAAA CGAATATGTG CCGCCGTCCG ACGAGCGGCT GGCGTGGCTC ACCGGCTTCA CCGGATCGGC TGGTCTCGCT TACGTGCTGA TCGATCAGGC CGCGCTGTTC GTCGACGGCC GCTACACGTT GCAGGCCGCC AAACAGGTCG ACGGCAATGC CTGGCGCATC GAGTCGCTGG TCGAACCGCC GCCGGAACGC TGGCTGGAGA CGCATCTGAA AGCCGGTGAT CGCCTCGGAT TTGATCCGTG GCTGCACACT TCTTCGGCAG TCGAACGGAT GCAGGCCGCC TGCGCCAAGG CCGGCGCCGA GCTGGTCGCG GTCGACGGCA ATCCGGTCGA CGCGGTGTGG AGCGAACGTC CCGCGCCGCC GCTCGGACCG GTCACTGTCC ACGGCGTGGA ATTTGCCGGC GAGAGCGAGG CCAGCAAGCT CGGCCGCATC AACGAGGAGC TGGCCCGGCT GAAGGCCGAC GCGCTGGTGC TTTCGGATTC CCACGCCGTC GCCTGGACCT TCAACATCCG CGGCGCCGAC GTCTCGCACA CGCCGCTGCC GCTATCCTAC GCGCTGCTGC CGAAGGATGG CCGCCCCACC ATCTTCATCG ACGGCCGCAA GCTGTCGAAC AGCGTGCGCG ATCATCTTGA GCAGACCGCC GACGTCGCCG AGCCGGCCGA GTTGGCGCCG ATGCTGCGCG AACTCGCCAA GACCGGCGCG ACCATCGCGC TCGACAGCGC CACTGCGGCC GATGCACTGA CCCGGCTGAT CAAGGATGCC GGCGGCAAGC CGCTGCGCGG CGCCGATCCG GTGGCGCTGC TCAAGGCCGT GAAGAACACC GCCGAGATCG ACGGCACCCG CGCGGCGCAT CGCCGCGACG CGGTGGCGCT GGCGCGGTTC CTGGCGTTCA TCGATGCGGA AGCGCCGAAG GGCGCACTGA CCGAGATCGA CGCGGTGGAG GCGCTGGAGA CGTTCCGCCG CGACACCGGC GCGCTCAAGG ACGTGTCGTT CCCGACCATC TCCGGCACCG GCCCGAACGG CGCGATCGTG CATTATCGCG TCACCCGCAA GAGCAACCGC CGCATCCAGC CGGGCGACCT CTTACTGATC GATAGCGGCG CGCAGTATCA GGACGGCACC ACCGACGTCA CCCGCACCAT CGCGGTCGGC GAGCCGACGG CGGAGATGCG CGACCGCTTC ACCCGCGTAC TGCGGGGCCA TCTGGCAATC GCCCGCGCGG TGTTTCCCGA CGGCACCACC GGCGCGCAGC TCGACACGCT GGCGCGGCAG TTCCTGTGGC AGGCCGGAAT CGATTTCGAG CACGGCACCG GCCACGGCGT CGGCAGCTAT CTGTCGGTGC ATGAAGGCCC GGCGCGGATC TCCAAACTCG GCACCACCCC GCTGAAGCGC GGCATGATCC TGTCCAACGA GCCGGGCTAC TACAAGACCG ACGGCTTCGG CATCCGGATC GAGAACCTCG AACTGGTGGT CGAGAAACAG ATCGACGGCG CCGAGAAGCC GATGAACGGT TTCGAGACCC TGACGCTAGC GCCGATCGAC CGCCGCCTGA TCGACGTGGC GATGCTCAGC GCCGAGGAAC GCACCTGGCT CGACGCCTAC CACGCCCGCG TCCGCGAAAC CGTCCGCCCT CACCTCGACG GCCCGACCCA ACTCTGGCTC GACGCAGCGA CCGCGCCGCT GCAATCCTAA
|
Protein sequence | MFESHFQSFE EPESGVALTA RLSAFREELL RRKLTGFAIP RADQQQNEYV PPSDERLAWL TGFTGSAGLA YVLIDQAALF VDGRYTLQAA KQVDGNAWRI ESLVEPPPER WLETHLKAGD RLGFDPWLHT SSAVERMQAA CAKAGAELVA VDGNPVDAVW SERPAPPLGP VTVHGVEFAG ESEASKLGRI NEELARLKAD ALVLSDSHAV AWTFNIRGAD VSHTPLPLSY ALLPKDGRPT IFIDGRKLSN SVRDHLEQTA DVAEPAELAP MLRELAKTGA TIALDSATAA DALTRLIKDA GGKPLRGADP VALLKAVKNT AEIDGTRAAH RRDAVALARF LAFIDAEAPK GALTEIDAVE ALETFRRDTG ALKDVSFPTI SGTGPNGAIV HYRVTRKSNR RIQPGDLLLI DSGAQYQDGT TDVTRTIAVG EPTAEMRDRF TRVLRGHLAI ARAVFPDGTT GAQLDTLARQ FLWQAGIDFE HGTGHGVGSY LSVHEGPARI SKLGTTPLKR GMILSNEPGY YKTDGFGIRI ENLELVVEKQ IDGAEKPMNG FETLTLAPID RRLIDVAMLS AEERTWLDAY HARVRETVRP HLDGPTQLWL DAATAPLQS
|
| |