Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4476 |
Symbol | |
ID | 3912292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5066110 |
End bp | 5067087 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637886379 |
Product | peptidase U32 |
Protein accession | YP_488070 |
Protein GI | 86751574 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACTGA TCTGTCCGGC GGGCACGCCC GCGGCGCTGC ACGATGCCGT CGCGGTGGGG GCCGATGCGA TCTATTGCGG CTTCAACGAC GAGACCAATG CGCGCAATTT CCCGGGGCTG AATTTCAGCC GCGAGGAAAT GCGCGAGTCG ATCGCGCACG CGCATCGCTA CGGCGTCAAC GTGCTGGTGG CGATTAACAC CTTCGCCCGC GCCGGCAATG TCGAGCTGTG GCAGCGCGCG GTCGACGACG CGGTCGAGGC CGAAGCCGAT GCGGTGATCC TCGCCGATGT CGGCGTGATG GATTATTGCG CCAGGACCCA TCCGCAGCAG CGCCTGCACG TCTCGGTGCA GGCCGCCGCG GCGAACGCGG ATTCGATCCG GTTCTATGTC GACAGCTTCA ACGCCAAGCG CGTGGTGCTG CCGCGCGTGC TCAGCGTGCA GGAGATCGCC GCGATCACGC GCGAGGTCAA GTGCGAGACC GAAGTGTTCA TCTTCGGCGG GCTGTGCGTG ATGGAGGAGG GTCGCTGCTC GCTGTCGTCC TACGCCACCG GCAAATCGCC GAACATGGAC GGGGTCTGCT CGCCGGCGGG CGCGATCCAG TATCGCGAGG AGAACGGCGC GCTGATCTCG CGGCTCGGCG ATTTCACCAT CAACAAATTC GCCAAGGGCG AGGCGGCGGC CTATCCGACG CTGTGCAAGG GGCGCTACCA GACCGACGAG GGCTGCGGCT ATCTGTTCGA GGACCCGGCT TCGCTCGACG CCACCACGAT GCTGCCGGAG CTGCGCGCCG CCGGCGTCGC GGCGCTGAAG ATCGAGGGCC GCCAGCGCGG CCGCGCCTAT ATCGAGCGCG TGGTGAAGAC CTTCAAGGAG GTGCTGAGCG CGCTGGACGA CGGAAGGCCG TTGCCGGTCG ACGCGCTGCG CGGGCTCAGC GAGGGCCAGT CCAACACCAC CGGCGCCTAC AAGAAGACCT GGCGCTGA
|
Protein sequence | MELICPAGTP AALHDAVAVG ADAIYCGFND ETNARNFPGL NFSREEMRES IAHAHRYGVN VLVAINTFAR AGNVELWQRA VDDAVEAEAD AVILADVGVM DYCARTHPQQ RLHVSVQAAA ANADSIRFYV DSFNAKRVVL PRVLSVQEIA AITREVKCET EVFIFGGLCV MEEGRCSLSS YATGKSPNMD GVCSPAGAIQ YREENGALIS RLGDFTINKF AKGEAAAYPT LCKGRYQTDE GCGYLFEDPA SLDATTMLPE LRAAGVAALK IEGRQRGRAY IERVVKTFKE VLSALDDGRP LPVDALRGLS EGQSNTTGAY KKTWR
|
| |