Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_1003 |
Symbol | |
ID | 6408658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 1065970 |
End bp | 1066947 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642710917 |
Product | peptidase U32 |
Protein accession | YP_001990035 |
Protein GI | 192289430 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTGA TCTGTCCCGC TGGTACGCCG GCAGCGCTGC ATGATGCCGT CGCCGCAGGC GCCGATGCGA TCTATTGCGG TTTCAACGAC GAAACCAACG CGCGCAACTT CCCGGGGCTG AACTTCAGCC GCGAGGAGAT GCGTGAGTCG ATCGCGCACG CGCATCGCTA CGGCACCAAG GTGCTGGTGG CGATCAACAC CTTCGCCCGC GCCGGCAATG TCGAGCTGTG GCAGCGCTCG GTCGACGATG CGGTCGAAGC CGAGGCCGAC GCGCTGATCC TCGCCGACGT CGGCGTGATG GATTACTGCG CCAAGACCCA TCCAGAGCAG CGGCTGCACG TCTCGGTGCA GGCGGCTGCC GCCAATCCGG ATTCGATCCG GTTCTATGTC GACAGCTTCA ACGCCCAGCG CGTGGTGCTG CCGCGCGTGC TCAGCGTGCA GGAAATCGCC GCGATCACCA AAGAGGTGAA GGTCGAGACC GAGGTCTTCA TCTTCGGCGG CCTGTGCGTG ATGGAGGAGG GCCGCTGCTC GCTGTCGTCC TACGCCACCG GCAAGTCGCC GAACATGGAC GGCGTCTGCT CGCCGGCCGC CTCGATCCAG TATCGCGAGC AGAACGGCTC GCTGGTGTCG CGGCTCGGTG AGTTCACCAT CAACAAATTC GCCAAGGGCG AGGCTGCGGC GTATCCGACG CTGTGCAAGG GCCGCTACCA GACCGACGAA GGCTGCGGCT ATCTGTTCGA AGACCCGGCC AGCCTCGATG CCACCACGAT GCTGCCGGAC CTGCGCGCCG CCGGCGTCGC GGCGCTGAAG ATCGAAGGCC GCCAACGCGG CCGGGCCTAT ATCGAGCGGG TGGTGAAGAC CTTCAAGGAC GTGCTCGCCG CGCTCGATGA CGGACGGCCG CTGCCGGTCG ACGCACTGCG CGGCCTGACC GAAGGCCAAT CCACCACCAC CGGCGCCTAC AAGAAGACTT GGCGCTGA
|
Protein sequence | MELICPAGTP AALHDAVAAG ADAIYCGFND ETNARNFPGL NFSREEMRES IAHAHRYGTK VLVAINTFAR AGNVELWQRS VDDAVEAEAD ALILADVGVM DYCAKTHPEQ RLHVSVQAAA ANPDSIRFYV DSFNAQRVVL PRVLSVQEIA AITKEVKVET EVFIFGGLCV MEEGRCSLSS YATGKSPNMD GVCSPAASIQ YREQNGSLVS RLGEFTINKF AKGEAAAYPT LCKGRYQTDE GCGYLFEDPA SLDATTMLPD LRAAGVAALK IEGRQRGRAY IERVVKTFKD VLAALDDGRP LPVDALRGLT EGQSTTTGAY KKTWR
|
| |