Gene Information |
Locus tag | Rsph17029_2117 |
Symbol | |
ID | 4895395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2245038 |
End bp | 2246018 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640112711 |
Product | peptidase U32 |
Protein accession | YP_001043992 |
Protein GI | 126462878 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
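The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon (both assumptions agree with the values in this record). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2245038, 2246018)
print(length)                     # 981
print(protein_length_aa(length))  # 326
```

The strand does not affect the length calculation; it only determines which end of the interval the gene starts at.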
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.871657 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.950378 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAACTCG TCTGCCCTGC CGGCACCCCT GCCGCGCTGC GCGCCGCCGT GGAAGCCGGT GCCCATTCCG TCTATTGCGG CTTTGCCGAC GAGACGAATG CCCGCAACTT CCCCGGCCTG AACTTCTCGC CGAAGGAACT GGCCGAGGGC GTGGCCTTCG CCCACAAGCA TGGCGCGCAT GTTCTGGTCG CGATCAACAC CTTTCCGCGG GCGGGAGACG AGTCGCTCTG GCATCGCAAC ATCGCCGCAG CCGAAGCCGC GGGCGCCGAT GCGGTGATCC TCGCCGACAT GGGGCTTCTG GCCTATGCCG CGAAGAACCA TCCGAACCTG CGGCGGCACC TCTCGGTGCA GGCGGCGGCG GCCAACCCGG ATGTCATCAA CTTCTACAGC CGCGAGTTCG GCGTGAAGCG CGTGGTGCTG CCGCGTGTGC TGACCGTGGC CGAGATCGCC GCGATCAACA AGGAGACGCC CGAGGTCGAG ACCGAGGTCT TCGTCTTCGG CGGTCTCTGC GTCATGGCCG AGGGGCGCTG CTCGCTCTCG TCCTATGCCA CCGGAAAGTC GCCCAACATG AACGGTGTCT GCTCGCCCGC GACAGAGGTG CAGTATGTCG AGGAGGGCGA CGAGCTCGCC GCGCGCCTCG GCGAGTTCAC CATCCACCGC GTGGGCAAGG ATCAGCCCGC GCCCTATCCG ACGCTCTGCA AGGGCTGCTT CACCTCGGGC GATCAGGTGG GCCACATCTT CGAGGATGCG GTCAGCCTCA ACGCGCAGGA CATCCTGCCC CAGCTCGCCA AGGCGGGCGT CACCGCGCTG AAGATCGAGG GGCGGCAGCG CTCGCGGTCC TATGTCGCGC AGGTGGTGCG CAGCTTCCGC GCCGCCGTCG ATGCGCTGGC CGCGGGCCAG CCCATGCCGC AGGGGGCGCT GGCCGCCCTC TCGGAAGGGC AGGCGACCAC GACGGGCGCC TATGCCAAGA CCTGGAGGTA A
Protein sequence | MELVCPAGTP AALRAAVEAG AHSVYCGFAD ETNARNFPGL NFSPKELAEG VAFAHKHGAH VLVAINTFPR AGDESLWHRN IAAAEAAGAD AVILADMGLL AYAAKNHPNL RRHLSVQAAA ANPDVINFYS REFGVKRVVL PRVLTVAEIA AINKETPEVE TEVFVFGGLC VMAEGRCSLS SYATGKSPNM NGVCSPATEV QYVEEGDELA ARLGEFTIHR VGKDQPAPYP TLCKGCFTSG DQVGHIFEDA VSLNAQDILP QLAKAGVTAL KIEGRQRSRS YVAQVVRSFR AAVDALAAGQ PMPQGALAAL SEGQATTTGA YAKTWR
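The GC content and protein sequence fields can be rederived from the gene sequence. The sketch below uses only the Python standard library and assumes the gene sequence is listed in coding orientation (it begins with ATG, so no reverse complement is applied even though the gene lies on the minus strand). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon-to-amino-acid assignments of translation table 11 are the same as the standard code; the tables differ only in which codons may act as start codons, which this sketch does not handle.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two and last two blocks of the gene sequence in this record.
print(translate("ATGGAACTCG TCTGCCCTGC"))  # MELVCP
print(translate("ACCTGGAGGTA A"))          # TWR
```

Passing the full gene sequence to `gc_fraction` and `translate` allows the GC content and protein sequence fields to be checked against the values listed in the record.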