Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A1987 |
Symbol | |
ID | 3835411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 2296765 |
End bp | 2297928 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637826086 |
Product | peptidase M42 |
Protein accession | YP_427074 |
Protein GI | 83593322 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | [TIGR03106] hydrolase, peptidase M42 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCGTC TGGCGATCGA TACGGACTAT CTGGCCCGGA CTCTGGTTCG CTTGTTAGCC ACCCCCAGCC CGACCGGCTA TACCGATACC GTCGTCCGCG AAACCTGTGC CGAGTTGGAA AGCCTGGGCC TGACCCCGAC CCTGACCCGA CGGGGGGCGG TTTGCGTGGT GTTGCGCGGA CGGGAAGCCC GGCCGGCCCG CGCCATCGTT TCCCATCTCG ACACGCTGGG CGCTCAGGTC AAGCAGCTCA AAGACAACGG CCGCCTGGAA CTGGTGCCGA TCGGCACCTG GTCGGCGCGC TTCGCCGAGG GGGCGCGGGT CACGGTGTTC ACCGATCGCG GCGCGGTTCG CGGCACGATT TTGCCGCTGA AGGCCTCGGG CCACATCTTT AACGAAGAGA TCGACAGCCT GCCGATCGGC TGGCCGATGA CCGAGTTGCG GGTCGATGCC CGGGTTCATA GCAAGGCCGA TCTGATCGCC CTGGGTATCG AGGTTGGCGA TATCGTCGCC ATCGACCCCC AGCCGGAATT CCTAGCCAAC GGCTATATCG TGTCCCGCCA TCTTGATGAC AAAGCCGGGG TGGCGCTGAT GCTCGCGGCT TTGAAGGCCC TGACCGCCCA TAACGAACCG CCGCCCGTCG ATGTCCATTT CATCTTCACC ATCGCCGAGG AAGTCGGCGT CGGCGCGTCT TCGGCGCTGA CCGATGACGT CGCCTCGGTG GTCGCCGTCG ATAACGGCAC CAGCGGACCC GGCCAGAACT CGGCCGAATT CGGCGTTACC ATCGCCATGG CCGACCAGAC CGGCCCCTTT GATTATCATC TGACCCGGGC GCTGATCCGG CTCTGCCGCG ACGAGGACAT CATCTTTCGC AAGGATGTGT TCCGCTACTA CCGCTCCGAC GCCGCCTCGG CGGTGGTCGC CGGCCACGAT GTGCGCAACG CGCTGGTCAC CTTTGGCATC GACGCCTCGC ATGGCTATGA GCGCATCCAT ATGCACGCCC TGCGGTCGGT GGCCGAATTG CTGAGCGCCT ATGCGCTGAG CCCGGTGGAG ATCCGCCGAG ACGCCGTGGA GACCGCCCGC GGTCTGGCCG GCTTCACCCG CCAGCCACCC CCCGAACCCA TCGCCGAGGA CCTAGCCGTT TCGCAAGAGG GCCCTCTTGC GTGA
|
Protein sequence | MTRLAIDTDY LARTLVRLLA TPSPTGYTDT VVRETCAELE SLGLTPTLTR RGAVCVVLRG REARPARAIV SHLDTLGAQV KQLKDNGRLE LVPIGTWSAR FAEGARVTVF TDRGAVRGTI LPLKASGHIF NEEIDSLPIG WPMTELRVDA RVHSKADLIA LGIEVGDIVA IDPQPEFLAN GYIVSRHLDD KAGVALMLAA LKALTAHNEP PPVDVHFIFT IAEEVGVGAS SALTDDVASV VAVDNGTSGP GQNSAEFGVT IAMADQTGPF DYHLTRALIR LCRDEDIIFR KDVFRYYRSD AASAVVAGHD VRNALVTFGI DASHGYERIH MHALRSVAEL LSAYALSPVE IRRDAVETAR GLAGFTRQPP PEPIAEDLAV SQEGPLA
|
| |