Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_3019 |
Symbol | |
ID | 5084618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 3089014 |
End bp | 3089895 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640484590 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001169208 |
Protein GI | 146279049 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.798249 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTATC AGGTCCGCCT TTCCGTCAGT TATCACTACG CCCGTCCCGC CTCGGGGGGG CGGCACCTGC TGAGGCTTCT GCCCGCCCAT CTGCCGGGCG TGCAGGAGAT CTCGCAGGAG GCGATCGCGA TCAGCCCGCG ACCGACCGAG AGAAGCGAGC ATACCGATTT CTTCGGCAAC CGGACGGTGG AGGTCGCGGT GGCCTGCGAC CATTCCGAGA TCGTCTTCAC CGCCGGCTGC CGGGTCGAGC GGCTGTTCGC GGGGCCGGGC GCGGACCGCT CGCCGCCGCT GACGGACCTG CCGGGCGAGA TCGCGGCGGT GCTCGATCTC GGTCCCGCCG CGCCCCATCA TTTCCTGTCC GCCTCGCCGC GGATCCGGCC GGTGGCGGCC ATCACCGACC ATGCCCGGCT GGCGCTGGCC GACGCGCCGA GCGTGCGCGC AGCGGTCGAG GCGCTGGGGC GGGCGCTGCA TCGTGACCTG CGGTTCGATG GCGGGGCGAC GTCGGTGGAC ACACGTCCCG AAGAGGCCTT CGCTCACCGC CATGGCGTCT GCCAGGACTT TGCCCAGATC ATGATCGCGG GCCTTCGCGG TGTCGGAGTG CCGGCCGCCT ATGTCTCGGG CTTCCTGAGG ACGGTGCCCC CGCCCGGCCA GCCGCGGCTG GAGGGGGCGG ATGCGATGCA TGCCTGGGTG CGCGCCTGGT GCGGGCGGTC CGAAGGCTGG GTCGATTACG ATCCGACGAA CGGCTGCTTC GTGGGAGCGG ATCATGTCGT TGCCGCCGTG GGCCGCGACT ATGGCGACCT GGCGCCGGTC TCGGGCATCC TGCGCATCGC CGGCGGGCAG AGGACGAGCC ATGCGGTGGA TGTGATCCCG CTGCCGGGCT GA
|
Protein sequence | MLYQVRLSVS YHYARPASGG RHLLRLLPAH LPGVQEISQE AIAISPRPTE RSEHTDFFGN RTVEVAVACD HSEIVFTAGC RVERLFAGPG ADRSPPLTDL PGEIAAVLDL GPAAPHHFLS ASPRIRPVAA ITDHARLALA DAPSVRAAVE ALGRALHRDL RFDGGATSVD TRPEEAFAHR HGVCQDFAQI MIAGLRGVGV PAAYVSGFLR TVPPPGQPRL EGADAMHAWV RAWCGRSEGW VDYDPTNGCF VGADHVVAAV GRDYGDLAPV SGILRIAGGQ RTSHAVDVIP LPG
|
| |