Gene Information |
Locus tag | Oter_4603 |
Symbol | |
ID | 6207487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Opitutus terrae PB90-1 |
Kingdom | Bacteria |
Replicon accession | NC_010571 |
Strand | - |
Start bp | 5907836 |
End bp | 5910130 |
Gene Length | 2295 bp |
Protein Length | 764 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641694271 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001821474 |
Protein GI | 182416408 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
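The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, which is not translated.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record above (Start bp, End bp).
gene_len, protein_len = cds_lengths(5907836, 5910130)
print(gene_len, protein_len)  # 2295 764
```

Under these assumptions the result agrees with the listed gene length of 2295 bp and protein length of 764 aa.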
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGG TGACGGCGCA TCCCCAGCTC ACACCTGACG AACTGCAGCA GCTACGCTGG CTGCTCGGCG GCGCGCTGAC GCTACTCGCG GTCTCAACCG TCTTCACCAT GGACGTCGGT GCATCGGCCG TGGCCGCGTT GACCCTGGTG ACCGTCACCA CGGCGACGCT CCGGCCCGAC TGGCTGGTGT GGGTCCCGCG CTGGGTACAC CGGCTGGCCT TCTCATTCGT CGCGGCATTT TTCACCGCCG ACCTCTGGCT GACGGGAGAA TTGCTCTCGG CGATTGTGCG GCTCGACGCC CTGCTGCTGC TGTACCGCGG CATTTCCTAT CGGAAGAAGC GCGACGATCT GCAGCTGATC GTGCTAGGGC TGTTCCTTGT TGTGCTCGGC GGGGTGTTGA GCGTCTCGCT CGGGTTCGCG GTGCAGATCC TGGCGTTCAC CGCGGTCGCG CTGGTGTTGC TGATGACGAT CACGCTGAGC GCAACAGAGC CCGCGGCGGA AAAGCGCGTG GCCGGCGGCG CGCTCCCTGC GGTGCCGCGC TGGGCGCGGC ACGTGTCCTG GCCGGCCCTG CTCCGGCGCG TGAGGCAGGC GACCGACTGG CGCGTGGTGG CGTTGAGCAC CGGCCTGTTC GTCGGGCTCG TGGGCTTGGC GGCGCTGCTC TTCGTGACGA TTCCGCGGTT TCAGCTCGAA AACAGCTTTT TCCTGGAGCG GTTCATCACG AAAAAAACGC GCACAGGTTT CAGCGACACG ATCCGGTTCG GCGAGGTGAC CGAGATTACG CAGGACAACA GCGTGGCGTT GAGCGTGGAC GTTTCCGACC GGAGGGCGAT TCCGGCGGAC CCGTATTGGC GGATGCTCGT GCTGGACGAA TATCGCGACG GTACGTTCCG ACTTTCACCG GGGATGCGGC GGCTGGCGTT GCCGCAGGAG CGCGGCGGGG TGAATGTGCG CGGCGGGCTG CGCGAGCGCG CAGGTGCCAC GGCGGATTGG ACGTTTTACC TGGAATCAGG CGTGAGCCGC TATCTGCCGC TGCTCGGCCC GTTCGAGCTG CTGCGCTTTC GCGAGGCGCA GAATTATCGC TTCAGCCGCG GACTCGGCGT GGTCGAGCTG CGCGCGGAAC CGGCGACGAT GCTGGCTTAC CGCATCGAGG GAATGGAGAA CAACGCCGTG GTCGCCGACC CAGCCTTTGC GGCGCAGTGG CGGGCTCATG GAAGCAATCG CTGGTCGGCC GGCGCACTGT TGCTGCAGCT GAGTGTGAGT GCGGAAGACC GCGAGAAGCT GGCGCGAGCG GTCGGGGAGA TTCGTGCCGC GGCGACGCCG TCCTCGCGCG CCGCTCCGGA TCCAGGCCCG CCGGACGTGG CGGAATTCCT GCGGCGAGCC GAGGCGTGGC TCCAACACCA GCATCGGTAC TCGTTGTCGC CGCGGATTCC GGCGGGGGAC GGAGATCCGC TGCTGCGGTG GATGATGTCA TCCGAGCCGG GGCATTGCGA ACTGTTCGCC GGTTCGCTGG TGCTGCTGGC CCGGGCGGCG GGGCTGCCGG CGCGGGTCGC CACGGGGTTT CGCGGTGGAT CGTGGAACGG TTACTCGAAC AATTTCACGC TGCGCAACTC CGATGCGCAC GCGTGGTGCG AAGTGTTCGA TGCGCAAGCG CAAGCCTGGC GGCGCGCGGA TCCGACGCCG GGATCCGGCG CGGCGGCGCG CGAGGATCGT GGCGGCGAGG CCGGGCTTGC CGACCGCCTC GACCGCAGCT GGACGGCGCG GCTCGATAGC CTGCGCGTGT TCTGGTATCG GCGGATTGTC 
AGCTTCGACC AGCAGTCGCA GCTCGAGACC CTGCAGGCGG TGAAGGAGGC GACGGAGCGC TCAGGACAAC AGCTGCGCGA AATGCTGGAA CGGTGGGCGG GCGGCCTGCG CGCGTGGCTG GCACGACCGT GGACGAGTGG ACGGATGGCA TGGCTCGCGG TGATCGGCGC AGGAGTGCTC GTGGCGGTGG TGGGGTTGCG CTGGGCGGCG AGGAATTTTC GATTTTCAAT CTTCGACTTT CGATTTCGCC GACGCGGACA GAATTGGGAT GTGGTGCGCG CGGAAGCGGG ACGATGGTTG ACCAGGATCG CGCAAGCAGA ACGCAGGGTG GGCGAAACGA TGCAGGTCGT CGCGGCGCTG GAGCGGTTGC GATTCGGGGC GCGCGAAACG TGGCCGGAAC CCCGAAAAAT TTTCGCGCGT GCGAGAAAAG CGCTGCGCGA GGCGAGGAGC GATAAGGTAG GGCGCGGACT CCGCTCCGCG CCGAGAAAAA CGTGA
|
Protein sequence | MNKVTAHPQL TPDELQQLRW LLGGALTLLA VSTVFTMDVG ASAVAALTLV TVTTATLRPD WLVWVPRWVH RLAFSFVAAF FTADLWLTGE LLSAIVRLDA LLLLYRGISY RKKRDDLQLI VLGLFLVVLG GVLSVSLGFA VQILAFTAVA LVLLMTITLS ATEPAAEKRV AGGALPAVPR WARHVSWPAL LRRVRQATDW RVVALSTGLF VGLVGLAALL FVTIPRFQLE NSFFLERFIT KKTRTGFSDT IRFGEVTEIT QDNSVALSVD VSDRRAIPAD PYWRMLVLDE YRDGTFRLSP GMRRLALPQE RGGVNVRGGL RERAGATADW TFYLESGVSR YLPLLGPFEL LRFREAQNYR FSRGLGVVEL RAEPATMLAY RIEGMENNAV VADPAFAAQW RAHGSNRWSA GALLLQLSVS AEDREKLARA VGEIRAAATP SSRAAPDPGP PDVAEFLRRA EAWLQHQHRY SLSPRIPAGD GDPLLRWMMS SEPGHCELFA GSLVLLARAA GLPARVATGF RGGSWNGYSN NFTLRNSDAH AWCEVFDAQA QAWRRADPTP GSGAAAREDR GGEAGLADRL DRSWTARLDS LRVFWYRRIV SFDQQSQLET LQAVKEATER SGQQLREMLE RWAGGLRAWL ARPWTSGRMA WLAVIGAGVL VAVVGLRWAA RNFRFSIFDF RFRRRGQNWD VVRAEAGRWL TRIAQAERRV GETMQVVAAL ERLRFGARET WPEPRKIFAR ARKALREARS DKVGRGLRSA PRKT
|
| |
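The sequences above are displayed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch, not a reference implementation: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments. Alternative start codons permitted by table 11 are not handled here; the gene sequence in this record starts with ATG, so that case does not arise.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the display whitespace and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two display blocks of the gene sequence from the record.
fragment = clean_sequence("ATGAACAAGG TGACGGCGCA")
print(len(fragment))             # 20
print(translate(fragment[:15]))  # MNKVT
```

The translated fragment matches the first five residues of the listed protein sequence (MNKVT). Applying the same helpers to the full gene sequence would allow the 2295 bp length and the 67% GC content to be checked in the same way.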