Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0998 |
Symbol | |
ID | 4485941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 1100093 |
End bp | 1102381 |
Gene Length | 2289 bp |
Protein Length | 762 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639729773 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_872757 |
Protein GI | 117928206 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.2813 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGCC AGCTCCGGTT GAGTCTGGCC GCGGGACTGG CGGCGTTCCT CGGCTCGCTC GCCCTTCTCC CCCTCTTCAC CACCCTGCGG TGGCTCGGGC CGGTGGCGGT CGTCATCGCT GCGGTCACGG TGGCGTCGTT GCTGTTCCGG CAGATACGGC CGATCGCCGG CCTTGCTCCG CTCGCCGGGG TAGCCGGCTA TGTCTTCGCG GTGACGGCGC TGTTCGCTCA CACGAGCGCC GTCTTCGGCT TCCTGCCGGG ACCCGGAGCG GTGGCCAGCC TCCGCGACAC GCTGATCAAC GGCTTTAACG ACACCACCGA GATGTCGGCC CCGGTCACGC CGACCCGGGG GATCACCCTG CTGGCGGTCG GCGGTGTCGG CCTGGTCGCC GTGATGATTG ACATTTCCGT CACAGCGTTA CGCCGTCCGG CGATCTGTGG TCTGCCGCTG CTCGCCGTCT TCACCGTTCC GGCGGCGATC CTGAACCAGG GTGTGGGTTG GCTCCCGTTC GTCTGTGCGG CGGCCGGTTA TCTCCTCCTG CTGACCGCGG AAGGACGAGA ACGGCTGAGC GGGTGGGGCC GCGCGGTCGT CGGCCGGGTT GCCGGCGCAC GGCGGTGGCC CGCGACGGTC GGCCGCGAAC TGGCCCGGTC CGGCCACACC ATGGCCGGGG TTGCCATACT GATCGCTATT GCCGTTCCCC TCGCGATTCC GGGGCTCCAC GCGGGGTGGT TCGGCACCCA TCACACGTCG GGAGGCGGAG TTGATCCGGG CGGCGGCGGA GCGACGATCC AGCCGTTCGT CTCGGTACGT CGGGACCTGA CGCAGAGCAC GCCCATTCCG CTATTCACGT ACACGACCTC CGGCCGGCCT GACTACTTCC GGATGCTCAC TCTCGACGAG TTCGACGGCA CCACCTGGCG GGCCAGCGGT CTGGCGTCCG GTGGGGACAT CGCCGCGGAC GCTCCGCTGC CGACCGTCGG CGGGACGACG GACACCCGGG TTGTCACGCA GGTCACCGTC AGCGGATTGC GTGAACCTTT CCTTCCGGTG CCGCAAGTCC CACTCCGGGT GGACGTCGGG GAACCCTGGA AATTCAATCC GACCACAGGC GTCTTCTACG ACCCGCAGGG AGTAACGAGA AAGAATCAGC AATACACGGT CGTCAGCGCG CCGATAACTC CCTCTGTTCA GATGCTGAGA AATATTCGGA CGGCGGTCGA CCCCACCGCC ACCCGCTATT TGCAATATCC CACGAATATT CCGCCGAACA TCAAACAACT GGCCGACCAA ATCGTCGCGC GGGCCGGCAC GCCTTACGAA AAAGCGCTTG CGCTGCAGAA TTGGTTTCTC GCAAATTTCA CCTACGACAT CAACGCCCGC TCCGGAAGCT CGACCAGCGC GCTGGAATCG TTCCTGCAAG ACCGGACCGG CTACTGCGAA CAGTTCGCCG CCACCATGGC GCTGATGGCC CGGATGGAAG GGATTCCGGC CCGGGTCGAT ATCGGCTTCA CGCCCGGCGA ACCCGTTACC GGCACGGACA GCTACGTCGT AACGACGGCG GACGCCCACG CCTGGCCGGA GCTGTACTTC CCCGGAATCG GGTGGTTGCG CTTCGAGCCG ACACCGCGGG CCGATGGGCA GGCGACGGTC CCGGCATACG GCGCGACCGG CACTGTGCCG TCCGCGACCG TTCCACCCAC CCCGAGCGCT ACCGGCCCAG GTACCGCGAA CATCCCATCG GCCGCCCCCA GCGGTGCAGG GGCCGCCGCC CCGGGAACGC GCAGCGTCCA GCACGGCGTC CAATTGCCGC GGATTCCACC GGAACTTCTT GGGCTCATCG TGCTGGTTGC GTTGGGCGTC GGTGCCGGTC CGGCCGCCCG TTGGTGGATC CGGGAGCGCC GGTGGACTGC GGCGGACGAC GCGGCTGCCG AGGCTCACGT GGCCTGGGCC GAATTGGGTG ACGACGTGCG TGACCTACGG CTGGAGTGGA CCGGTGACAC GGACACGCCG CGCCGGGCGG CACAACGGCT GGCGGCGGCT CCGCAGCTTC GCGGCCAACC GGAGGCGACC GACGCGCTCT TCCGCCTTGC GCACGCCGAG GAGCTCGCCC GGTACGCGAC CCCCCATCGC GTGCGGAATT TGGCGGAGAA TTTTGAACCC CGTCGCGACC AGCAATTGGT ACGCCGCGCC CTCATTGCCG CAATGCCCCG GTCCCGCCGG CTGCGGGCGC TCCTCCTGCC GACCTCGGTC CGTACGGTGC TCCGATCGAG ACGGCGGACG TCGCGTTGA
|
Protein sequence | MTSQLRLSLA AGLAAFLGSL ALLPLFTTLR WLGPVAVVIA AVTVASLLFR QIRPIAGLAP LAGVAGYVFA VTALFAHTSA VFGFLPGPGA VASLRDTLIN GFNDTTEMSA PVTPTRGITL LAVGGVGLVA VMIDISVTAL RRPAICGLPL LAVFTVPAAI LNQGVGWLPF VCAAAGYLLL LTAEGRERLS GWGRAVVGRV AGARRWPATV GRELARSGHT MAGVAILIAI AVPLAIPGLH AGWFGTHHTS GGGVDPGGGG ATIQPFVSVR RDLTQSTPIP LFTYTTSGRP DYFRMLTLDE FDGTTWRASG LASGGDIAAD APLPTVGGTT DTRVVTQVTV SGLREPFLPV PQVPLRVDVG EPWKFNPTTG VFYDPQGVTR KNQQYTVVSA PITPSVQMLR NIRTAVDPTA TRYLQYPTNI PPNIKQLADQ IVARAGTPYE KALALQNWFL ANFTYDINAR SGSSTSALES FLQDRTGYCE QFAATMALMA RMEGIPARVD IGFTPGEPVT GTDSYVVTTA DAHAWPELYF PGIGWLRFEP TPRADGQATV PAYGATGTVP SATVPPTPSA TGPGTANIPS AAPSGAGAAA PGTRSVQHGV QLPRIPPELL GLIVLVALGV GAGPAARWWI RERRWTAADD AAAEAHVAWA ELGDDVRDLR LEWTGDTDTP RRAAQRLAAA PQLRGQPEAT DALFRLAHAE ELARYATPHR VRNLAENFEP RRDQQLVRRA LIAAMPRSRR LRALLLPTSV RTVLRSRRRT SR
|
| |