Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0661 |
Symbol | |
ID | 4485423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 712338 |
End bp | 714458 |
Gene Length | 2121 bp |
Protein Length | 706 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639729429 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_872420 |
Protein GI | 117927869 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.233246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCGGC CCGTGCGGCT CAGTCCGCGC CGGCCGGCCC AGGGCGGTGG GACGGCCGGG CGCGGCCGGC TGGCCGAGGG CAGCCGGACC ATCGCCGCGT TCGCAGCGGC AGGTGTCGTC GCTGACCTCG CCGGCTGGTG GACGACGGGA TGGGTGGCGT ATCCCGCTGG CGCCGCTGGC ATTGTCGTTG CCGCTGTCGG CTTGTGGCGG GGACGCCGCG TGCTCGGCCT GCTGGCGAGC CTTGCGGCCG CCGCTGCGCT GGCTGTGGAG CTGATGGCAG CCGCCCGTCC GCTCCCGGGC GGGTGGACCG GTACGACGGT CAGCGGCCTG CTCATCCTGA CCCAGGTCGC CCAGGTCAGC TCCGCCGGCG TCCCACGGGA CCTGTTCTTC GGGACGCCGA TCGCGCTGGC GGTCGCGCTG CAGGCCGCGT CCCGGTCACC TTCTGTCGGG GGACTGGCCG TCGCCGGCGT CCTGGGATGC CTGGTCGCGG GCCTTGCCGC GGTACCGGCG CGACGTCTCG CCTCGGTGGG AGTGCGGGTA GGTCTTGTCG CGGCGGTCAT CGTCCTCGGC ACCCCGGATA GCCTCCACCT TCGGCTGTCC GGCCACCCGG CCGGCGGCAC TCCCGCCGCG TCGGTCCCCC CATCACCGGG CCCAGGCGGC GCTGCCGACG GCACCTCGGC GGGCATCGGC GCGCATGCCG CCGCCCTGCT CGGCACCGGC GGCCTCGATC TGGCCGTCCG GGGCGCATTG CCGCCGGTCC CGCTTGCCGC TGTCCCCGCC GATGCGCCGG TGTACTGGCG GGGGATGGTC TACGACGCCT TCGACGGCCG GCGCTGGTCG GTAAGCGGCG ACGCCGCGGG CGCCTGGACG GCGTCCAGAC CCACCAGCCA GCCACCGGCT GCCCCGGCGG TGCCGGACGG CCCGCGGCGC ACCGACACCG TCGATCTGCT CGGCCCACCG GTCGGCGTCC TGCTGACGCC GGGGGAGGCG GTCGGTTACG CCGGACCCGG AACCCTGGTG GCCGACGGCG ACGGCAATGC CCTGCCGCGC GAGGGGTTGA CCGGACAGTA CGCCGTCACG TCGGTGGTCA CTGCGGGGCC GGTCACCGGG GTTGCCCGGG CCGCCCCCGG GGTCGCAGCC AAGTGGACCG CGCTGCCGGC GACCCTCTCG CCGCAGGTGG CCGCACTTGC GGCGACCTTG GCCCGTCCGA CGCGGGCGGC GACGGTGGAC GCCGTCCGGA CGTACCTGCA GAGCCACGAG CGGTACCGGC TGGATCCGCC GGCCTCTCCG AGCGGTGACC CGGTGGCGGC GTTCCTGTTC CGCACGCACG AGGGATTCTG CGAGCAATTC GCCACCGCCG AGGTGGTGTT GCTGCGCGCG GTCGGCGTCC CGGCGCGGCT CGTGACCGGC GTGGCGTTCG GGGAGGTCCA GGATGGGCGG CGGATATTCC GGGCGAGTGA TGCGCACGCG TGGGTCCAGG TCTGGTATCC CGGGGTGGGC TGGGTGAATG ACGATCCGAC CCCGCCGTCC GCGCTGGGGG CTGCTCCTCC CGGCGTGTCG TCGGGGCCAT CCCTCACCGC CGGCGGACCG GCGCACAAGG CCGAGGCACC CCCGGCCGAG AGTGCTGCGC CATCCCTTAC CGCCGGCGGA GCGGCGACTG CGACGCCCAG CCCCGCGGGG CAGCCGACCG TCCAGCCGCC GCGCCGCCCA TCCGCGGCGG GTGCCGGCCG GTTCGCGTGG AACCTCTTGC TGGCTGGCGC GTTGCTCGGC CTGTTGGCGG TCTGGGCGGT TGCCCGGCGG CTTCGCGGGC GCGCCCGCCG GATTGGCGCC CCGGCTCCTG ACGTGTCGGT CCCGTCCGAG AGGAGTGACG GGTACGTCGG CGGCGGTCCG GTGCTCCGGG CGTATCGGGA CTTCGCTGAG GCCGTCGGCG GCAGCGCCGA TTGCACCCCG CGCGAGGTGG TGGCGCGGTT GGGACCGGCG ATCGCGGATG CACCCGAGGT CCTTGCGGCA TTGGGCACGC TGGACGAGGA ATGTTTCGGG ATCGAGCCGC CGAGCCCGGT GCGGGTTCAG GCCGCCGTCC AGGTCTTTAG TCGTTTGCGA AGTATGTCGG GTTTACGGTA G
|
Protein sequence | MSRPVRLSPR RPAQGGGTAG RGRLAEGSRT IAAFAAAGVV ADLAGWWTTG WVAYPAGAAG IVVAAVGLWR GRRVLGLLAS LAAAAALAVE LMAAARPLPG GWTGTTVSGL LILTQVAQVS SAGVPRDLFF GTPIALAVAL QAASRSPSVG GLAVAGVLGC LVAGLAAVPA RRLASVGVRV GLVAAVIVLG TPDSLHLRLS GHPAGGTPAA SVPPSPGPGG AADGTSAGIG AHAAALLGTG GLDLAVRGAL PPVPLAAVPA DAPVYWRGMV YDAFDGRRWS VSGDAAGAWT ASRPTSQPPA APAVPDGPRR TDTVDLLGPP VGVLLTPGEA VGYAGPGTLV ADGDGNALPR EGLTGQYAVT SVVTAGPVTG VARAAPGVAA KWTALPATLS PQVAALAATL ARPTRAATVD AVRTYLQSHE RYRLDPPASP SGDPVAAFLF RTHEGFCEQF ATAEVVLLRA VGVPARLVTG VAFGEVQDGR RIFRASDAHA WVQVWYPGVG WVNDDPTPPS ALGAAPPGVS SGPSLTAGGP AHKAEAPPAE SAAPSLTAGG AATATPSPAG QPTVQPPRRP SAAGAGRFAW NLLLAGALLG LLAVWAVARR LRGRARRIGA PAPDVSVPSE RSDGYVGGGP VLRAYRDFAE AVGGSADCTP REVVARLGPA IADAPEVLAA LGTLDEECFG IEPPSPVRVQ AAVQVFSRLR SMSGLR
|
| |