Gene Acel_0998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0998 
Symbol 
ID4485941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1100093 
End bp1102381 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content68% 
IMG OID639729773 
Producttransglutaminase domain-containing protein 
Protein accessionYP_872757 
Protein GI117928206 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.2813 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGCC AGCTCCGGTT GAGTCTGGCC GCGGGACTGG CGGCGTTCCT CGGCTCGCTC 
GCCCTTCTCC CCCTCTTCAC CACCCTGCGG TGGCTCGGGC CGGTGGCGGT CGTCATCGCT
GCGGTCACGG TGGCGTCGTT GCTGTTCCGG CAGATACGGC CGATCGCCGG CCTTGCTCCG
CTCGCCGGGG TAGCCGGCTA TGTCTTCGCG GTGACGGCGC TGTTCGCTCA CACGAGCGCC
GTCTTCGGCT TCCTGCCGGG ACCCGGAGCG GTGGCCAGCC TCCGCGACAC GCTGATCAAC
GGCTTTAACG ACACCACCGA GATGTCGGCC CCGGTCACGC CGACCCGGGG GATCACCCTG
CTGGCGGTCG GCGGTGTCGG CCTGGTCGCC GTGATGATTG ACATTTCCGT CACAGCGTTA
CGCCGTCCGG CGATCTGTGG TCTGCCGCTG CTCGCCGTCT TCACCGTTCC GGCGGCGATC
CTGAACCAGG GTGTGGGTTG GCTCCCGTTC GTCTGTGCGG CGGCCGGTTA TCTCCTCCTG
CTGACCGCGG AAGGACGAGA ACGGCTGAGC GGGTGGGGCC GCGCGGTCGT CGGCCGGGTT
GCCGGCGCAC GGCGGTGGCC CGCGACGGTC GGCCGCGAAC TGGCCCGGTC CGGCCACACC
ATGGCCGGGG TTGCCATACT GATCGCTATT GCCGTTCCCC TCGCGATTCC GGGGCTCCAC
GCGGGGTGGT TCGGCACCCA TCACACGTCG GGAGGCGGAG TTGATCCGGG CGGCGGCGGA
GCGACGATCC AGCCGTTCGT CTCGGTACGT CGGGACCTGA CGCAGAGCAC GCCCATTCCG
CTATTCACGT ACACGACCTC CGGCCGGCCT GACTACTTCC GGATGCTCAC TCTCGACGAG
TTCGACGGCA CCACCTGGCG GGCCAGCGGT CTGGCGTCCG GTGGGGACAT CGCCGCGGAC
GCTCCGCTGC CGACCGTCGG CGGGACGACG GACACCCGGG TTGTCACGCA GGTCACCGTC
AGCGGATTGC GTGAACCTTT CCTTCCGGTG CCGCAAGTCC CACTCCGGGT GGACGTCGGG
GAACCCTGGA AATTCAATCC GACCACAGGC GTCTTCTACG ACCCGCAGGG AGTAACGAGA
AAGAATCAGC AATACACGGT CGTCAGCGCG CCGATAACTC CCTCTGTTCA GATGCTGAGA
AATATTCGGA CGGCGGTCGA CCCCACCGCC ACCCGCTATT TGCAATATCC CACGAATATT
CCGCCGAACA TCAAACAACT GGCCGACCAA ATCGTCGCGC GGGCCGGCAC GCCTTACGAA
AAAGCGCTTG CGCTGCAGAA TTGGTTTCTC GCAAATTTCA CCTACGACAT CAACGCCCGC
TCCGGAAGCT CGACCAGCGC GCTGGAATCG TTCCTGCAAG ACCGGACCGG CTACTGCGAA
CAGTTCGCCG CCACCATGGC GCTGATGGCC CGGATGGAAG GGATTCCGGC CCGGGTCGAT
ATCGGCTTCA CGCCCGGCGA ACCCGTTACC GGCACGGACA GCTACGTCGT AACGACGGCG
GACGCCCACG CCTGGCCGGA GCTGTACTTC CCCGGAATCG GGTGGTTGCG CTTCGAGCCG
ACACCGCGGG CCGATGGGCA GGCGACGGTC CCGGCATACG GCGCGACCGG CACTGTGCCG
TCCGCGACCG TTCCACCCAC CCCGAGCGCT ACCGGCCCAG GTACCGCGAA CATCCCATCG
GCCGCCCCCA GCGGTGCAGG GGCCGCCGCC CCGGGAACGC GCAGCGTCCA GCACGGCGTC
CAATTGCCGC GGATTCCACC GGAACTTCTT GGGCTCATCG TGCTGGTTGC GTTGGGCGTC
GGTGCCGGTC CGGCCGCCCG TTGGTGGATC CGGGAGCGCC GGTGGACTGC GGCGGACGAC
GCGGCTGCCG AGGCTCACGT GGCCTGGGCC GAATTGGGTG ACGACGTGCG TGACCTACGG
CTGGAGTGGA CCGGTGACAC GGACACGCCG CGCCGGGCGG CACAACGGCT GGCGGCGGCT
CCGCAGCTTC GCGGCCAACC GGAGGCGACC GACGCGCTCT TCCGCCTTGC GCACGCCGAG
GAGCTCGCCC GGTACGCGAC CCCCCATCGC GTGCGGAATT TGGCGGAGAA TTTTGAACCC
CGTCGCGACC AGCAATTGGT ACGCCGCGCC CTCATTGCCG CAATGCCCCG GTCCCGCCGG
CTGCGGGCGC TCCTCCTGCC GACCTCGGTC CGTACGGTGC TCCGATCGAG ACGGCGGACG
TCGCGTTGA
 
Protein sequence
MTSQLRLSLA AGLAAFLGSL ALLPLFTTLR WLGPVAVVIA AVTVASLLFR QIRPIAGLAP 
LAGVAGYVFA VTALFAHTSA VFGFLPGPGA VASLRDTLIN GFNDTTEMSA PVTPTRGITL
LAVGGVGLVA VMIDISVTAL RRPAICGLPL LAVFTVPAAI LNQGVGWLPF VCAAAGYLLL
LTAEGRERLS GWGRAVVGRV AGARRWPATV GRELARSGHT MAGVAILIAI AVPLAIPGLH
AGWFGTHHTS GGGVDPGGGG ATIQPFVSVR RDLTQSTPIP LFTYTTSGRP DYFRMLTLDE
FDGTTWRASG LASGGDIAAD APLPTVGGTT DTRVVTQVTV SGLREPFLPV PQVPLRVDVG
EPWKFNPTTG VFYDPQGVTR KNQQYTVVSA PITPSVQMLR NIRTAVDPTA TRYLQYPTNI
PPNIKQLADQ IVARAGTPYE KALALQNWFL ANFTYDINAR SGSSTSALES FLQDRTGYCE
QFAATMALMA RMEGIPARVD IGFTPGEPVT GTDSYVVTTA DAHAWPELYF PGIGWLRFEP
TPRADGQATV PAYGATGTVP SATVPPTPSA TGPGTANIPS AAPSGAGAAA PGTRSVQHGV
QLPRIPPELL GLIVLVALGV GAGPAARWWI RERRWTAADD AAAEAHVAWA ELGDDVRDLR
LEWTGDTDTP RRAAQRLAAA PQLRGQPEAT DALFRLAHAE ELARYATPHR VRNLAENFEP
RRDQQLVRRA LIAAMPRSRR LRALLLPTSV RTVLRSRRRT SR