Gene Acel_1601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1601 
Symbol 
ID4484656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1800240 
End bp1802540 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content72% 
IMG OID639730387 
Product4-alpha-glucanotransferase 
Protein accessionYP_873359 
Protein GI117928808 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.154546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGCGA GCCAGGCGAG CCTCGACCCG CAGTTGGCCG AGCTTGCGCG CAGCTACGGA 
ATCGCCACCA CATTCCCTGA CTGGCAGGGC AGACCGGCCG AGGTGAGCCG CCGGGCGGTG
CTCGCCGTTC TCCAGGCCCT GGGCGTCGAC GCGTTGGATC CGCGGCGGAT TCCGGAATTG
CTCGCGGAAC GGCGTGCCGA GAGCCTGCGG CGGTTCCTGC CGTCGTACGT CGTCGTCCGG
GAGGGACGAT CCGCGGTGAT TCCGGTGCGG CTGCCGGACG GCGTCCCCGC CGCGGTGAGC
ATCGAGACGG AAGATGGCGG GCGGGTCGCG CTGACCCTCC GTCGGCGGGA CGGCGAGGCA
CACCACGTCG ACGGCCGGGC AATTGTGACG TACGACGTGG AGCTCCCCGC GGACCTGCCG
GCCGGTTACC ACCGGGTGCA CGGAGCCGGC GGGGAGTGGA CGGCGGAGTC CCTCGTCCTC
GTGAGCCCGC GAAAACTTCC TGTCCCACCC GATCCCGTCA CCGGCCTGAT GTGCCAGTTG
TACGCCGTCC GCTCGACCGC GTCGTGGGGC ATCGGCGATG CGGCGGACCT TCGCGCCCTG
GCGGAATGGG CGGCTCGTGA GCTGGGATGC GGATTCGTGC TCGTCAATCC CCTCCATGCC
CCGGCGCCGA CGCTGCCAAT GGAGCCGTCG CCGTATTTTC CGTCGTCCCG CCGGTTCGCT
GACCCGCTCT ATCTGCGGAT CGAGGACATT CCCGAAGCGG CCGGCTGCGC GGACGCCGAT
GCCGCGGCGT TGCGGGCGGA GAATTCCGTG GACCGGCTTA TCGACCGCGA TCGTATCTGG
ACGGTGAAAC GCGCCGCACT GGAGAAAGCC TTTGCGGTCT GGGAAGACTG TTCGACCAGC
GCGCGACGCG CCGAATTTCA CCGATTCCAG GAGGAACAAG GCGCCGCGCT GACGAATTTC
GCGACCTGGT GTGCGCTCGC GGAGCAGTAC GGACCGAAAT GGCGGAATTG GCCGCCGGAA
TTTCGGGATC CGGCGGGCCG CGCGGTGTCG GAATTCCGGG CCGCGCAGCA CCGTCGGGTG
CAGTTTTACG GCTGGTTGCA GTGGCTTGCC GCCGGGCAGT TGGCTGCTGC GCACCGGTCG
GCGCGTGCTG TCGGCATGCC GCTCGGCGTC GTCCACGATC TCGCGGTCGG CGTGGACCCG
GACGGTGCCG ACGCCTGGGC GTACGCGTCG GTGATTGCGC CCGGCGTCAC CCTGGGCGCA
CCCGCGGACA TGTACAACCA GCAGGGCCAG CGGTGGAATT TGGCGGCCTG GCATCCGGAC
CGGTTGGCCC GCGCCGGATT CCAGCCGTTG CGAGACACGG TCCGCGCGTG GTTGGCGCTC
GGCGGCGGCC TGCGCATCGA CCACATCCTC GGGTTCTTCC GGCAGTGGTG GATCGCCGAC
GACGCACCCG CCGCTGACGG CGCGTACCTG GAGATGGACG CCGACGCGCT GCTGGGCGTC
GTCCGGATCG AGGCGGCCCG CGCTGGAGCT GTGGTGATCG GTGAGGACCT GGGCGTGGTG
CCTGCCGGCG TGCGTGAGCG GCTGCGCGCC GAGGACATCA TGGGCACGTC CGTGCTGTGG
TTCGAGCGGG ATCGGTCCGG CCGGCCGAGT CCACCGGCGC ACTGGCGGCG GGAGTGCCTG
GCGACGGTGA CAACCCATGA CCTGCCGCCG ACCTGCGGCT ACCTGCTCGG CGTACACGTC
GACCTGCGCG CCCGGTTGGG TCTCCTCGCC CGGGACGAAG CGGCCGAGCG GGCGGCGGAC
GAGGCGGACC GTCTCGACTG GCTGCGTGTC CTGGCCGCCG AGGGCCTTCT CGACCCGGCG
ATCCTCGCGG AGATCACTGG CCGGCAGACG GCGGCCGAGG CCCAGCCGGA CGCCGTACCC
GCAGCCGGGG CCCAGCCGGA CGCCGTACCC GCAGCCGGGG CCCAGCCACG CGCCGTACCC
GCCCGCGAGG CCGAGCCGGA CGCCGTGCCC GCGGACTTCG CGGCACGCCT GCGGCCGCAC
CTGGATGCCG TCCGCGCCGC GCTCTACGGG TACGTCGGCC GGACACCGGC GCTGCTCCGG
GGCCTCTACC TCCCGGACAT CGTGGGCGAT CGACGTCCGG TGAATCAGCC GGGCACCGCG
GACGCGTATC CGAACTGGCG GGTGCCGATG GCGGACGGCA ACGGACGGGT CGTCCTCCTC
GACGAGGTGT TCAGCGATCC GGCGATCCGC GCGGTGGCTC AAACGCTGGC TCGACTGCTG
CGCGGCGGCC GGGCGACATG A
 
Protein sequence
MGASQASLDP QLAELARSYG IATTFPDWQG RPAEVSRRAV LAVLQALGVD ALDPRRIPEL 
LAERRAESLR RFLPSYVVVR EGRSAVIPVR LPDGVPAAVS IETEDGGRVA LTLRRRDGEA
HHVDGRAIVT YDVELPADLP AGYHRVHGAG GEWTAESLVL VSPRKLPVPP DPVTGLMCQL
YAVRSTASWG IGDAADLRAL AEWAARELGC GFVLVNPLHA PAPTLPMEPS PYFPSSRRFA
DPLYLRIEDI PEAAGCADAD AAALRAENSV DRLIDRDRIW TVKRAALEKA FAVWEDCSTS
ARRAEFHRFQ EEQGAALTNF ATWCALAEQY GPKWRNWPPE FRDPAGRAVS EFRAAQHRRV
QFYGWLQWLA AGQLAAAHRS ARAVGMPLGV VHDLAVGVDP DGADAWAYAS VIAPGVTLGA
PADMYNQQGQ RWNLAAWHPD RLARAGFQPL RDTVRAWLAL GGGLRIDHIL GFFRQWWIAD
DAPAADGAYL EMDADALLGV VRIEAARAGA VVIGEDLGVV PAGVRERLRA EDIMGTSVLW
FERDRSGRPS PPAHWRRECL ATVTTHDLPP TCGYLLGVHV DLRARLGLLA RDEAAERAAD
EADRLDWLRV LAAEGLLDPA ILAEITGRQT AAEAQPDAVP AAGAQPDAVP AAGAQPRAVP
AREAEPDAVP ADFAARLRPH LDAVRAALYG YVGRTPALLR GLYLPDIVGD RRPVNQPGTA
DAYPNWRVPM ADGNGRVVLL DEVFSDPAIR AVAQTLARLL RGGRAT