Gene Information |
Locus tag | Acel_1601 |
Symbol | |
ID | 4484656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 1800240 |
End bp | 1802540 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639730387 |
Product | 4-alpha-glucanotransferase |
Protein accession | YP_873359 |
Protein GI | 117928808 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.154546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGCGA GCCAGGCGAG CCTCGACCCG CAGTTGGCCG AGCTTGCGCG CAGCTACGGA ATCGCCACCA CATTCCCTGA CTGGCAGGGC AGACCGGCCG AGGTGAGCCG CCGGGCGGTG CTCGCCGTTC TCCAGGCCCT GGGCGTCGAC GCGTTGGATC CGCGGCGGAT TCCGGAATTG CTCGCGGAAC GGCGTGCCGA GAGCCTGCGG CGGTTCCTGC CGTCGTACGT CGTCGTCCGG GAGGGACGAT CCGCGGTGAT TCCGGTGCGG CTGCCGGACG GCGTCCCCGC CGCGGTGAGC ATCGAGACGG AAGATGGCGG GCGGGTCGCG CTGACCCTCC GTCGGCGGGA CGGCGAGGCA CACCACGTCG ACGGCCGGGC AATTGTGACG TACGACGTGG AGCTCCCCGC GGACCTGCCG GCCGGTTACC ACCGGGTGCA CGGAGCCGGC GGGGAGTGGA CGGCGGAGTC CCTCGTCCTC GTGAGCCCGC GAAAACTTCC TGTCCCACCC GATCCCGTCA CCGGCCTGAT GTGCCAGTTG TACGCCGTCC GCTCGACCGC GTCGTGGGGC ATCGGCGATG CGGCGGACCT TCGCGCCCTG GCGGAATGGG CGGCTCGTGA GCTGGGATGC GGATTCGTGC TCGTCAATCC CCTCCATGCC CCGGCGCCGA CGCTGCCAAT GGAGCCGTCG CCGTATTTTC CGTCGTCCCG CCGGTTCGCT GACCCGCTCT ATCTGCGGAT CGAGGACATT CCCGAAGCGG CCGGCTGCGC GGACGCCGAT GCCGCGGCGT TGCGGGCGGA GAATTCCGTG GACCGGCTTA TCGACCGCGA TCGTATCTGG ACGGTGAAAC GCGCCGCACT GGAGAAAGCC TTTGCGGTCT GGGAAGACTG TTCGACCAGC GCGCGACGCG CCGAATTTCA CCGATTCCAG GAGGAACAAG GCGCCGCGCT GACGAATTTC GCGACCTGGT GTGCGCTCGC GGAGCAGTAC GGACCGAAAT GGCGGAATTG GCCGCCGGAA TTTCGGGATC CGGCGGGCCG CGCGGTGTCG GAATTCCGGG CCGCGCAGCA CCGTCGGGTG CAGTTTTACG GCTGGTTGCA GTGGCTTGCC GCCGGGCAGT TGGCTGCTGC GCACCGGTCG GCGCGTGCTG TCGGCATGCC GCTCGGCGTC GTCCACGATC TCGCGGTCGG CGTGGACCCG GACGGTGCCG ACGCCTGGGC GTACGCGTCG GTGATTGCGC CCGGCGTCAC CCTGGGCGCA CCCGCGGACA TGTACAACCA GCAGGGCCAG CGGTGGAATT TGGCGGCCTG GCATCCGGAC CGGTTGGCCC GCGCCGGATT CCAGCCGTTG CGAGACACGG TCCGCGCGTG GTTGGCGCTC GGCGGCGGCC TGCGCATCGA CCACATCCTC GGGTTCTTCC GGCAGTGGTG GATCGCCGAC GACGCACCCG CCGCTGACGG CGCGTACCTG GAGATGGACG CCGACGCGCT GCTGGGCGTC GTCCGGATCG AGGCGGCCCG CGCTGGAGCT GTGGTGATCG GTGAGGACCT GGGCGTGGTG CCTGCCGGCG TGCGTGAGCG GCTGCGCGCC GAGGACATCA TGGGCACGTC CGTGCTGTGG TTCGAGCGGG ATCGGTCCGG CCGGCCGAGT CCACCGGCGC ACTGGCGGCG GGAGTGCCTG GCGACGGTGA CAACCCATGA CCTGCCGCCG ACCTGCGGCT ACCTGCTCGG CGTACACGTC GACCTGCGCG CCCGGTTGGG TCTCCTCGCC CGGGACGAAG CGGCCGAGCG GGCGGCGGAC 
GAGGCGGACC GTCTCGACTG GCTGCGTGTC CTGGCCGCCG AGGGCCTTCT CGACCCGGCG ATCCTCGCGG AGATCACTGG CCGGCAGACG GCGGCCGAGG CCCAGCCGGA CGCCGTACCC GCAGCCGGGG CCCAGCCGGA CGCCGTACCC GCAGCCGGGG CCCAGCCACG CGCCGTACCC GCCCGCGAGG CCGAGCCGGA CGCCGTGCCC GCGGACTTCG CGGCACGCCT GCGGCCGCAC CTGGATGCCG TCCGCGCCGC GCTCTACGGG TACGTCGGCC GGACACCGGC GCTGCTCCGG GGCCTCTACC TCCCGGACAT CGTGGGCGAT CGACGTCCGG TGAATCAGCC GGGCACCGCG GACGCGTATC CGAACTGGCG GGTGCCGATG GCGGACGGCA ACGGACGGGT CGTCCTCCTC GACGAGGTGT TCAGCGATCC GGCGATCCGC GCGGTGGCTC AAACGCTGGC TCGACTGCTG CGCGGCGGCC GGGCGACATG A
|
Protein sequence | MGASQASLDP QLAELARSYG IATTFPDWQG RPAEVSRRAV LAVLQALGVD ALDPRRIPEL LAERRAESLR RFLPSYVVVR EGRSAVIPVR LPDGVPAAVS IETEDGGRVA LTLRRRDGEA HHVDGRAIVT YDVELPADLP AGYHRVHGAG GEWTAESLVL VSPRKLPVPP DPVTGLMCQL YAVRSTASWG IGDAADLRAL AEWAARELGC GFVLVNPLHA PAPTLPMEPS PYFPSSRRFA DPLYLRIEDI PEAAGCADAD AAALRAENSV DRLIDRDRIW TVKRAALEKA FAVWEDCSTS ARRAEFHRFQ EEQGAALTNF ATWCALAEQY GPKWRNWPPE FRDPAGRAVS EFRAAQHRRV QFYGWLQWLA AGQLAAAHRS ARAVGMPLGV VHDLAVGVDP DGADAWAYAS VIAPGVTLGA PADMYNQQGQ RWNLAAWHPD RLARAGFQPL RDTVRAWLAL GGGLRIDHIL GFFRQWWIAD DAPAADGAYL EMDADALLGV VRIEAARAGA VVIGEDLGVV PAGVRERLRA EDIMGTSVLW FERDRSGRPS PPAHWRRECL ATVTTHDLPP TCGYLLGVHV DLRARLGLLA RDEAAERAAD EADRLDWLRV LAAEGLLDPA ILAEITGRQT AAEAQPDAVP AAGAQPDAVP AAGAQPRAVP AREAEPDAVP ADFAARLRPH LDAVRAALYG YVGRTPALLR GLYLPDIVGD RRPVNQPGTA DAYPNWRVPM ADGNGRVVLL DEVFSDPAIR AVAQTLARLL RGGRAT
|
| |
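The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon, which matches the trailing TGA in the gene sequence. The helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon (assumed present).
    return gene_len_bp // 3 - 1


# Values copied from the record for Acel_1601.
start_bp, end_bp = 1800240, 1802540
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(length_bp)  # 2301, matching "Gene Length"
print(length_aa)  # 766, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields, so the record is internally consistent.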