Gene Information |
Locus tag | Athe_0558 |
Symbol | |
ID | 7408684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 629695 |
End bp | 631647 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714941 |
Product | 1,4-alpha-glucan branching enzyme |
Protein accession | YP_002572457 |
Protein GI | 222528575 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR01515] alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase |
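The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported 1953 bp) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Athe_0558.
length = gene_length(629695, 631647)
print(length)                  # 1953
print(protein_length(length))  # 650
```

Strand does not affect these numbers: the gene lies on the minus strand, but the span between the two coordinates is the same.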
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000187675 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATAAAAA AAGTAAAATC TACTATTTAT CTATCTGATA TAAAAAAATT TGAATCAGGA GAGCATTTTG AAAGTTATAA GTTCTTGGGA AGCAAGGTTG TAAACTACAG AGGCAAGGTT GGAACAGTTT TCTGTGTGTG GGCACCAAAT GCTAAAAGCG TATCTGTTGT TGGAAATTTT AATAATTGGC GCGGTGAAAA CCATAAGATG ATGAGAGTTT ATGGAAGCGG ATTTTGGTGG CTATTTGTAG AGGGCATTGG TGAAGGAGAG CTCTACAAAT ATGAAATTAT TGGTGCTGAT GGGAAAAGGG TTTTAAAAGC TGACCCGTAT GCAATCTATT CTGAGAAACG TCCCAATACA GCATCAATTG TCAAAAACAT CCCGGACTAT GAATGGCATG ACCAGGAATG GATGGAAAAA AGAAAAACGA CTCCACCATA TGACAAGCCC ATCAATATCT ATGAGGTTCA TCTTGCATCA TGGAAAATGA AAAAGGATGG AAGCATAGAG AAGGCTGGCG AGTTTTATAA CTATCGTGAA CTTGCTCACA TGCTGGTAGA CTACATAAAA GAAATGAATT ATAATTACAT TGAACTTCTG CCAGTTTTAG AACATCCTCT TGACATGTCA TGGGGTTACC AGCCAACAGG TTACTTTTCT CTCACATCAC GTTATGGTAG TATTGAGGAT TTTATGTATT TTGTTGACTA TATGCACCAA AATGGAATTG GAGTAATAGT TGACTGGGTG CCAGCTCATT TTTGCAAAGA TGAGCATGGA CTTTATAGGT TTGACGGAAC ATTTTTATAT GAATATGAGG ATGAACTTTT GAGAGAAAAC TACACATGGG GTACAGCCAC ATTTGACTTC GCAAAACCCC AGGTTCAAAG CTTTCTTATC TCAAGCGCCA TGTTTTGGTT TGATGTTTAT CACATTGACG GAATAAGGGT AGATGCTGTT TCTCACATCA TCTATATGAA CAACAACCAG AAAAACAGGT ATGGCGGACA TGAAAACATA GAAGGAATTG AATTTATAAA AAAGCTTAAC AAGGCAATAT TCTCAAAATA TCCAAATGTC CTGATGATTG CCGAAGAGTC AACTGCATTT CCTCTGGTCA CATATCCAAC ATACGATGGA GGGCTTGGCT TTAATTACAA GTGGAACATG GGATGGATGA ACGACACCTT AAAGTACATG CAAAAACACC CAGATGAGAG AAAACAGCAT CACAATCTTT TGACATTTTC TATAATGTAT GCATTTTCTG AAAACTTTAT TCTGCCGTTT TCACACGACG AGGTTGTCCA TGGAAAAAAA TCTTTGCTTG ACAAAATGCC AGGTGATTAC AATCAAAAAT TTGCAAACCT AAGACTTCTT TACGGTTATA TGTACACTCA TCCGGGCAAA AAGCTCCTTT TCATGGGTGG TGAGTTTGGT CAGTTCATTG AATGGAGATT TTATGCTTCG CTTGACTGGC TACTTTTGGA CTACCCCATG CACCGCATGC TTCAGCACTA TGTTAAAAGT TTAAACAGAT TTTACTTAGA AAACAAAGCT TTGTGGGAGC TTGACCATAA AATGAATGGT TTTAGATGGA TAGATGTTCA CAACTGGGAG CAAAGTGTCA TATCATACTT GAGAATTTCC AAAGAACCTG ATGATTACCT TGTTGTCATT TGTAACTTTA GTCTTGCTTC ATATGAAAAT TACAAAATAG GCGTGCCAAA AAAAGGCATT TATCTGGAAG TATTTAACAG TGATAAAGCT GAATTTGGTG GTAATAATAT AGTAAACACA 
GAAAAACTTA AAACAATTGA TGAAGTGTGG CATGGGTATA ACCAGTGTAT AGAATTTAGG CTTCCTGCAC TTTCGTGTTT GATTTTCAAG CCAATTGAAT TTTTCAATGC TCAAGAAGAA AAGAATCAAA ATGACAACAA TATTCAGATA TAG
Protein sequence | MIKKVKSTIY LSDIKKFESG EHFESYKFLG SKVVNYRGKV GTVFCVWAPN AKSVSVVGNF NNWRGENHKM MRVYGSGFWW LFVEGIGEGE LYKYEIIGAD GKRVLKADPY AIYSEKRPNT ASIVKNIPDY EWHDQEWMEK RKTTPPYDKP INIYEVHLAS WKMKKDGSIE KAGEFYNYRE LAHMLVDYIK EMNYNYIELL PVLEHPLDMS WGYQPTGYFS LTSRYGSIED FMYFVDYMHQ NGIGVIVDWV PAHFCKDEHG LYRFDGTFLY EYEDELLREN YTWGTATFDF AKPQVQSFLI SSAMFWFDVY HIDGIRVDAV SHIIYMNNNQ KNRYGGHENI EGIEFIKKLN KAIFSKYPNV LMIAEESTAF PLVTYPTYDG GLGFNYKWNM GWMNDTLKYM QKHPDERKQH HNLLTFSIMY AFSENFILPF SHDEVVHGKK SLLDKMPGDY NQKFANLRLL YGYMYTHPGK KLLFMGGEFG QFIEWRFYAS LDWLLLDYPM HRMLQHYVKS LNRFYLENKA LWELDHKMNG FRWIDVHNWE QSVISYLRIS KEPDDYLVVI CNFSLASYEN YKIGVPKKGI YLEVFNSDKA EFGGNNIVNT EKLKTIDEVW HGYNQCIEFR LPALSCLIFK PIEFFNAQEE KNQNDNNIQI
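The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method, of stripping the whitespace and computing a GC fraction that could be compared against the GC content field. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record.
example = "ATGATAAAAA AAGTAAAATC"
seq = clean_sequence(example)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.15
```

Running the same two functions on the full gene sequence would give a value to compare with the GC content field, which the record reports rounded to a whole percent.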