Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2191 |
Symbol | |
ID | 4810907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2610189 |
End bp | 2612402 |
Gene Length | 2214 bp |
Protein Length | 737 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107597 |
Product | 1,4-alpha-glucan branching enzyme |
Protein accession | YP_001038586 |
Protein GI | 125974676 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR01515] alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACAA CAGCAAATAT TGACGAGGTT TACAAGGTTA TCAATGCAGA GCATCATGAT CCGTTCAGTG TGCTGGGAAT GCACCGTCTG GAATCTGAGA ATGCAATGGT CGTAAGGGCA TATCTGCCAA ATGCAAAAGA AATAGAAGTA GTGGAACTCT CCAAAAACAA CACATACCCA ATGGAGAAAA TCGACGAGAG GGGCTTTTTT GAGGTTGTAA TAAAAGACAG GAATGACTTT TTCAAGTATA ACCTGAGGGC CACCGACTAC GTAGGCAACA CTTTTACCTT TTACGACCCT TACTGTTTCA TGCCCGTGAT CTCTGACTAT GACCTTTATT TATTCAATGA GGGCAACCAT CATAAAATCT ACGAAAAGCT TGGTACCCAC AGAATGACTA TTGACGGTGT TGAAGGCACA CTTTTTGCCG TTTGGGCCCC CTGTGCAAAA CGCGTAAGTG TTGTGGGAAA TTTCAACCAG TGGGACGGCC GCAGGCACCA AATGAGAGTG AGAGGAAGTT CCGGAGTATG GGAGTTGTTT ATTCCGGGTG TTGGTGAAGG CGAGCTTTAC AAATATGAAA TAAAGACACC TCACAACGAA ATTTACATCA AAGCCGACCC CTATGCTTTT TATTCGGAAC TTCGGCCAAA TACAGCTTCC ATCGTGTATG ATATTGAAGG ATATGAATGG CATGACGCAG ACTGGATGCG TGAAAGAGAC AGCAGCAATT CCTTTGACAA GCCCATATCC ATATACGAAG TGCATCTTGG CTCATGGAAA AGAGTCTCCA ACGATGAAAA CGGCTTTTAC TCATACAGGG AACTTGCAGA TATGCTGGTA GAGTATGTGA AATACATGGG TTACACCCAT ATTGAACTTC TGCCGATTGC AGAACATCCT TTTGACGGTT CGTGGGGATA CCAGGTTACA GGATATTATG CAGCAACCAG CAGATACGGA CAGCCAAAAG ATTTTATGTA TTTTGTGGAC AAATGCCACC AAAACGGCAT AGGAGTTATA ATCGACTGGG TACCGGCTCA TTTTCCCAAA GACGGCCACG GTCTGGCAAG ATTTGACGGC ACAGCCCTGT ATGAGCATTA CGACCCCAAA CAAGGCGAGC ATCCTGACTG GGGAACTCAT ATATTCAATT ACGGAAGAAA TGAAGTTAAA AACTTCTTAA TTGCCAATGC CATGTTCTGG TTCGATAAAT ACCATATTGA CGGACTTAGG GTTGACGCAG TGGCATCCAT GCTCTATCTG GATTACGGCA AAAAGGACGG GGAATGGATA CCCAATCGCT GGGGAGGAAA AGAAAACGTC GACGCCATTG AGTTTATGAG GCAGCTGAAT TCAACCGTGT TCCAATACTT CCCCGGTGTT ATGATGATCG CTGAGGAATC CACAGCCTGG GCTTTAGTAA CCAAACCTCC TTACACAGGT GGTCTTGGCT TCTCGTACAA ATGGAACATG GGCTGGATGA ACGACTTTTT GCGCTATATG AGTATGGACT CGGTATACAG AAAGTACCAT CAAAATCTTA TAACCTTCTC CCTGATGTAC GCTTTCTCGG AAAACTTTAT ACTTGTACTC TCCCATGACG AAGTAGTACA CGGCAAATGC TCCATGATAA ACAAAATGCC GGGAGATTAC TGGCAAAAGT TTGCGGGCCT GCGTGCAAGC TACGGTTACC TGTACGGCCA TCCCGGCAAG AAACTTCTTT TCATGGGGGG AGAATTTGCA CAGTTTATTG AATGGAACTA CAAGCAAAGC CTTGACTGGT TCCTGCTGGA TTATGACATG CATAAGAAAA TGCAGGATTA TGTACGGGAC CTCAACAAGC TTTACAGAAG CGAAAAAGCA CTGTATGAAG TTGATTTCCA TTATGATGGC TTTGAATGGA TAGATTGCAA CGATACGGAA CACAGCATTA TTTCCTTTAT GAGAAAGGGC AAGGACTGGC ATAATTCTCT CATATTTGTG TGCAACTTCA CTCCCGTGCC CCATGAAGAC TACCGCATTG GTTCGCCTTT TAACACCACT TATGATGAGA TTTTCAACAG CGACTGGGAA AAATACGGCG GAAGCAATGT CGGAAACTTC GAGCCTATTA AGGCTGAAGA GATAAGCATG CACAACAAAC CTTATTCCAT GAGGCTTCGC ATTCCGCCGC TTGCAACAAT TGTGCTAAAG CCAAGGTTTG ACAGGAAGGA TTAA
|
Protein sequence | MNTTANIDEV YKVINAEHHD PFSVLGMHRL ESENAMVVRA YLPNAKEIEV VELSKNNTYP MEKIDERGFF EVVIKDRNDF FKYNLRATDY VGNTFTFYDP YCFMPVISDY DLYLFNEGNH HKIYEKLGTH RMTIDGVEGT LFAVWAPCAK RVSVVGNFNQ WDGRRHQMRV RGSSGVWELF IPGVGEGELY KYEIKTPHNE IYIKADPYAF YSELRPNTAS IVYDIEGYEW HDADWMRERD SSNSFDKPIS IYEVHLGSWK RVSNDENGFY SYRELADMLV EYVKYMGYTH IELLPIAEHP FDGSWGYQVT GYYAATSRYG QPKDFMYFVD KCHQNGIGVI IDWVPAHFPK DGHGLARFDG TALYEHYDPK QGEHPDWGTH IFNYGRNEVK NFLIANAMFW FDKYHIDGLR VDAVASMLYL DYGKKDGEWI PNRWGGKENV DAIEFMRQLN STVFQYFPGV MMIAEESTAW ALVTKPPYTG GLGFSYKWNM GWMNDFLRYM SMDSVYRKYH QNLITFSLMY AFSENFILVL SHDEVVHGKC SMINKMPGDY WQKFAGLRAS YGYLYGHPGK KLLFMGGEFA QFIEWNYKQS LDWFLLDYDM HKKMQDYVRD LNKLYRSEKA LYEVDFHYDG FEWIDCNDTE HSIISFMRKG KDWHNSLIFV CNFTPVPHED YRIGSPFNTT YDEIFNSDWE KYGGSNVGNF EPIKAEEISM HNKPYSMRLR IPPLATIVLK PRFDRKD
|
| |