Gene Cthe_2191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2191 
Symbol 
ID4810907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2610189 
End bp2612402 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content44% 
IMG OID640107597 
Product1,4-alpha-glucan branching enzyme 
Protein accessionYP_001038586 
Protein GI125974676 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR01515] alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACAA CAGCAAATAT TGACGAGGTT TACAAGGTTA TCAATGCAGA GCATCATGAT 
CCGTTCAGTG TGCTGGGAAT GCACCGTCTG GAATCTGAGA ATGCAATGGT CGTAAGGGCA
TATCTGCCAA ATGCAAAAGA AATAGAAGTA GTGGAACTCT CCAAAAACAA CACATACCCA
ATGGAGAAAA TCGACGAGAG GGGCTTTTTT GAGGTTGTAA TAAAAGACAG GAATGACTTT
TTCAAGTATA ACCTGAGGGC CACCGACTAC GTAGGCAACA CTTTTACCTT TTACGACCCT
TACTGTTTCA TGCCCGTGAT CTCTGACTAT GACCTTTATT TATTCAATGA GGGCAACCAT
CATAAAATCT ACGAAAAGCT TGGTACCCAC AGAATGACTA TTGACGGTGT TGAAGGCACA
CTTTTTGCCG TTTGGGCCCC CTGTGCAAAA CGCGTAAGTG TTGTGGGAAA TTTCAACCAG
TGGGACGGCC GCAGGCACCA AATGAGAGTG AGAGGAAGTT CCGGAGTATG GGAGTTGTTT
ATTCCGGGTG TTGGTGAAGG CGAGCTTTAC AAATATGAAA TAAAGACACC TCACAACGAA
ATTTACATCA AAGCCGACCC CTATGCTTTT TATTCGGAAC TTCGGCCAAA TACAGCTTCC
ATCGTGTATG ATATTGAAGG ATATGAATGG CATGACGCAG ACTGGATGCG TGAAAGAGAC
AGCAGCAATT CCTTTGACAA GCCCATATCC ATATACGAAG TGCATCTTGG CTCATGGAAA
AGAGTCTCCA ACGATGAAAA CGGCTTTTAC TCATACAGGG AACTTGCAGA TATGCTGGTA
GAGTATGTGA AATACATGGG TTACACCCAT ATTGAACTTC TGCCGATTGC AGAACATCCT
TTTGACGGTT CGTGGGGATA CCAGGTTACA GGATATTATG CAGCAACCAG CAGATACGGA
CAGCCAAAAG ATTTTATGTA TTTTGTGGAC AAATGCCACC AAAACGGCAT AGGAGTTATA
ATCGACTGGG TACCGGCTCA TTTTCCCAAA GACGGCCACG GTCTGGCAAG ATTTGACGGC
ACAGCCCTGT ATGAGCATTA CGACCCCAAA CAAGGCGAGC ATCCTGACTG GGGAACTCAT
ATATTCAATT ACGGAAGAAA TGAAGTTAAA AACTTCTTAA TTGCCAATGC CATGTTCTGG
TTCGATAAAT ACCATATTGA CGGACTTAGG GTTGACGCAG TGGCATCCAT GCTCTATCTG
GATTACGGCA AAAAGGACGG GGAATGGATA CCCAATCGCT GGGGAGGAAA AGAAAACGTC
GACGCCATTG AGTTTATGAG GCAGCTGAAT TCAACCGTGT TCCAATACTT CCCCGGTGTT
ATGATGATCG CTGAGGAATC CACAGCCTGG GCTTTAGTAA CCAAACCTCC TTACACAGGT
GGTCTTGGCT TCTCGTACAA ATGGAACATG GGCTGGATGA ACGACTTTTT GCGCTATATG
AGTATGGACT CGGTATACAG AAAGTACCAT CAAAATCTTA TAACCTTCTC CCTGATGTAC
GCTTTCTCGG AAAACTTTAT ACTTGTACTC TCCCATGACG AAGTAGTACA CGGCAAATGC
TCCATGATAA ACAAAATGCC GGGAGATTAC TGGCAAAAGT TTGCGGGCCT GCGTGCAAGC
TACGGTTACC TGTACGGCCA TCCCGGCAAG AAACTTCTTT TCATGGGGGG AGAATTTGCA
CAGTTTATTG AATGGAACTA CAAGCAAAGC CTTGACTGGT TCCTGCTGGA TTATGACATG
CATAAGAAAA TGCAGGATTA TGTACGGGAC CTCAACAAGC TTTACAGAAG CGAAAAAGCA
CTGTATGAAG TTGATTTCCA TTATGATGGC TTTGAATGGA TAGATTGCAA CGATACGGAA
CACAGCATTA TTTCCTTTAT GAGAAAGGGC AAGGACTGGC ATAATTCTCT CATATTTGTG
TGCAACTTCA CTCCCGTGCC CCATGAAGAC TACCGCATTG GTTCGCCTTT TAACACCACT
TATGATGAGA TTTTCAACAG CGACTGGGAA AAATACGGCG GAAGCAATGT CGGAAACTTC
GAGCCTATTA AGGCTGAAGA GATAAGCATG CACAACAAAC CTTATTCCAT GAGGCTTCGC
ATTCCGCCGC TTGCAACAAT TGTGCTAAAG CCAAGGTTTG ACAGGAAGGA TTAA
 
Protein sequence
MNTTANIDEV YKVINAEHHD PFSVLGMHRL ESENAMVVRA YLPNAKEIEV VELSKNNTYP 
MEKIDERGFF EVVIKDRNDF FKYNLRATDY VGNTFTFYDP YCFMPVISDY DLYLFNEGNH
HKIYEKLGTH RMTIDGVEGT LFAVWAPCAK RVSVVGNFNQ WDGRRHQMRV RGSSGVWELF
IPGVGEGELY KYEIKTPHNE IYIKADPYAF YSELRPNTAS IVYDIEGYEW HDADWMRERD
SSNSFDKPIS IYEVHLGSWK RVSNDENGFY SYRELADMLV EYVKYMGYTH IELLPIAEHP
FDGSWGYQVT GYYAATSRYG QPKDFMYFVD KCHQNGIGVI IDWVPAHFPK DGHGLARFDG
TALYEHYDPK QGEHPDWGTH IFNYGRNEVK NFLIANAMFW FDKYHIDGLR VDAVASMLYL
DYGKKDGEWI PNRWGGKENV DAIEFMRQLN STVFQYFPGV MMIAEESTAW ALVTKPPYTG
GLGFSYKWNM GWMNDFLRYM SMDSVYRKYH QNLITFSLMY AFSENFILVL SHDEVVHGKC
SMINKMPGDY WQKFAGLRAS YGYLYGHPGK KLLFMGGEFA QFIEWNYKQS LDWFLLDYDM
HKKMQDYVRD LNKLYRSEKA LYEVDFHYDG FEWIDCNDTE HSIISFMRKG KDWHNSLIFV
CNFTPVPHED YRIGSPFNTT YDEIFNSDWE KYGGSNVGNF EPIKAEEISM HNKPYSMRLR
IPPLATIVLK PRFDRKD