Gene Cthe_2834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2834 
Symbol 
ID4809671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3350172 
End bp3351857 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content42% 
IMG OID640108254 
Producthypothetical protein 
Protein accessionYP_001039226 
Protein GI125975316 
COG category 
COG ID 
TIGRFAM ID[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0278532 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACTA TAACGTTATA TGCCGGAAAA ATCAACCAAA TACCCGGATT GATAAATGAA 
GTCAAGAAAT CTGTGGTGGA TTACAAGTCA GAATTATCCG CATTGAGAAA GAAAACTTTG
AACATCAACA GAAGTGTATG CAATTTGGAT GAAGTAATAA GTTCCATACA GGCATCTTCC
CAGACTCAGG ATAGAAAAAT TGATTCACTT GAGAAATTCT GCAGTGAAAG TGAGAAGTTT
ATATCGGAAG TAGTACGTAT CGATGAAGAA GTTGCTGAGC TTATCAATAA ACGGAAAGAA
AATTTTTACA AAGAATATTA TTATTTAAAA CCGGAAAGCG AGAAAAGCGG CTGGGAAAAA
ATCAAGGACG GCTTAAAGTC GGTTGCGGAG TGGTGTAAAG AGAATTGGAA ATCCATTGCC
AAGATAGTGG CTGCCGCAGT AGTTATTACC GGGTTAGGGA TAGCGGCGGC ATTGACAGGA
GGGGTATTGG GAGTCATACT GGCAGGAGCA TTCTGGGGAG CATTGGCCGG AGGATTGATA
GGAGGAGCGG TTGGAGGAAT AGCCGCGGCG ATAAATGGAG GATCGTTTCT GGAAGGATTT
GCGGACGGCG CTTTAAGCGG AGCAATTTCC GGAGCGGTGA CAGGAGCGGC ATGTGCCGGG
CTTGGTGCTT TAGGAGCTCT AGCAGGGAAA AGCATCCAAT GTATGAGCAC AGTGGGAAAA
GCGATAAATG TTACGTCAAA GGTTACGGCA GCACTTTCTT TTGGTATGGA TGGATTTGAC
ATGCTGGCAA TGGGAATATC ATTGTTTGAT CCATCCAATG CATTGGTTGA ATTTAACCGG
AAGCTGCATT CCAGTGCACT TTACAACGGA TTCCAGATTG CTGTAAACGC GCTGGCTGTT
TTCAGTGCCG GGGCGGCATC GACAATGAAG TGCTTTGTTG CAGGTACAAT GATATTGACT
GTGGCAGGCT TGGTTGCGAT AGAGAATATC AAGGCAGGGG ACAAGGTAAT TGCGACGAAT
CCGGAGACTT TTGAAGTAGC GGAAAAGACG GTGCTTGAGA CATATGTGAG AGAAACAACG
GAGCTTTTGC ATTTGACAAT CAATGGAGAG GTAATCAAGA CAACCTTTGA GCATCCGTTT
TATGTAAAAG ATGTGGGTTT TGTTGAAGCG GGAAAACTGC AAGTAGGAGA TAAGTTGGTT
GATTCAAGAG GCAATCTTTT GGTGGTGGAA GAGAAAAAGC TTGAAATAAC AGATAAGCCT
GTAAAGGTTT ACAATTTTAA GGTCGATAAT TTTCATACGT ATCATGTTGG CGAAAATAGG
GTATTGGTTC ATAATGCGAA TAAGTATGTT AAGGGAACGC GTAGTACTCA GTTGACGTTT
GATGAAGCAC TGAAAAAGTT AGACAAGTCA GGCTTACGAC CGGGTCAAAC AGAAATTTCA
AAGAGTAGGG TTATGGAAAT CGTAGAGAAT TATGATCCTA TGAAAGCACA AAGCAGTGTG
TATACTGATT CAACGGGTAG ATATTTAGTT GAAGGCCATC ATACAACTGT CGCAAATACA
ATGCTAGGAA AAGGATCTGG GGTGAATATG AATATACCTA CACAGCAGAT ACCATCTGCT
ACAAATGTCT ATTGGACAAA AAAGTGGTAT GAATTTTGGA AAACACAAAT AAAAGTAACA
AAATAA
 
Protein sequence
MATITLYAGK INQIPGLINE VKKSVVDYKS ELSALRKKTL NINRSVCNLD EVISSIQASS 
QTQDRKIDSL EKFCSESEKF ISEVVRIDEE VAELINKRKE NFYKEYYYLK PESEKSGWEK
IKDGLKSVAE WCKENWKSIA KIVAAAVVIT GLGIAAALTG GVLGVILAGA FWGALAGGLI
GGAVGGIAAA INGGSFLEGF ADGALSGAIS GAVTGAACAG LGALGALAGK SIQCMSTVGK
AINVTSKVTA ALSFGMDGFD MLAMGISLFD PSNALVEFNR KLHSSALYNG FQIAVNALAV
FSAGAASTMK CFVAGTMILT VAGLVAIENI KAGDKVIATN PETFEVAEKT VLETYVRETT
ELLHLTINGE VIKTTFEHPF YVKDVGFVEA GKLQVGDKLV DSRGNLLVVE EKKLEITDKP
VKVYNFKVDN FHTYHVGENR VLVHNANKYV KGTRSTQLTF DEALKKLDKS GLRPGQTEIS
KSRVMEIVEN YDPMKAQSSV YTDSTGRYLV EGHHTTVANT MLGKGSGVNM NIPTQQIPSA
TNVYWTKKWY EFWKTQIKVT K