Gene Cthe_2030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2030 
Symbol 
ID4811000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2406778 
End bp2408454 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content41% 
IMG OID640107439 
Producthypothetical protein 
Protein accessionYP_001038434 
Protein GI125974524 
COG category 
COG ID 
TIGRFAM ID[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00279579 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACTA TAACGTTATA TGCCGGAAAA ATCAACCAAA TGCCCGGATT GATAAAAGAA 
GTCAAGAAAT CTGTGGTGGA TTACAAGTCA GAATTATCCG CATTGAGAAA GAAAACTTTG
AACATCAACA GAAGTGTATG CAATTTGGAT GAAGTAATAA GTTCCATACA GGCATCTTCC
CAGACTCAGG ATAGAAAAAT TGATTCACTT GAGAAATTCT GCAGTGAAAG TGAGAAGTTC
ATATCGGAAG TAGTACGTAT TGATGAAGAA GTTGCTGAGC TTATCAATAA ACGGAAAGAA
AATTTTTACA AAGAATATTA TTATTTAAAA CCGGAAAGCG AGAAAAGCGG CTGGGAAAAA
ATCAAGGACG GCTTAAAGTC GGTTGCGGAG TGGTGTAAAG AGAATTGGAA ATCCATTGCT
AAAATAGTAG TTGCGGCGGT AGTTATTGCA GGATTGGGGA TAGCGGCGGC TTTGACAGGC
GGGATATTGG GAGTCGTACT GGCAGGAGCA TTCTGGGGAG CATTGGCCGG AGGATTGATA
GGGGGAGCGG TTGGAGGAAT AGCCGCTGCG ATAAATGGAG GATCGTTTCT GGAAGGATTT
GCGGACGGCG CTTTAAGCGG AGCAATTTCC GGGGCTGTGA CAGGAGCGGC ATGTGCCGGG
CTTGGTGCTT TGGGAGCTCT AGCAGGGAAA AGTATCCAAT GCATGAGCAC AGTGGGAAAA
GCGATAAATG TTACATCAAA GGTTACGGCA GCACTCTCGT TTGGTATGGA TGGATTTGAC
ATGCTGGCAA TGGGAGTATC ATTGTTTGAA CCATCCAATG CATTGGTTGA ATTTAACCGG
AAGCTGCATT CCAATGCATT TTATAACGGA TTCCAGATTG CTGTAAACGC GCTGGCTGTT
TTCACTGCCG GGGCGGCATC GACAATGAAG TGCTTTGTTG CAGGTACGCT GATATTAACT
GCGACAGGCT TGGTTGCGAT AGAGAATATC AAGGCAGGGG ACAAGGTAAT TGCGACGAAT
CCGGAGACTT TTGAAGTAGC CGAGAAGACA GTGCTTGAGA CATATGTGAG AGAGACGACG
GAGCTTTTGA ATTTGACAAT CAATGGAGAG GTAATCAAGA CAACCTTTGA GCATCCGTTT
TATGTTAAAG ATGTGGGTTT TGTTGAAGCG GGAAAACTGC AAGTAGGAGA TAAGTTGGTT
GATTCAAGAG GTAATGTTTT AGTATTGGAA GGTAAAAAGC TTGAAATAAC AGATAAGCCT
GTAAAGGTTT ACAATTTTAA GGTCGATAAT TTTCATACGT ATCATGTTGG CGAAAATAGG
GTATTGGTTC ATAATGCGAA TAAGTATGTT AAGGGAACGA GTAAGACTGA GATAATTGGT
AAGCCACATG CTTCAGCTCA ACATCAAGCT TTTACTATGG ACGAAGTAAA TAAGTTATCT
TCAACAGGTG AATTCTCAAA AATATACATA AATAAGTCTT TAAAAACTGC AGGTTTTAAT
GGAACACAAA AACCTGATAT AATTGCAGTA GGGAAAAACG GAAATGGACA TATTGTTGAG
ATTGCCGGTC CTAGTCAGTT ATCAGGTAAA CCTAAGTATG CTCTAAAGAA CAAATTCAAT
ACTATGTTAC AAAATAACCC TGGAATGACT GGTGATTTGA TATTCCCTGA ATATTAA
 
Protein sequence
MATITLYAGK INQMPGLIKE VKKSVVDYKS ELSALRKKTL NINRSVCNLD EVISSIQASS 
QTQDRKIDSL EKFCSESEKF ISEVVRIDEE VAELINKRKE NFYKEYYYLK PESEKSGWEK
IKDGLKSVAE WCKENWKSIA KIVVAAVVIA GLGIAAALTG GILGVVLAGA FWGALAGGLI
GGAVGGIAAA INGGSFLEGF ADGALSGAIS GAVTGAACAG LGALGALAGK SIQCMSTVGK
AINVTSKVTA ALSFGMDGFD MLAMGVSLFE PSNALVEFNR KLHSNAFYNG FQIAVNALAV
FTAGAASTMK CFVAGTLILT ATGLVAIENI KAGDKVIATN PETFEVAEKT VLETYVRETT
ELLNLTINGE VIKTTFEHPF YVKDVGFVEA GKLQVGDKLV DSRGNVLVLE GKKLEITDKP
VKVYNFKVDN FHTYHVGENR VLVHNANKYV KGTSKTEIIG KPHASAQHQA FTMDEVNKLS
STGEFSKIYI NKSLKTAGFN GTQKPDIIAV GKNGNGHIVE IAGPSQLSGK PKYALKNKFN
TMLQNNPGMT GDLIFPEY