Gene Cthe_0274 details

Gene Information

Locus tag: Cthe_0274
Symbol:
ID: 4808557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 339019
End bp: 340710
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 44%
IMG OID: 640105686
Product: glycoside hydrolase family protein
Protein accession: YP_001036706
Protein GI: 125972796
COG category:
COG ID:
TIGRFAM ID:
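
The coordinates and lengths listed in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information table.
start_bp = 339019
end_bp = 340710
gene_length_bp = 1692
protein_length_aa = 563


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Under these assumptions, 340710 - 339019 + 1 = 1692 bp, and 1692 / 3 = 564 codons, one of which is the stop codon, leaving 563 residues.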


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00113767
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAA TAGCTGTATT GTTAATTACT TTGACGTTCT TAGTCACGAC TCTGTTTTCA 
ATGTCATTTT CATCGGCAGC TGCAAGTCAT GATTATGCCA CCGCATTAAA ATACTCAATT
TTATTTTACG ACGCCAACAA ATGCGGTCCG GATGCAGCAG TTGACAATGT CTTCAGTTGG
AGAGGACCGT GTCACACGAC TGACGGAAGT GAAATAGGCC TGGATCTGAC CGGAGGATAT
CATGATGCCG GAGACCATGT CAAATTCGGT TTGCCCCAGG CTTATGCGGC AGCGGTACTG
GGTTGGTCAC TTTATGAATA CAAAGGAGTG TTTGACGCAA CAGGAAACAC CTCCAAAATG
CTCAGCACTC TCAAATATTT CACCGACTAT CTTTTAAAAT GTCATCCGGA CTCCAATACC
TTCTACTATC AAGTAGGAGA CGGTCAGGCA GACCATACAT ACTGGGGTGC GCCGGAAGTC
CAGCCGGGTC CGAGACCCGT ACCCTATGTT GCCAATGCCT CAAATGCGGC TTCTGATGTA
TGTGGTTTGA CATCCGCCGC TCTTACGATA ATGTACCTAA ACTACAAGGA TATAGACCAA
AACTATGCCA ACAAATGTTT AAAAGCTGCA AAAGAACTTT ATACAATGGC AAAAACCAAT
TTGGGCTACT ATGCCGAGAA TGCTTTTTAC ATCTCCCACT CCTACTGGGA CGATCTTTCC
TTTGCAGCCA CCTGGCTCTA TGTAGTGGAA AAAGACCCGA CTTACCTGAA GGAAATTGAC
AGCTATCTTT CAAATAAAAC CCTTTGGGGA GAGAGTCCTT TCAACAACAA ATGGACAATG
TGCTGGGATG ACATGTACAT GGCTGTCTTC TGCAAACTTG CAGAGATAAC AGGTGAACAA
AAGTACATTG ACGCAATGAA TTACAATCTC GATTACTGGA TGAATTCCCT TAATACAACT
CCCGGAGGCC TTAAGTATCT TGACAGCTGG GGAGTATTAA GATATGCGGC CGCCGAGGCC
TTTATCGCCA TGAGATACTA CGAGCTTACC AAAAATGAAG CATTAAAATC CTTTGCAAAA
TCTCAAATAG ACTACATACT TGGCAGCAAT CCCATCAACA TGTCCTATGT TATAGGTTAT
GGATCAAACT ACCCAAAATG TCCTCACCAC AGGGCAGCCA ACGGCTACAC TTACGCCAAC
GGTGACAATG CAAAACCTGC CAAAAACCTT CTTTTAGGTG CTTTGGTGGG CGGTCCGAAT
ATGTCTGACA ACTTTATCGA TGATGTCAAT CAGTTCCAGT TTACGGAAGT GGCTATTGAC
TATAATGCTG CTTTCGTGGG CGCTCTGGCT GCTATTGAAA AATACTACGG CAATATCGTT
ATACCCACTC CTCCAGCCAC TACACCCCCG TCTCCTACCG CAACGCCTTC CTTAATATGG
TGTGATGTCG GAGACTTAAA CGTTGACGGT TCAATAAACT CAGTAGACAT TACATACATG
AAAAGGTATC TTTTGCGCAG TATAAGTGTC CTTCCTTACC AGGAAAATGA AAGGATTCGC
ATACCGGCGG CAGACACCAA CGGCGACGGT GCAATCAATT CCAGTGACAT GGTATTGCTA
AAAAGATATG TCCTTCGCAG TATTAGCGAA TTTCCGGTTA AATATGATAT CTATGGAAAC
ATCATAAATT AA
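
The gene sequence is printed in space-separated blocks of ten bases, so it needs light cleaning before any calculation. As a minimal sketch (the function name `gc_fraction` is hypothetical), the GC content reported for this gene could be recomputed as follows, assuming the sequence contains only the unambiguous bases A, C, G and T:

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    # Remove the spaces and line breaks used for display formatting.
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage with the first two blocks of the gene sequence only.
print(gc_fraction("ATGAGAAAAA TAGCTGTATT"))
```

Passing the full gene sequence as a single string, line breaks included, gives the overall fraction, which can be multiplied by 100 and compared with the GC content listed for the gene.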
 
Protein sequence
MRKIAVLLIT LTFLVTTLFS MSFSSAAASH DYATALKYSI LFYDANKCGP DAAVDNVFSW 
RGPCHTTDGS EIGLDLTGGY HDAGDHVKFG LPQAYAAAVL GWSLYEYKGV FDATGNTSKM
LSTLKYFTDY LLKCHPDSNT FYYQVGDGQA DHTYWGAPEV QPGPRPVPYV ANASNAASDV
CGLTSAALTI MYLNYKDIDQ NYANKCLKAA KELYTMAKTN LGYYAENAFY ISHSYWDDLS
FAATWLYVVE KDPTYLKEID SYLSNKTLWG ESPFNNKWTM CWDDMYMAVF CKLAEITGEQ
KYIDAMNYNL DYWMNSLNTT PGGLKYLDSW GVLRYAAAEA FIAMRYYELT KNEALKSFAK
SQIDYILGSN PINMSYVIGY GSNYPKCPHH RAANGYTYAN GDNAKPAKNL LLGALVGGPN
MSDNFIDDVN QFQFTEVAID YNAAFVGALA AIEKYYGNIV IPTPPATTPP SPTATPSLIW
CDVGDLNVDG SINSVDITYM KRYLLRSISV LPYQENERIR IPAADTNGDG AINSSDMVLL
KRYVLRSISE FPVKYDIYGN IIN
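
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below shows one way to reproduce that translation in plain Python. It is an illustration under stated assumptions, not the method used to generate this record: the codon-to-residue assignments are those of the NCBI bacterial table, alternative start codons are not handled (this gene begins with ATG, so that does not matter here), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display formatting.
    seq = "".join(dna.split()).upper()
    residues = []
    # Step through complete codons only; trailing partial codons are ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Usage with the first two blocks and the final blocks of the gene sequence.
print(translate("ATGAGAAAAA TAGCTGTATT"))  # MRKIAV
print(translate("ATCATAAATT AA"))          # IIN
```

Both fragments agree with the start (MRKIAV...) and end (...IIN) of the protein sequence listed above.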