Gene Cthe_1631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1631 
Symbol 
ID4809326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1958009 
End bp1959793 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content32% 
IMG OID640107047 
Producthypothetical protein 
Protein accessionYP_001038048 
Protein GI125974138 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000289868 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATTTTT TGACAAAACG TGTTATAATT CCTCCAAATA AGATTATTAT CGGAGGAGTA 
ACTTTGGATC GTATAACTCA GGCTTTTTTA GATGAATTTT CTATTAGCCA TAATTTTACT
ATGTACGAAA CAAGTGTTCA ATTCGAGCAT TTTGCCAATT TCTGTGCTTT ATCTGCAGAA
ACTGGCATGG TTGAAATAGA TATTCAAGAC ATGCATACTG GCAATGCTAC TCAAGGTATT
GATGGAATAG CGATAGAAGT AAACGGAGCT ATTGTGTGCA GTATTGATGA GATAGAAACG
TTAATTAAGC AAAATAAAAA ACTTGATGTA AAATTTATAT TTGTACAAGC AAAAACATCT
GACAGTTTTG ATAATTCAGA GATAAGTAAC TTTTTGTCTT TTGTTAAAGT CTTTTTTTCT
GATGAAGCTA AGAATACATT CAGTACAGAG GAAATGGCAG ACTTCATAGA AATGAAGGAT
TTTATTTATA GTAATTCTCG CTATATGAAA GTCAAAAACC CGATAATACG ACTTTATTAT
ATTGCTCCGG GGAAGTGGAA CGATGATGAT TCCAATTTGA AAGCTGTAAT TAATAGTCAT
ATAGATACGC TTAATAACAT GGCACTTTTT TCATCTGTGG AATTTATACC TTGCGGTGCC
CAAGAAATAC AGCGTATGTA TAGAAAATCA CAAGAGCAAA TAGAAGCGAC TTTTGTTTTT
ACAAAAAATG TGATGATGTT TTCTGATGAT AATGGAGATT ATGGATATAG TGGGGTACTG
CCATTCTGCG AATTTTATAA AATTATATGC GATGAAAATG GTTCACTAAA AAAAGTATTT
GAAGATAATA TTCGGGACTT CCTTGGAGTG AATAATTATG TTAATGCGGA CATTGAAGAA
ACTATTGTTG AAGGCAGAAA TAGCGCTTTT TGCATGCTAA ATAATGGAAT AACAATTGTC
GCTCATTCTG CTGTGCTTGT GAGCGATAAG ATGACAATTT CAAACTATCA GATAGTTAAT
GGATGCCAAA CCAGCCATGT TTTGTATCTT AACCGTGATA ATCTTGGAAT ACATGATTTA
CTTATACCGA TTAAGATTAT TGTAACCAAG GATGAGGACT TAAAAAACCG TATTACAAAA
GCTACAAATA ATCAAACTGG TATAACCAAA GAACAATTAG AAGCGTTATC AACTTTCCAA
AAAACACTGG AGGAATACTA TCGCACATAC ACTGCTGAGG ATGAACGCTT GTATTATGAA
CGTCGTTCAG GACAATATAG GAATGAATCG ATTCCCAAAG ATCGAATAGT TACTATTCGT
GCCCAGTTAA AAAATGCATC ATCAATGTTC AATGATAAAC CACACGACGC TGCTGGTCAT
TATAGTAGCT TATTGAAAGA TATTGGAAAC CGTATTTTTC TACCTGACGA CCAGCCTATA
TTGTATTACA CAAGTTCTTT GGCCATGTTT CGTTTCGAAA ACCTGATAAA AACAAAATGT
ATTGATAAAA AATACCGTAA AGGAAAGTAT CATGCCATAA TGCTTTTAAA GTATATGGCA
ACAAACAACT TACCAAAACA TCATAGTGCC AAAAAAATGA TCAATGCTTG CAATCAAATT
TTGCGTATCT TGAATGATTC AGGGAAATGT CTCGATTATT TTTTAAGAAT AATTGAATTC
ATTGAAACAC AAAAAGAATT AGATTTGACG GATCGTAAAT TGTTTGAACG GAAAGAAACA
ACAGATATTT TACTACAAAA TAAGGATAAG TTAATAAGAA GTTAA
 
Protein sequence
MNFLTKRVII PPNKIIIGGV TLDRITQAFL DEFSISHNFT MYETSVQFEH FANFCALSAE 
TGMVEIDIQD MHTGNATQGI DGIAIEVNGA IVCSIDEIET LIKQNKKLDV KFIFVQAKTS
DSFDNSEISN FLSFVKVFFS DEAKNTFSTE EMADFIEMKD FIYSNSRYMK VKNPIIRLYY
IAPGKWNDDD SNLKAVINSH IDTLNNMALF SSVEFIPCGA QEIQRMYRKS QEQIEATFVF
TKNVMMFSDD NGDYGYSGVL PFCEFYKIIC DENGSLKKVF EDNIRDFLGV NNYVNADIEE
TIVEGRNSAF CMLNNGITIV AHSAVLVSDK MTISNYQIVN GCQTSHVLYL NRDNLGIHDL
LIPIKIIVTK DEDLKNRITK ATNNQTGITK EQLEALSTFQ KTLEEYYRTY TAEDERLYYE
RRSGQYRNES IPKDRIVTIR AQLKNASSMF NDKPHDAAGH YSSLLKDIGN RIFLPDDQPI
LYYTSSLAMF RFENLIKTKC IDKKYRKGKY HAIMLLKYMA TNNLPKHHSA KKMINACNQI
LRILNDSGKC LDYFLRIIEF IETQKELDLT DRKLFERKET TDILLQNKDK LIRS