Gene Cthe_1413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1413 
Symbol 
ID4809074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1732208 
End bp1734124 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content32% 
IMG OID640106836 
Producthypothetical protein 
Protein accessionYP_001037837 
Protein GI125973927 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGTTT TTGAATGGAA GAAAGTCCTG ATAAAACAAA AAGGATTGCT TTGTATAGGT 
ATTATGTTTC TTCTCAAAAT TGCCCTGTTA TTTTACCAGG GATATGATTC TAACAGCATT
ATCAATAGCA ACGAAGAAGG TTATAAATAC TATATAAACC TCTATCAAGG TAAACTTACG
GAGGAAAAAG AAAAGTCCAT TAAAGCTGAA TATGATAGTG TAACAAATGC ACAGGCTTAC
CTAGAAGACT TGTCTCACAA AAAAAGAAAT GGAGAGATTG GTTTTAAGGA GTATGAAGAA
AAATCTAAAA AGTATTATGA ATGCTTAAAA AACGCAGATG TTTTTAATTT GGTATATAAT
CAATATTACT ACGCGAAAGA AGCTCCGGAT GTCAGATATA TTATTGATTG GAGGGGCTGG
CAGACACTTC TGAGCCATGA CGCGCCGGAT GTATTGCTTA TAGTGTGCTT GCTAATTGTT
ATGGTACCAT TGTTTTGCAA TGAATATGAA AGTGGCATGT ATTCATTGCT TGTATCCAGT
GTTAGGGGAA AGTACAAGGT TGCAATCGTA AAGCTACTGA GCGCATTTGT TTTAAGTGCT
GGCATAGTGA TTTTGTTTTC AGTAGCAGAA TATATTTGCG TGGATTTTAT GGTGGGACTT
GATAATAGTA CATTTCCATT GCAAAGTTTG AAATTTTTTG AATATAGTGA CTGGTATGTT
TCTTTAAGAC AAGCTTTTGT CATAATCGTA TTATTTCGCA TAGTTGGTGC GGTGCTGTTT
ACAGCTTTTA TTTCAGTTGT AAGTGTAATA AGTAAAAAAA CAATTGTTGC TTTGTTTACT
TGCAGTACAT TGGTGTTTTT ACCGTATATA GTATATGGAG GAACAACTAC ATTATATTAT
CTTCCACTGC CGTCAGGGCT TCTTGTCGGA GCAGGGTATT TGTGGGGAGA TAATTATCTT
TCGGCTATTA CTGAAGAAGG AGTGGATAGA ATAATATTAT TTCAAAAAAT TAGTAAAAAT
ATGATTACTT TATTGATGTT AATGTTTGTA ATTGAAATTG TATTTCTTTT TTTGACTTGC
ATTGTGAAAT ATTCAAGGCA TACTTTTCGC CTAAATAATT TCGGTAATAA AATACGTAAG
TTCTCATGTG CTTTATGTGT GTTAACCATT CTGCTTTTAA TTCTTACAGG TTGCCGGACA
GAAATGAGTG AAAAAGATAA TTTCACTTTC AATGCTTCGG AAGAATGGAG ATGTGTAAAA
ACTGATGAGT ATGTAATATC TTTGGATCCG GAAAAAAATA TAATAACTGC GGAGAATCTT
GATAAAGGAG AGCAAATTGT TTTGCCAAAG GATCCTTTCA GACAGGATAT ATATGAAACT
GAAGACAGAT CATTAAAACG AGGATACAGA ATAAGGTCGA TATTTGTAAG AGACGGATGG
TGCTATTATT TAAAAGAAAT ATTGCAAACT GATGGATTTC AAATATATGG TATTGATTTA
AAAGATTTTA AAGAGGAATT GATTTATAAT GGTATCCAAG AGAATGATAA AAATTTTTTT
GGAGCATTTT TCGATAGAAG ACAGGATCAA ACTAGTCTGC CGTCAGTTGA TTATTTTTTT
CTCAATGCGA ATTATATATA TTATCTGCAA GGCAAAAGAC TGGTTAGAAT TGACAGAAAT
ACAAATTCAG AAACAGTATT GGCATTGGAT GTGAAAGAAA GAAGTGCCGT TTATCATAAC
GGGGATATTT ATTACATAGA TACTCTTAAC AGGCTTAGTG TGTATAAGGA AGAAGATGAA
ACCGTCAATA AAATAGATTC TGTTTATACT GATCAGATCA GTATTGAGGG AAAGCGTATC
AGGTATACTG ATTTGTTAAA TGATAAAAAT ATTGGATATT ATGATATAGA AACCTGA
 
Protein sequence
MIVFEWKKVL IKQKGLLCIG IMFLLKIALL FYQGYDSNSI INSNEEGYKY YINLYQGKLT 
EEKEKSIKAE YDSVTNAQAY LEDLSHKKRN GEIGFKEYEE KSKKYYECLK NADVFNLVYN
QYYYAKEAPD VRYIIDWRGW QTLLSHDAPD VLLIVCLLIV MVPLFCNEYE SGMYSLLVSS
VRGKYKVAIV KLLSAFVLSA GIVILFSVAE YICVDFMVGL DNSTFPLQSL KFFEYSDWYV
SLRQAFVIIV LFRIVGAVLF TAFISVVSVI SKKTIVALFT CSTLVFLPYI VYGGTTTLYY
LPLPSGLLVG AGYLWGDNYL SAITEEGVDR IILFQKISKN MITLLMLMFV IEIVFLFLTC
IVKYSRHTFR LNNFGNKIRK FSCALCVLTI LLLILTGCRT EMSEKDNFTF NASEEWRCVK
TDEYVISLDP EKNIITAENL DKGEQIVLPK DPFRQDIYET EDRSLKRGYR IRSIFVRDGW
CYYLKEILQT DGFQIYGIDL KDFKEELIYN GIQENDKNFF GAFFDRRQDQ TSLPSVDYFF
LNANYIYYLQ GKRLVRIDRN TNSETVLALD VKERSAVYHN GDIYYIDTLN RLSVYKEEDE
TVNKIDSVYT DQISIEGKRI RYTDLLNDKN IGYYDIET