Gene Cthe_1148 details


Gene Information

Locus tag: Cthe_1148
Symbol:
ID: 4810816
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1364867
End bp: 1366150
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 31%
IMG OID: 640106570
Product: hypothetical protein
Protein accession: YP_001037573
Protein GI: 125973663
COG category:
COG ID:
TIGRFAM ID:
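The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1364867, 1366150)
print(length)                  # 1284
print(protein_length(length))  # 427
```

Both values match the Gene Length and Protein Length fields of the record.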


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00179115
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATTGCG TGTTTATTAC ACCAGATAAA ATGGATGGAA AGCGGATTTC TTATGATTAT 
GTAGTACAGC AATATGAAAT ACAGCGATTA TTACAGAGGC GTCCCCTTAT AACATTAATT
GATATTTGCA AAGAAATAAC AAGTGGGATT CGGGTTAAGA AAGAGTATTA TACAGATAAA
AACGGATATA AGATTATTGC TCCTGGAGAC ATAAGAAATG AAGTTATATA TATTAATGAA
CTTAAAGTAG TACAGCCTGA AGTAGTAAGA GAAAAAGACA TTATAAATAA TGGAGATATA
TTGATTACAG CCTCAGGTAA ATCAGGACAG GTAATTTATG TAAATGAAGT ATTAGAAGGA
TGTGTAGTAA CATCGGATAT TATTAAAATT ACATTAAGGG ATAGGGATAA AGGTATAAGA
TTATATAAGT TTTTAAAAAG CAGTATAGGA CAAATGCTGT TAAACTCCAT AAAAATAGGG
ATTTTAAATA AAATTTTTGT GGAGGATGTT GAAAATTTAT TAATTCCTGA AGACTTTGAT
ACATATCAGG AAGATTGTTC TGATGATTCT ACAGTATATG CAGAGGCTGA AAAACTATAC
AGGTCTGCAG AAAACATATT TTACAGGGTA TTTGATTATA AAGGTGAAAA AAAGAATCTT
AAACACTTTT ATGTGACAGA ATACCTTGAC AGTCACAGAT TAGACCCTGA GTACTACTCG
AACTTTTACA CTGAACTGTA TAGGGTAATT CACAAGAATT TTGATGATGT AAAATGGGAG
GAACTTGGAG AACTTGTAGA AATAAAAAAA GCAGATAAGC CCGAAATAAG TAAGAATCAA
AAAGTAAAGT ATTTTTTGTT AGCGGATATA GACCCGAATT TTTCAATCAT AAAAGAAACA
CATGAGGATT TTTATGGAAA TTTAAGCAAT CGGATGAGAT ATATTGTGAG GAGGGGAGAA
ATAGTAACTG CCAAAGGAGG CAGTGCTACA GGGACTAAGG GACATGCAAC GGCACTTATT
ACAGAAAAAT TTGATGGTTT GGTTACGACA GATGCTCTAT ATAACTTGGT CCCCAGAAGA
ATTAATCCTT ACTATCTTCT GTTTTTGTTT AAACAGCCAA TAATTCTAAA CCAGGTAAAC
ATGTTTACTA AAGGGACACT ATATAAACTC ATTCAAAGAA ATGACTTTGA AAAAATCAAA
ATTCCAAGAC TGGAAAGTAG TTTGGAAGAA CAAATAGTAG ATAAAATGAT GAATTATTTA
AGTGTGTTAC AAAACAAATT TTAA
 
Protein sequence
MNCVFITPDK MDGKRISYDY VVQQYEIQRL LQRRPLITLI DICKEITSGI RVKKEYYTDK 
NGYKIIAPGD IRNEVIYINE LKVVQPEVVR EKDIINNGDI LITASGKSGQ VIYVNEVLEG
CVVTSDIIKI TLRDRDKGIR LYKFLKSSIG QMLLNSIKIG ILNKIFVEDV ENLLIPEDFD
TYQEDCSDDS TVYAEAEKLY RSAENIFYRV FDYKGEKKNL KHFYVTEYLD SHRLDPEYYS
NFYTELYRVI HKNFDDVKWE ELGELVEIKK ADKPEISKNQ KVKYFLLADI DPNFSIIKET
HEDFYGNLSN RMRYIVRRGE IVTAKGGSAT GTKGHATALI TEKFDGLVTT DALYNLVPRR
INPYYLLFLF KQPIILNQVN MFTKGTLYKL IQRNDFEKIK IPRLESSLEE QIVDKMMNYL
SVLQNKF
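
The protein sequence follows from the gene sequence under translation table 11, in which the leading GTG codon is read as an initiator methionine rather than valine. The sketch below illustrates this on the first and last blocks of the gene sequence. It is a minimal example and not the database's own pipeline: the codon table is the standard amino-acid assignment used by table 11, and the `START_CODONS` set is a simplified assumption covering only ATG, GTG and TTG.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in TCAG codon order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplified assumption: only the most common bacterial initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("GTGAATTGCG TGTTTATTAC ACCAGATAAA"))  # MNCVFITPDK
# Final line of the gene sequence, which ends with the TAA stop codon.
print(translate_cds("AGTGTGTTAC AAAACAAATT TTAA"))        # SVLQNKF
```

The two outputs match the first ten and last seven residues of the protein sequence listed above. Note that the internal GTG in the second fragment is translated as valine, since only the first codon is treated as an initiator.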