Gene Cthe_1431 details


Gene Information

Locus tag: Cthe_1431
Symbol:
ID: 4810581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1751588
End bp: 1753543
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 39%
IMG OID: 640106854
Product: hypothetical protein
Protein accession: YP_001037855
Protein GI: 125973945
COG category:
COG ID:
TIGRFAM ID:
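
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1751588
END_BP = 1753543
PROTEIN_LENGTH_AA = 651


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1956
print(expected_protein_length(length_bp))  # 651
```

Both results agree with the Gene Length (1956 bp) and Protein Length (651 aa) fields.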


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGTTT TTCTCTTGAT TGCATCATTC TTAACGGGAT TTTGCCTGCT TAAAAAATTA 
ACTCAAATAA AATCCCCGAT AATGATTATT TCTGGTTCTT TTCTAATAGG GTCCGTTTTT
TCCGGCACAC TGCTATACTG GCTTGATATA CTTTTCGTTA AAACCCTGAG CAACTACTAT
CTCAGCAATA TAGCATACCT CGTAATATCT TTTGCTTTTA TAGCATACGT ATACCGAACA
AACAGCAAAA TATTCAAAGA GCTTCTTGAC ACGATAAAAG AGTTCTGCCG TGACAGAGTT
GCAATAATTT GCTTTATCGC TTTTGTTTTA TTTTCAACGT GGTTCAATTA TGATACCTTC
AGTTTGTCAA ACGGAAGCAT CACCATGTCC GGGGGAGCAT GGAGCGATTT AACTCTTCAC
AACGGATTTG TACGTTCCAT AAGCATTGGC CAAAACATTC CGGTTGAGCA TATCTTTTAC
GCCAATACTC CGGCAAAATA TCACTTTCTG TTTGACTACT ATGTGGGTAA AATAACCCAA
ACCGGCCTGC ATTCGGTTCA CGCCTTGAAT CTTATGTGTA TTTTGAGTCT TTCTTTCCTG
CTGTTAATGA TATTTCAGTT TGGAAGAACA GTATTTAAAA ATGATGCCGT GGGAATTCTC
GGCGCCCTCT TTTTATTGTT CCACAGCTCA ATAAGCGGTT TAAAGTGGGT TGCCGAAAAC
TGGGGCTGGG ATATATTCAA GAAAATGTAT GAAAAAACCG GCTGGCTGGC TTCAACCATG
TTTGAAGGCT GGGGGCTTTT TAACCTGAAT GTTTTCGTAA ATCAGCGGCA CTTTGCCTTC
TCCTTGGCAT TTCTTGTTTT CATCGTTACA TATGTGATTT CTATGTATGA AAATGAAAAA
GAGACAAAAA ATCCGGAACC GGGAAGCGGT GAAATAATCC CATTGAACCT GCGTAACGAT
TATTCATGGC TTTTATCAAG CTGTCTTATA GGAATTGCTG TAGGTATCAT GCCTTACTGG
AACTCTGTTG TAAACACGGC TCTGCTTTCA TTTTTGGGAC TGTATACAAT TATAAATATA
AGAAAAAAAG ATGTCTTTAT TCCAATGTTT ATCTCAACCG CCATAGCCGG ACTTGTTTCA
CTGCCGCAGC TTCTGAGATT TAAATCAGGT GCTTCGAGCC TGACAGAATA TCCAAAGTTT
CATATCGGCT ATGAAGTGGG ACGCTTTGAC ATATTGGATT TAACAGAATT TTACTTCAAA
GTCCTGGGAT TAAAATTAAT TATAATTGTT ATAGCATTTT TGATTGTACC CAACAGAAAA
AAAATACTGT TCCTTATCCT TTCAGTGCCG TTTGTACTGG CCAATTTACT GCAGCTTGGC
GTGGTGCTGT ATGACAATAA CAAACTCATG ATTTCATCAC TCATATTTAT AAACTGCCTG
GCAGCTTACT ACTTAGTAGA ACTTTTCCGC CAAAAACATG TAATACTTAA ATTCATATCC
GTTATCCTTT GTCTCTGCCT TATGATTGCC GGAGTGCTTG ACTTAATGTC AGTCAAGAAT
CTTCCCAAAG TCAATGTGGC AGACAAATCA GACTTCACAC AGTGGATTAT TGAAAACACC
GAACCGGGCT CAACTTTCCT TACTCTGCCG ACCATACAAT ATAATGACAA CGCAGTATCC
AACATACTGA TGGCAGGCGG TAAAATGTAT GTACATAATG CAGCCGACTC GGCATACAAG
CTCGCCGAAC GCTTCAACAT TCTAAACACC ATATTAAGGG GCGAAGAAAG TTTTGAAAAA
ATAAAAAGCA TTATTGAGCA GGAAGGTATT GACTATATCG TGGTCAGTCC GGAACTCAGA
CAAAGCCAGG AATACCCGGT AAACGAGGAA TTTCTCAAAC AAAACTTTGT AACAAAATAT
GATTTTAACG GCATTACCGT TTACTCAATT TATTAG
 
Protein sequence
MFVFLLIASF LTGFCLLKKL TQIKSPIMII SGSFLIGSVF SGTLLYWLDI LFVKTLSNYY 
LSNIAYLVIS FAFIAYVYRT NSKIFKELLD TIKEFCRDRV AIICFIAFVL FSTWFNYDTF
SLSNGSITMS GGAWSDLTLH NGFVRSISIG QNIPVEHIFY ANTPAKYHFL FDYYVGKITQ
TGLHSVHALN LMCILSLSFL LLMIFQFGRT VFKNDAVGIL GALFLLFHSS ISGLKWVAEN
WGWDIFKKMY EKTGWLASTM FEGWGLFNLN VFVNQRHFAF SLAFLVFIVT YVISMYENEK
ETKNPEPGSG EIIPLNLRND YSWLLSSCLI GIAVGIMPYW NSVVNTALLS FLGLYTIINI
RKKDVFIPMF ISTAIAGLVS LPQLLRFKSG ASSLTEYPKF HIGYEVGRFD ILDLTEFYFK
VLGLKLIIIV IAFLIVPNRK KILFLILSVP FVLANLLQLG VVLYDNNKLM ISSLIFINCL
AAYYLVELFR QKHVILKFIS VILCLCLMIA GVLDLMSVKN LPKVNVADKS DFTQWIIENT
EPGSTFLTLP TIQYNDNAVS NILMAGGKMY VHNAADSAYK LAERFNILNT ILRGEESFEK
IKSIIEQEGI DYIVVSPELR QSQEYPVNEE FLKQNFVTKY DFNGITVYSI Y
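
The sequences in this record are printed in space-separated blocks of 10 residues, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, with a GC-fraction helper and a stop-codon check. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the standard one for translation table 11. Only the first and last lines of the gene sequence are used here, to keep the example short.

```python
# Stop codons under translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text: str) -> str:
    """Collapse a blocked sequence into one uppercase string without whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGTTTGTTT TTCTCTTGAT TGCATCATTC TTAACGGGAT TTTGCCTGCT TAAAAAATTA"
last_line = "GATTTTAACG GCATTACCGT TTACTCAATT TATTAG"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(len(head))             # 60
print(head.startswith("ATG"))  # True
print(ends_with_stop(tail))  # True
```

Applying `join_blocks` to the complete gene sequence and passing the result to `gc_fraction` would allow a comparison with the GC content field, which the record reports rounded to a whole percent.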