Gene Cthe_1164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1164 
Symbol 
ID4810832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1387940 
End bp1389526 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content44% 
IMG OID640106586 
ProductFAD dependent oxidoreductase 
Protein accessionYP_001037589 
Protein GI125973679 
COG category[R] General function prediction only 
COG ID[COG2509] Uncharacterized FAD-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTAA TAGTTCGAAA TTTAAAACTT TCTTTGGATG AAGATATAGA TGCGTTAAAA 
AAACTTGTTT GTAAAAAAAT TAAAGTTAGT GAAAAGGACT TTAAGAATTT CAGGATAGTA
AAAGAGTCCA TTGATGCGAG GAAAAAACCT TTTATCAATC TTGTCTACTC TGTGATGGTT
GAAATTGAAG GCAAAATAAA GGTAAGGGAA AGTACGGACA TCAGCATTTT AGAGCAAGAG
ACGGAAAAGG TTTTGGTTCC TGGCAGCATA AAGCTTAAAA ACAGGCCTGT GGTTATCGGC
TCAGGTCCTG CAGGTCTGTT TGCAGGGCTT GTTCTGGCCC AAAACGGCTA TAGGCCGTTG
ATTCTCGAAC GGGGAGAATG TGTTGAAAAA CGCACGCAAA TTGTCAACAG GTATTGGACG
ACCGGCGAGC TTGATCCAGA AACCAATGTA CAGTTTGGAG AAGGCGGTGC CGGGACTTTT
TCTGACGGAA AACTTACCAC CAGGATAAAT GACAGGCGCT GCAGTATTGT TTTAGAGGAA
TTTTACAAAT CCGGGGCGCA TGAAGAGATT TTATACAAGG CAAAGCCTCA TATAGGTTCT
GATGTGTTAA AAAAAGTAGT ATCAAACATG CGCAACAGGA TAATTGAATA CGGGGGAGAA
GTAAGGTTTA ATTCAAAAGT TACTTCAATA ATTGTTAAAA ACGGAAGTAT AACCTCAATT
GTGGTAAACG ACAAGGAAGA AATACCCTGT GAGGTTGCAG TCCTTGCAAT AGGCCATAGT
GCAAGGGATA CCTTCAAAAT GCTTTTTGAC AAAGGGGTTG AATTCATACA AAAGCCTTTT
TCAATAGGAG TCAGGATTGA ACATCCCCAG GAGCTGATTG ACAGGGCCCA GTACGGTGAA
GCGGCAGGTC ATCCCAGACT TGGAGCGGCG GATTACCAGC TGTTTCAAAA ACTTGGCGAC
AGAACCGTGT ATTCATTCTG CATGTGTCCG GGAGGCGTTG TTGTGGCATC GGCTTCGGAA
CCGGGCATGA TTGTGACAAA CGGAATGAGT GAATTTGCAA GGGACAAGGA AAATGCCAAC
AGTGCCCTTG TGGTATCAGT GGAACCGGGG GATTTTGGAA GCAGCCATCC CCTGGCCGGT
GTTGATTTTC AGAGAAAGTG GGAAAGGCTG GCTTTTGTTG CGGGCGGTTC CTGCAACCGG
GCACCGGTGC AGAGGCTTGG GGATTTTATC GAAGGAAGAA AATCCACTTT TTTGGGAACA
GTCAAGCCCA GTTATACCGG AGGAACGAAT CTTGCCGATA TTCATTCCTG TCTTCCGACA
TTTGTGACGG ATTCCATAAA AAAAGCCATA CCCTATTTTG ATTCCAAAAT AAAAGGTTTT
GGCATGAAGG ATGCCGTTAT TACAGGAGTG GAGACCAGGA CCTCGTCGCC CGTAAGAATT
CCAAGAGGGG ACACACTTGA AGCAATTGGC ATAAAGGGTT TGTATCCTGC CGGTGAAGGA
GCCGGGTATG CGGGCGGAAT TGTGAGTGCA GCGGTTGACG GCATTAGAAT AGCGGAAAAG
ATAATAAGTA CCTATTCATA CGAGTAA
 
Protein sequence
MKLIVRNLKL SLDEDIDALK KLVCKKIKVS EKDFKNFRIV KESIDARKKP FINLVYSVMV 
EIEGKIKVRE STDISILEQE TEKVLVPGSI KLKNRPVVIG SGPAGLFAGL VLAQNGYRPL
ILERGECVEK RTQIVNRYWT TGELDPETNV QFGEGGAGTF SDGKLTTRIN DRRCSIVLEE
FYKSGAHEEI LYKAKPHIGS DVLKKVVSNM RNRIIEYGGE VRFNSKVTSI IVKNGSITSI
VVNDKEEIPC EVAVLAIGHS ARDTFKMLFD KGVEFIQKPF SIGVRIEHPQ ELIDRAQYGE
AAGHPRLGAA DYQLFQKLGD RTVYSFCMCP GGVVVASASE PGMIVTNGMS EFARDKENAN
SALVVSVEPG DFGSSHPLAG VDFQRKWERL AFVAGGSCNR APVQRLGDFI EGRKSTFLGT
VKPSYTGGTN LADIHSCLPT FVTDSIKKAI PYFDSKIKGF GMKDAVITGV ETRTSSPVRI
PRGDTLEAIG IKGLYPAGEG AGYAGGIVSA AVDGIRIAEK IISTYSYE