Gene Cthe_1695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1695 
Symbol 
ID4808870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2018315 
End bp2019790 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content31% 
IMG OID640107109 
Productradical SAM family protein 
Protein accessionYP_001038110 
Protein GI125974200 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.192909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAG TTCCAATAAT TTTAAGAGAA AATCTTGTAT CACAAATATA TAATTATAAT 
TCTAATATTA GATTAGATAT GATTGATGAT AATAAGTATA TAATTCTAAA AGAAAATTAC
CATTTACATA ATTGGCCAGA TAAAACTTTA TTAATATTTA ATGGCATCGA GGTTTTATCG
TTAAATATTG AAGGGGCAGA AGTAATAAAC AAGCTCAATG GAGAGCATAC ATTACAGGAA
GTTGTGTCTA ATTCATTTAA AGAGTTAGAT CAGTCGGATA TGAGCAAGTA TTTTTCAATT
AAAATGTTTA TCTTGCAGCT GTACAGGAGA AACATAATAG ATTTTTTAGA ATATAAGCAG
AATAGAGTAA TCAAAAATAC CGGAAGCAGA GACTGGTATG TACCTTATTC CTGTTCTATA
GAAATAACTA AGCAATGTGA TCTTAGATGT AAACATTGTT ATGGTGAAGC AGGAGCTATG
AGAAACACTC AGCTAACAGA AGAACAAATT TATTCTATTC TAGATAAATT AAGTGATGGG
TGTAATAGTG TAAGCATAAC CGGCGGCGAC CCAATGTGTC ATCCTAAAAT AAAAGAAATT
ATAAAGTATA GCATAGCACG TGGATTTGAA ACAACATTAA TCACGAATGG TATGAGATTG
AATCAAAATT GGGCAAATTG GCTATCAGAA GTTGGAATAA AGCGTGTTAA GCTTAGCTTG
GATGGCCCTA CCAAAGAAAT ACATGATGAA TTAAGGGGAG TTAAGGGAGC TTTTGAAAAA
GTAATACTTG CTATGGGATA TCTAAAAAAT GCAAAAGTAA GCTTTTCAAT TGGTACTGTG
ATAACAAAAA AGAATTTAGA ATATATAAAT ATTATTGCTG ATATTGCATA TGAAAATGGA
GCGAAGTCTA TAGGTTTTGG CAGAGTTGTA AATCATGGAA GAGCAATTAG GGAAATGAAA
GATGCAAGAG AAAGTGATTT AGATTCAATA ATAAAGAAAG TAGATAAAAT AATGAGAGAT
TATAAAGGAA AAGATTTTTT GGTTACATAT GAGGAGGACG GAAATTGGAC AAGTAGTTTT
CTGGATAAAT GCCCAAGTCT TGAAGAATAT TATTTATATA GAAATAACAA TGTCAAATGT
AGTTGTAATG GGTGCGGAGC GGGATCGAGA TTGTTGTTTA TTGAGGCAAA CGGAAACATA
AAACCATGCA TGATGAGTAC ATTTACAATA GGGAAAATAA ATCATGGTGA GGATATGGTA
AATATAATTA ACAAGTCAAC CAATGAATGT TTTTCAAATT TAGAGAGTCC GGACCTAAAT
ACTTGTAAGA ATTGTGATTA TGTCAGCAAT TGTTTAGGCT GTATTTCTCA AGCTATAACA
AATTCTAGTT TAGTTGAATG TAGGTGGAAA AAAGAAATAC TTAAACATGA GTTGAAAATG
AAAGAAATAT TAAGTGGTGA GGTAGGTTAT GTATAA
 
Protein sequence
MMKVPIILRE NLVSQIYNYN SNIRLDMIDD NKYIILKENY HLHNWPDKTL LIFNGIEVLS 
LNIEGAEVIN KLNGEHTLQE VVSNSFKELD QSDMSKYFSI KMFILQLYRR NIIDFLEYKQ
NRVIKNTGSR DWYVPYSCSI EITKQCDLRC KHCYGEAGAM RNTQLTEEQI YSILDKLSDG
CNSVSITGGD PMCHPKIKEI IKYSIARGFE TTLITNGMRL NQNWANWLSE VGIKRVKLSL
DGPTKEIHDE LRGVKGAFEK VILAMGYLKN AKVSFSIGTV ITKKNLEYIN IIADIAYENG
AKSIGFGRVV NHGRAIREMK DARESDLDSI IKKVDKIMRD YKGKDFLVTY EEDGNWTSSF
LDKCPSLEEY YLYRNNNVKC SCNGCGAGSR LLFIEANGNI KPCMMSTFTI GKINHGEDMV
NIINKSTNEC FSNLESPDLN TCKNCDYVSN CLGCISQAIT NSSLVECRWK KEILKHELKM
KEILSGEVGY V