Gene Cthe_3122 details


Gene Information

Locus tag: Cthe_3122
Symbol:
ID: 4809685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3684732
End bp: 3686810
Gene Length: 2079 bp
Protein Length: 692 aa
Translation table: 11
GC content: 37%
IMG OID: 640108555
Product: S-layer-like domain-containing protein
Protein accession: YP_001039510
Protein GI: 125975600
COG category:
COG ID:
TIGRFAM ID:
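
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS length counts the terminating stop codon; both assumptions are inferred from the numbers in this record, not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(3684732, 3686810)
print(length)                  # 2079
print(protein_length(length))  # 692
```

Under these assumptions the computed values agree with the Gene Length (2079 bp) and Protein Length (692 aa) fields.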


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTTT TTAGGGGGAG GCTAATATCT TTAGCGCTTG TACTTGCAAT TATAATGCAA 
TTATTTGCAA ACTTTGCTTT TGCAGAGCCG ACCGAAAAGA ACACTATATT TTTCAGAGAC
ATATCAGGAC ACTGGGCGGA AGAATCAATC AAGTACCTGG CAGAACGTGG CTTGGTAAAA
GGCTATTTGA CTGACAACGG ATACATAATT AAACCTGATA CGTACATAAC AAGAGCTGAA
TATCTTACCG TTCTTTTAAA TACAAAACCG AATCTCAAGG TTGTAAGTGA TAAAGTAAAA
TTATTTGTTG ACGTAAAAGA TAATGATTGG TACAAAGAAG TGGTTGATAA AGCATCAAGT
AATGGAATTC TCGAAGGATA TCCCGATGGA AGTTTCAAGC CTAATAATCC GATTACCAGG
GCAGAAATTT CTGCGTTAAT GGTAAAAATC AACAATTGGA ACGAAAAGGA TATTACTGAG
GATGCTGATG TGTTTTCTGA TGTTCAAAAG AATTCATGGT ACTATGCAGC TGTTTTAACG
TGTAAGTTAA AGAAGATAAT AAATGGATAT CCGGATGGCA CTTTCAAACC GGGAAACTAT
GCATCAAGAG CGGAAGCTTT TGCTTTGCTT GCAAATTATG TCAGGAATTT TATTGACAAG
GATTTGGATG AAGAGCCTGA AAAAACTCCC GGTACAACTC ATGACGTGGT TTCACCGACT
CCAAACCCTG GCACACCGTC TTTACCATCT TCCGGTTCCG GTTCAACAAT TGAAAAAGGG
CCGTTTAAAG GTAAAACAGT TGATGACTTT TTATACGTAT CCGGCGGAGA CTTTAAAATT
TTGAATGAAG AACAGAAAAA GATCACCTTT GAAGGTTTGG TTGACAAGGA AAATGTAAAA
AAGATGTACT ACAATGTGGA ATACTATGGA TTCGACACGG ATACGCCCAA AATGGAAAAA
AACAATATTA CAGACGGATT AATTATAGCA GGGCCTGATG AAGAAAACAA AGGAACATTT
AATGAAAATC AAATCCCGTA CAGTTGGGTA CTTAAAGATT TTGAAGTTAA TAAGAGCTAT
TTTAAGATTT ACATTAAACT GGTGATAGAA GACGAATACG GAAATAAACT TACTAAAATT
TTGGCAATTA TAAATGATAA GGATTCTGAC GGGGATGGAT TGTCAGACTA TGAAGAGGTG
TACATATATA ATACAAATCC GTTGAATTAC GATACAGATG GGGATAAACT TTCAGACTAT
GAAGAAATTA ATATTTATGG AACAGATCCG TTAAATTCAG ATACAGATGA AGATGGTTTG
ACGGATTATG AGGAAATGAA AGCATGGGTT ATTTTAGAAT TAGGTACAAT AGTATCAATT
TTTTATAATG CTCCCGAATA CGAAGATATG GGAACTGAAG GCGTGGAGCT GGCAGCTGAA
AAATTCGGCT ATAAAGAAGA GCAGATAATT TTGGGATTGG ATCCATTAAA TCCTGATACG
GACGGAGACG GACTTCCTGA TGGCTATGAA TTCAGGATAT TGGGAACTGA TCCGACACGC
AAAGTAACTT ATGGTGCGAA TGTGCCGGAT GTTGATCTTG ACATGGATAA AGACGGGCTT
TCAAACTGGG ATGAATTTTT ATATGGAACG GATCCATGGT TAAAAGATAC AGATGGGGAC
GGCATTAGTG ATTACGATGA GATTCGTACT TATAAAACAG ATCCATTAAA TCCTGATACA
GACGGAGACG GGCTTGAAGA TGGATTGGAA CTGGAGCAGG GTTTTGATCC TCTAAAGTCT
GATACCGGTA ACAGTGGTGT TTTGGATTCA GAAAAATTTG TCAGCATGAA TTTGTCCGAA
GAAACTCTAA GTGACGTTTT GACTCCTGAA AACAGAGCAA TTCCTTCAAT AAAAGTGTTC
GGCGTACCTG ATTTTGATAT TAATACTACT GTGGAAAATG CGTCTGAGCA TGAGAGTGTT
AAAAACATTG TTGGCGTGGT GGGATTTCCT ATAGACATAA AAACTGATGA GGATTTTGAA
AGTGCGCAGA TAAGTTTTAA AATAAGTGAA GAAGTTTAA
 
Protein sequence
MSFFRGRLIS LALVLAIIMQ LFANFAFAEP TEKNTIFFRD ISGHWAEESI KYLAERGLVK 
GYLTDNGYII KPDTYITRAE YLTVLLNTKP NLKVVSDKVK LFVDVKDNDW YKEVVDKASS
NGILEGYPDG SFKPNNPITR AEISALMVKI NNWNEKDITE DADVFSDVQK NSWYYAAVLT
CKLKKIINGY PDGTFKPGNY ASRAEAFALL ANYVRNFIDK DLDEEPEKTP GTTHDVVSPT
PNPGTPSLPS SGSGSTIEKG PFKGKTVDDF LYVSGGDFKI LNEEQKKITF EGLVDKENVK
KMYYNVEYYG FDTDTPKMEK NNITDGLIIA GPDEENKGTF NENQIPYSWV LKDFEVNKSY
FKIYIKLVIE DEYGNKLTKI LAIINDKDSD GDGLSDYEEV YIYNTNPLNY DTDGDKLSDY
EEINIYGTDP LNSDTDEDGL TDYEEMKAWV ILELGTIVSI FYNAPEYEDM GTEGVELAAE
KFGYKEEQII LGLDPLNPDT DGDGLPDGYE FRILGTDPTR KVTYGANVPD VDLDMDKDGL
SNWDEFLYGT DPWLKDTDGD GISDYDEIRT YKTDPLNPDT DGDGLEDGLE LEQGFDPLKS
DTGNSGVLDS EKFVSMNLSE ETLSDVLTPE NRAIPSIKVF GVPDFDINTT VENASEHESV
KNIVGVVGFP IDIKTDEDFE SAQISFKISE EV
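
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The `translate` helper is hypothetical, written for this example only, and handles only unambiguous A/C/G/T input read in frame from the first base.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; sense-codon assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_60 = "ATGAGTTTTT TTAGGGGGAG GCTAATATCT TTAGCGCTTG TACTTGCAAT TATAATGCAA"
print(translate(first_60))     # MSFFRGRLISLALVLAIIMQ

# Last 9 bases of the gene sequence: two codons followed by the TAA stop.
print(translate("GAAGTTTAA"))  # EV
```

The two outputs match the first 20 residues and the final two residues of the listed protein sequence.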