Gene Cthe_1182 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1182 
Symbol 
ID4810134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1410808 
End bp1412076 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content40% 
IMG OID640106604 
Producthypothetical protein 
Protein accessionYP_001037607 
Protein GI125973697 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.114028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTATA TAGTATATCT TTTTATGCTG GTATCCATGT TTTTATATAC ATACATATTT 
GGCGATGAAA CAAGTATGTT GATGCTCTAC ATGCTGATTC TTTCCCCTGT TTTGTCTTTG
CTTCTGTCTT ATGCATCGCT TAAAAGTCTT GAATTTTCAA TTGATGAGAA GGTTCATGCC
TCACAGGTTG AAAAAGACGG TGTTGTGGGA GTAACGGTAT TACTTCAAAA CAAATCTTTT
GTGCCGATAC CGATTATTGA TATATCGTTT GCTGTTCCGC AAAACTTGAT TCCTCTGGAC
AATCCCAGGC CTATTGTGTC TTTGGGACCG TATAAAACTC AGATAATCCA TTTGCAGTAC
AAAGCAAAGT ACCGTGGAGT GGCGGAAATT GGAGTCAGGG ATATTAAAAT AAGAGACTTT
CTGGGGTTTT TTAACTTTTC TTTGCTAAAG AAACAGAATA AAGTGGAGAG TACCAGAGAA
ATAACGGTGT TAAACAAGAT TTCCAGGCTT AAGATGAACA GTGTTTTACT GCTTGAATCA
ATTCTGGCTG CCAATGAAGA AACAGGCGCC GCTACGAACG ATTTTAATTT TTTAAGCTGC
TTGAATGGAG AGCCGGGGTA TGAATTTCGT GAATATCAGC CCGGAGATCC CCTTCACAAG
ATTCACTGGA AACTTTCGGC AAAAACAGAC GTGTTTATGG TGAGAAAAGA TGAAGGACGG
GGTATTCCTA GAAAAAAGCT GGTACTTGAT CCTGTTGCCG TAAAGGGTCC AAAATCAAAA
GCCGGAAGTG TCGTTGAAAT AGAGGATAAA ATTTTAGATG CCCTTATATC AGTTGTTGAC
ATGTTGGTTA GAGCGGGAAG AGATGTGGAA GTGTGGCTTT TGGAACATGG AGAATGGATG
AGCCATTTAG TCAAGGACAG GGATGAGATT GTAGAAATGC AGCACAGACT TGCATCATAC
AAATTTCTGC ATTCAAGAGA CGAACTTGAA AATGAACGTC TTCCTGTGTC CACCATTACA
TTGCAGGACA GTAGCGGCAG GATTTTTGCC GGAGGAGATG CCATGATTTT TACGGCTTCC
CTTGACAAGG AACTTTCTGA AATAATAGAG GGAATGCAGG AGTTGAAAAT GACGGTGGAT
TTGGTTGCAA TTAAGAATGA AAGAGATGTT GAAAAGAGCG AAGGTTTCGA AAAGAGAGAG
CACAAAAGCA CCAAAATGAA CCTGTGGACG ATAGGACTGA CGGACGATAT TTCTGAAGTT
TTGGCATAG
 
Protein sequence
MSYIVYLFML VSMFLYTYIF GDETSMLMLY MLILSPVLSL LLSYASLKSL EFSIDEKVHA 
SQVEKDGVVG VTVLLQNKSF VPIPIIDISF AVPQNLIPLD NPRPIVSLGP YKTQIIHLQY
KAKYRGVAEI GVRDIKIRDF LGFFNFSLLK KQNKVESTRE ITVLNKISRL KMNSVLLLES
ILAANEETGA ATNDFNFLSC LNGEPGYEFR EYQPGDPLHK IHWKLSAKTD VFMVRKDEGR
GIPRKKLVLD PVAVKGPKSK AGSVVEIEDK ILDALISVVD MLVRAGRDVE VWLLEHGEWM
SHLVKDRDEI VEMQHRLASY KFLHSRDELE NERLPVSTIT LQDSSGRIFA GGDAMIFTAS
LDKELSEIIE GMQELKMTVD LVAIKNERDV EKSEGFEKRE HKSTKMNLWT IGLTDDISEV
LA