Gene Cthe_0254 details


Gene Information

Locus tag: Cthe_0254
Symbol:
ID: 4808602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 311724
End bp: 313064
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 42%
IMG OID: 640105666
Product: hypothetical protein
Protein accession: YP_001036686
Protein GI: 125972776
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
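
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. Neither assumption is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 311724
end_bp = 313064
gene_length_bp = 1341
protein_length_aa = 446

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not
# counted in the protein length.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span (bp):", span_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons excluding stop:", coding_codons)
```

Under these assumptions the span, the gene length, and the protein length in the record agree with one another.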


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGAAGG GTGTCTCAAA AACGGGTATT GTGTTTTTGT TACTGATTTG TGTGGGTTTT 
CTTCTTGCCA ACAACATACT GATACTTGTT TCCATTATTC CATTTACGTT AATGGCTTTT
GGGTATTATT TAAAAATGCC CGACGGTATC AGGGTTGACA AGACTGTGTC CAAAAACAGG
GTTACGGTCG GAGAACTGCT GGAGGTATCT GTAAGAGTAC TGGTAGAGTC GGGATTTGGT
TCAATGGAAA TATGCGATAT TGTGCCCCCG CATTTTGAAC TGGTGGAAGG AACTAATTAC
TGTGCAGTGT GGAAAGGGTT TGAGCCGAAA GAAATACTCT TAAATTATAC TGTCCGCTGT
ACAGCATCGG GAACTTATAC ATTCAGGACC ACTGGCTGGA GAGCCAGACA TGCTGTGGGA
GCTTTTTCGA TAAACAGAAA ATATGAGACG GATTTGACGG TAGAAGTGAC CCCAAGGCTT
ATTGAACTCA AGAAGGTAAG GGGCATGTCC ACGGTGTGCA AAGTTCCGAT GCCGGAGGGA
GCTTTGGCAA GCATGGGAAT GACAACCCAG GAATTTAAGG AACTCAGGCT CTATTCTCCC
GGTGACCCGT TTAAGGCGAT AAACTGGAAG GTTACGTCGA GAAATTTGGT CAGGGGCAGT
ATCTGGCCTG TGGTAAATGA GTTTGAAAAA GAAGGAAAGA AGTCTGTGTG GATATTTTTG
GATACGTCAA AAATAATGTC CTTCGGTTCC AACATAAAAA ATGTCAAAGA GTATTCTGTT
GAGGCTGTAA ACAGTCTTAG CGACTATTAT ATAAAACACA ACTGCAGTGT GGCTTTTCAT
ACCTTTGGAG GAAGCGATGT CTTTATAAAT CCCGGGTCGG GAAGGCAGCA GCATTACAGG
ATTTTAAGGG AGCTTATGAA AATAAGGAAT TTCACCGGAG TATCCCGGGA AAATTCCGGG
GAGCGTCAAA AAAACTCCAA GGAGCACAAA AAACTGGAAG AGGCAGTGTA TTCGTGCAGA
AATTATTTTA ACGGACTGAG ACCAATGTTT ATAATTATTA CAAGATTTTG CACAAAAAAT
TCCGAAGAAA TTTTCAAAGG TATAAACCTT ATGTCAAAAT ACACTTCGCT TCGAAAGGGC
TATGTTCCCA GCATAATGTT GATAAATATA ATGGGGTATG GTCTTATGGC TGAAAATGAG
AATGAAATGA TGGCGGCAAA TCTTCTTGAG GCCATGAACA AAGTGCTTTC GGAAAAGATA
AGAAAAAATT GTATCTGGAT TGACTGGGAC CCTAACAAGG AAAGCCTTAC AGGTGCATTA
TTAAAACAGG TGGTGGGTTA A
 
Protein sequence
MPKGVSKTGI VFLLLICVGF LLANNILILV SIIPFTLMAF GYYLKMPDGI RVDKTVSKNR 
VTVGELLEVS VRVLVESGFG SMEICDIVPP HFELVEGTNY CAVWKGFEPK EILLNYTVRC
TASGTYTFRT TGWRARHAVG AFSINRKYET DLTVEVTPRL IELKKVRGMS TVCKVPMPEG
ALASMGMTTQ EFKELRLYSP GDPFKAINWK VTSRNLVRGS IWPVVNEFEK EGKKSVWIFL
DTSKIMSFGS NIKNVKEYSV EAVNSLSDYY IKHNCSVAFH TFGGSDVFIN PGSGRQQHYR
ILRELMKIRN FTGVSRENSG ERQKNSKEHK KLEEAVYSCR NYFNGLRPMF IIITRFCTKN
SEEIFKGINL MSKYTSLRKG YVPSIMLINI MGYGLMAENE NEMMAANLLE AMNKVLSEKI
RKNCIWIDWD PNKESLTGAL LKQVVG
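
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record. The helper names (clean_sequence, gc_content, translate) are hypothetical. The codon table is the standard one, which translation table 11 shares for its codon-to-amino-acid assignments (table 11 differs in which start codons it allows). The example input is the first 30 bases of the gene sequence shown above.

```python
# Standard codon table built in the usual T, C, A, G order.
# Assumption: for an ORF that starts with ATG, this matches translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = clean_sequence("ATGCCGAAGG GTGTCTCAAA AACGGGTATT")
peptide = translate(fragment)
print(peptide)
print(round(gc_content(fragment), 3))
```

Pasting the full gene sequence into clean_sequence and passing the result to translate and gc_content allows the protein sequence and the GC content field to be checked in the same way.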