Gene Cthe_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0114 
Symbol 
ID4808740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp145686 
End bp147005 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content41% 
IMG OID640105525 
Producthypothetical protein 
Protein accessionYP_001036548 
Protein GI125972638 
COG category[S] Function unknown 
COG ID[COG0391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01826] conserved hypothetical protein, cofD-related 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATG ATAATTGGTT TAGAAAAAAA ATTGCAGGAT ACCGATGGTG TCTTTTGATC 
GTTTTCGGGA TTATACTGAT TAGTGCGGGC ATGCTTTTGG CATACAGACA TGACAATACT
TTTGACATCT TTTGTTCTGT ACTTCTTTTT GCGTCAGGAT GTATATTTAT TGTTGTTTCG
ATACGGCTTA TTGCAATTGA CACGGTGTCA AAATACGCCC AAAACGGTAT AAATGTAGTA
TGCTGCAAAG ACAGGGACGA TAATTTGTAT GAAAAAGAAT TTCTTGACAA AGGGCCGAAA
ATAGTTGCAA TAGGTGGAGG AACCGGACTT TCAACCATGC TGAGAGGGCT TAAGGAATGC
AGCTCGAATA TAACGGCCGT GGTTACGGTT GCCGATGATG GGGGAGGCTC CGGGATTCTA
AGACAGGACC TTGGGATACT TCCTCCCGGG GATATCAGAA ACTGTATTTT GGCCCTTGCC
AATACCGAGC CTATTATGGA AAAACTGCTT CAGTACAGAT TCCAGGACGG AATGCTGAAA
GGACAGAGTT TTGGAAATCT GTTTCTTGCA GCAATGGATG GTATTTCTTC GAGTTTTGAA
CAGGCTGTCC AAAGAATGAG TGATGTACTT GCAGTAAAAG GGAGAGTTCT TCCGGTTACG
CTTGAGGACA TTCAGCTGTG TGCGGAGCTG GAAGACGGAT ATGTTATCAC CGGAGAGTCA
CAAATTGGCA ATCATAACAG CTTTCACCGT TGCGCAATCA AGAGGGTGTA TTTGGAACCC
GGAAAGGTAA AACCCCTGGA TGAGGTGATA GAAGCAATCG GAGAAGCGGA TGTAATTGTG
TTGGGGCCGG GAAGTCTCTT TACAAGTATA ATTCCCAACC TTTTGGTTGA CGGTGTGTGT
GATGCGATAA AAAAATCAAA GGCTTTGAAA ATATATGTGT GCAATGTCAT GACCCAGCCG
GGGGAAACCG ACGGATATAG CGTTTCGGAT CATATAAAGG CACTTGAAAG GCACTCTTTT
GAGGGAATTG TTGATTACTG CATTTTTAAC ACTGCTGATA TACCTGAACT ATTGAAAAAG
AAGTATAGTG AGGACGGAGC ACAAATTGTC AGAGTTGACT ATGATGAGTT GGATAAATTA
GGCATAAAAT TGCTGGGAGG GGACTTTGTC TGCATAACTA ACGGATATAT AAGACATGAT
ACAAAGAAAT TGGCTCAGGC CATCATGAAC CTTGTTATTG AGAATGTATT TGGAAAAGAC
GACAGAAAAT CATCCGGTTA TGTAAATACA ATGAAGCAAT TTAAAAATAT AGTCGGATAA
 
Protein sequence
MKNDNWFRKK IAGYRWCLLI VFGIILISAG MLLAYRHDNT FDIFCSVLLF ASGCIFIVVS 
IRLIAIDTVS KYAQNGINVV CCKDRDDNLY EKEFLDKGPK IVAIGGGTGL STMLRGLKEC
SSNITAVVTV ADDGGGSGIL RQDLGILPPG DIRNCILALA NTEPIMEKLL QYRFQDGMLK
GQSFGNLFLA AMDGISSSFE QAVQRMSDVL AVKGRVLPVT LEDIQLCAEL EDGYVITGES
QIGNHNSFHR CAIKRVYLEP GKVKPLDEVI EAIGEADVIV LGPGSLFTSI IPNLLVDGVC
DAIKKSKALK IYVCNVMTQP GETDGYSVSD HIKALERHSF EGIVDYCIFN TADIPELLKK
KYSEDGAQIV RVDYDELDKL GIKLLGGDFV CITNGYIRHD TKKLAQAIMN LVIENVFGKD
DRKSSGYVNT MKQFKNIVG