Gene Cthe_2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2018 
Symbol 
ID4810988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2396459 
End bp2397694 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content36% 
IMG OID640107428 
Producthypothetical protein 
Protein accessionYP_001038423 
Protein GI125974513 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0359073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGATC CGTTGTGTGA TGAAAGTTAT TTGTTGAAAA CAATAGAGTC TAACCGAAAG 
TTCATTTCTA AAAGGAAAGA AAAGATTATT GGATTAAAAG CTGATATAGA GAATGGTATA
CAAAGATATC CAAGAGATAA CCAAAGTATA ATTTATATTA CGTTTTCTCA AATGTTTATG
TATGGCATGA ATATGCTTTT AGCAAAATAT TCCTTGGGCA ATCACCCTGA TACAATGATA
GATGACTATT TAGACAACAT AACATATTTA GAGAATTGCG GTGAAGAAGA GGCCGGCTAC
ATTAACCTTT TATGGATGGT TGGACTGGGT ATCCTTTTGG AAATGGATAA AGAAGTGTTA
AAAAGACTGG CAAGAGTTAT AGAAAGGCAA AGAATAGAAG ACGCACTTAT GGATTTTTTA
TTGAAAGCTT GTGATATAGG TTGGAACCAC AGTACAACGA AATATGAAAA AAAGAACCCG
TATGAAAAGA CAGTAGAGAT TATAAAAATA GCATTACACG ACAAAGACAA GGAAGCGGCA
TCTAAAAGGC TTGAAAAATA TATGGAAAAA GAATGGTTCA AGGGACATTA CGACTTTGGG
TGGAGGAATG CCCATAAGGA ACCTGGCTAT TATGGTTTTT GGAGTTTTGA TACAGCGGCA
CTGGCCAAGA TACTGGGGCT GGACGACAGT GCGTTAAAAG ACAACAACCA TTATCCTTAT
GATTTGGCAC ACTATAAGAA TGGAATGACC TTTGATTTAA GTTGGTATAG TGAACCAAGG
GAAGAGGAAG TCCGGGAAGA AACGGTGGTA TATGGTATAC CGGGTAATCC TTTGTTGGAG
AGGATAATAC CTGGGAGATT CCACAGTTTT GTAAATGAGA TAATAAATGA TTATAAAACA
CTGCCGGACG AAGAGTTTTG GAAGAAGTAT AATTTGAAAG AAATATGGTT TGATGTAGAG
GAGTATAAGG AGGATAATAA AGATAAGAAT TTGTTGGGTA CGATTATAGT GTTCATGCTT
GTGGACAAAG ATTATATTTT GCAGTTGGAT TATAAAGAAG AGTTAATAGA CTATATAGAG
AATATACATA ATTACTGGCC CAAGGAAGAA GTTAAGCTTA TAAGCTTTGA ATTAGACAAT
GACCAACAGT ACTATGCGTA TGTACCGAAG GATGCGGAGG CTGGTTCGTT GTATGAGGTA
AAAGTGACAG AAGTGGAGAA AATAGAGGAG GTTTAG
 
Protein sequence
MRDPLCDESY LLKTIESNRK FISKRKEKII GLKADIENGI QRYPRDNQSI IYITFSQMFM 
YGMNMLLAKY SLGNHPDTMI DDYLDNITYL ENCGEEEAGY INLLWMVGLG ILLEMDKEVL
KRLARVIERQ RIEDALMDFL LKACDIGWNH STTKYEKKNP YEKTVEIIKI ALHDKDKEAA
SKRLEKYMEK EWFKGHYDFG WRNAHKEPGY YGFWSFDTAA LAKILGLDDS ALKDNNHYPY
DLAHYKNGMT FDLSWYSEPR EEEVREETVV YGIPGNPLLE RIIPGRFHSF VNEIINDYKT
LPDEEFWKKY NLKEIWFDVE EYKEDNKDKN LLGTIIVFML VDKDYILQLD YKEELIDYIE
NIHNYWPKEE VKLISFELDN DQQYYAYVPK DAEAGSLYEV KVTEVEKIEE V