Gene Cthe_2347 details

Gene Information

Locus tag: Cthe_2347
Symbol:
ID: 4808981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2798883
End bp: 2800163
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 36%
IMG OID: 640107754
Product: lipopolysaccharide biosynthesis
Protein accession: YP_001038742
Protein GI: 125974832
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
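
The coordinate and length fields in this record are related by simple arithmetic. Assuming the Start bp and End bp values are inclusive, 1-based positions (an assumption about this record's convention), the gene length is End bp minus Start bp plus 1, and the protein length of an unspliced CDS is the number of codons minus one stop codon. A minimal Python sketch with hypothetical helper names that checks the listed values:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2798883, 2800163)
print(length, protein_length(length))  # 1281 426
```

Both results agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields.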


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000937745
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGAGA TAAGCTTGAG AGAATTAATT GAAATAATCA TTAAAGGTAA ATGGATTATA 
GTAGCTGTAA CTGCAGTTTG CATAGCTATT GGCATTGCAG TAAACGCATT TGTTATAAAG
CCGGTTTATG TGGCACAAAC GACGCTGATG ATTTCATCCA TTAAAAAGAG TCAAATAAAA
GAACAGGTAA AGTCAGGAGA TGGTGACGTT AACAATTTTT CATCCTTAAT CGATGACATA
TTGCAGTACC CGGAGATGTC TGTTGATAAC TATAGGGAAC AAGTAAAAAA TCCTGTGATA
CTGGAGTATA TTCGTGAAGA AATGGATATG AAAGATGTTC CCCTAAGTTC AATAGCATCT
AAAATAACCT TAAATGAAAT AAAAGATACA GATTTGATTA CAATAAAGGT GACTGATGAG
AATCCTGAGA CAGCAGCAAA GATAGCAAAT CTGGTTGGCG ACAGGTTTGC AAAACTTGTG
TGCGAAACAA ACCAAAAACG GACTGAGAGT ACACTGGAGT TTATTGAAAA TCAGATGAAA
AAAGAAAAAG AGAACATGGA AAAATTGTTG GGAGAATATA AAAGTTATCT GTTGCAATCC
AGAGGACCTG AAGAGGTTAA GATGGAGCTG GATGCAAAAC TGGAAAAAAT GACTGAATAC
AAAACCCAAT TATCGCAAAT CAAAATTGAT GAGAATGCCA CGAAAGCGTC TTTAGATACC
GCAAAAAATT TGATAAACAA AACACCGCAA AAATTAGTTA CAGATAGTTC GCTTTTAACT
AATCCGTTGC TTTCGGCAGT GATTAAAGAA AAAACAGGTA TTAGTTCGGA AGAACTGGCA
AGCATGAAAA TGTCAACAGA ACAAATAAAT ATTATATACG TTGAATTGTC CAACATAATT
AATGAATTGG AAATTCGGCT GTCAAACCTG GAAGCTCAGA GAATAAATAT CGAAAAGGTC
ATTCAGGAAT GTCAGAAGGA AATTGAAAAT CTCCAGACAG AATATGCGGA AAAACAACAG
GAGTACGAGA TTCTGAAAAA GGAGCTCGAC TTGTCGAAAG AAGTATATAA TGCATATCAG
CAAAAATATA AAGAGTCAAT GATTATGCAG TCTGCAGAGA CAGGCAGATC AAGTGCGGTA
ATAGTATCTG AGGCCATTCC GCCCGCTAAT CCTGTTGCTC CAAAAAAGGC TTTGAATGTG
GCTGTTGCAG GAGTAGTGGG AGTCGGAATC AGTTTTGCTA TAATATTTAT AAAGGAATAT
TTAATTAGAA GCAAACAGTG A
 
Protein sequence
MEEISLRELI EIIIKGKWII VAVTAVCIAI GIAVNAFVIK PVYVAQTTLM ISSIKKSQIK 
EQVKSGDGDV NNFSSLIDDI LQYPEMSVDN YREQVKNPVI LEYIREEMDM KDVPLSSIAS
KITLNEIKDT DLITIKVTDE NPETAAKIAN LVGDRFAKLV CETNQKRTES TLEFIENQMK
KEKENMEKLL GEYKSYLLQS RGPEEVKMEL DAKLEKMTEY KTQLSQIKID ENATKASLDT
AKNLINKTPQ KLVTDSSLLT NPLLSAVIKE KTGISSEELA SMKMSTEQIN IIYVELSNII
NELEIRLSNL EAQRINIEKV IQECQKEIEN LQTEYAEKQQ EYEILKKELD LSKEVYNAYQ
QKYKESMIMQ SAETGRSSAV IVSEAIPPAN PVAPKKALNV AVAGVVGVGI SFAIIFIKEY
LIRSKQ
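
The protein sequence can be cross-checked against the gene sequence by removing the block spacing and translating codon by codon. The sketch below is a minimal illustration, not the tool that produced this record, and the helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It uses the standard codon assignments, which NCBI translation table 11 shares for elongation codons; the two tables differ only in which start codons they permit.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI's TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = clean("ATGGAAGAGA TAAGCTTGAG AGAATTAATT")
print(translate(first_blocks))  # MEEISLRELI
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through `clean` and then `translate` or `gc_percent` gives values that can be compared with the Protein sequence and GC content fields.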