Gene Cthe_2417 details


Gene Information

Locus tag: Cthe_2417
Symbol:
ID: 4808132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2886534
End bp: 2887622
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 39%
IMG OID: 640107830
Product: abortive infection protein
Protein accession: YP_001038812
Protein GI: 125974902
COG category:
COG ID:
TIGRFAM ID:
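
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2886534
end_bp = 2887622

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its final codon is a stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1089 362
```

Both results agree with the Gene Length and Protein Length fields reported in the record.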


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000014849
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAAGAGT TTTCAATGGA ACAAAGTGAT TTTGACATAC AAAAAGAGGA AGACCGGAAT 
AAGAATACCA AAATGCCACG CATAATTCAG GTTGGAGCTT TGTATTCTCT TGCGGTAATA
CTTATGGTGT TTGTATCCAC AAGACTGCAA ACAGCATTGG GGTTCAATCT CGGAGGGGCT
CTTTCGGAAG TTCTTCTTAT AATGCTGCCT CCGTTGCTTT TTTTGATATT GTTCAAGTTT
GATGTAAAAA AGGTGCTGAG AATAAATAAA ACAGGCTTTA TGAATTTCTT TCTGACCTTT
TGGATCATGT TTTTTTCCAT ACCTGTAGTG GGACTTTTTA ATATTTTGAA CATGCTTTTG
GTTAAGCTTT TGTTTGGTAC TGTGGAAATT ACCCAGTATC CTGTTGGAAG TGATGCCAAA
GGGTTTCTTG TCAGCATTCT GGTTATAGGT GCTTCTGCCG GAATATGTGA GGAACTTTTG
TTCAGAGGGG TAATCCAAAG GGGATTGGAG AGACTTGGAG CAGTTAAATC CATTCTTATA
ACGGCGTTTC TTTTTGGACT TATTCATTTT GATTTTCAGA GGCTTTTTGG AACTTTTCTC
CTGGGGGCGT TGATAGGTTT TCTGGTATAC AGAAGCAATT CCCTGCTTGT TGGAATGTTT
GCCCATTTCA CCAACAATTC CATAGCTGTG GCGGCACTTT TTTTGTCAAT GAAAATGACC
GAGTACGCAG AGAAAATGGG CATTTCCAAT GTATCTGAAA TGAACACGTC CGGTGCGGCG
GATGTGTTTG GTGAGCTTCA AAAGCTTCCT GCTCCCCAGC TTCTTGCAGT AATAATCTTT
TATTTGTTCA TGTTTGTTTT TATGGCAGTA GTTTTTGGGG TTCTTCTTTA TGCTTTTATT
AAAAATACGG CAAAAGATGT TGGGAAAATA AATGAGGATA AATCGAAGAT TAAGGCAGTG
GATTTTATTT CCTTTGTGCC GGGAATTCTC ATAGTGATTT TGATATATGT CTACAACGGT
TTGTCAATGT CAGGTTCTGC CGCTGCCGAA TCTATGACGG AATTTTTTAA AGCCATAGGT
ATAGGTTGA
 
Protein sequence
MEEFSMEQSD FDIQKEEDRN KNTKMPRIIQ VGALYSLAVI LMVFVSTRLQ TALGFNLGGA 
LSEVLLIMLP PLLFLILFKF DVKKVLRINK TGFMNFFLTF WIMFFSIPVV GLFNILNMLL
VKLLFGTVEI TQYPVGSDAK GFLVSILVIG ASAGICEELL FRGVIQRGLE RLGAVKSILI
TAFLFGLIHF DFQRLFGTFL LGALIGFLVY RSNSLLVGMF AHFTNNSIAV AALFLSMKMT
EYAEKMGISN VSEMNTSGAA DVFGELQKLP APQLLAVIIF YLFMFVFMAV VFGVLLYAFI
KNTAKDVGKI NEDKSKIKAV DFISFVPGIL IVILIYVYNG LSMSGSAAAE SMTEFFKAIG
IG
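
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon assignments of translation table 11 (which match the standard code) and simplifies initiation by reading the first codon as M, as is done for alternative start codons such as GTG. The function name `translate` and the constants are hypothetical helpers, not part of the record.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna_blocks):
    """Translate a CDS given as whitespace-separated blocks of bases."""
    dna = "".join(dna_blocks.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if pos == 0:
            # Simplifying assumption: the initiator codon is always read as M.
            amino_acid = "M"
        elif amino_acid == "*":
            # Stop codon: end of the protein.
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "GTGGAAGAGT TTTCAATGGA ACAAAGTGAT TTTGACATAC AAAAAGAGGA AGACCGGAAT"
print(translate(first_line))  # MEEFSMEQSDFDIQKEEDRN
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function would translate the complete CDS up to the stop codon.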