Gene Cthe_1830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1830 
Symbol 
ID4809814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2169800 
End bp2171014 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content38% 
IMG OID640107244 
Producthypothetical protein 
Protein accessionYP_001038244 
Protein GI125974334 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.034164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTT TAAAAGAACT GCTTAAAAAG AAAGATGGGT CGTTAACTGT AGAGGCTGCA 
ATTTCATTGC CGCTGTTTAT GTGTGTGTTT TTATCCATAG CCTTTTTCAT GAAGGTTGTG
TATATACATA ATAATGTGCA ATATGCAATA AACGGCGCTG CCAATGAAGT TGCAACCTAC
AGCTATCTTT ATTCAATTTC CGGCCTCCAG AAGGTGAATG ATGCTATTAC GGAAACAACG
GATGAATATG GAACGACTGC ATCAGAGCAT ACAAAAGAGA TTCTGGAAGC TTTTGATGCC
TTGGGAGATA TTTCACAGCA GAGTCTGGAA TCGTTTAAAG GACTGGCGGC AGGTGATACA
ACACAAATAG ACAAGTTAAA GGAGTTGTAT GAAGAAGGTA AAATTTCAGT GGGGACCGTC
CAGAAAGTCA TTGGTGAAGT AAAGGAGAAT CCCAGAAAAG AGTTCATCAG TGTTGCTTCT
TTGTTTTTCA GTGCAGGATA TGAAAAAATC AAATCTGAAT TGTCAGAGCC GTTAATTAAG
CTCTTTATGA GAAAATACAT TGACGAGAGA ATATTCAACA GCAAGGGTGG ACCGGGAGCT
TATATTGTAG TAAAGGAAGG AAAAGACCCG TTAGATGCTT TTAGCTTTAA CAACCGGATA
TTTACCGACA ATAAGAGCAT AGATATAAGA GTAAAATATA AGATAAAGAC TTCGTTGCCT
ATAAACATTC TTCCTGAAAT CAGCATCGAG CAACGGGCAA CTGTCAGAGG ATGGATGGAC
GGAGATAAAT CGGCACCGGT AAAAGAGGAA CCAAAAGAAG AATCTTTATG GGATAAAGCG
CCTTTTGAGT ACGGTAAAGT GATTACCGAG AAAGAACTTG AAAAGTATCC GGACAAATAT
CCGAATTCCG GGCATATATA TGAAGTCAGG AGTATCAATT TGGATTGCGA AACGTATAAA
GATATTAAAA AAGCAAAGAG CTCTTTAAAG AGCAGTATTA ATAAATTTAG TTCGAAAACT
AAAGATGTCG CTGAAATTAC TTCAAGGACA TTTATTATAG TGATACCGGA AGGGACATTG
ACAGACGAAA TCAAGGCAAT GTTGGAAGAA CTGAAAAGTG AAGCGGCTTC CGGGACACCT
TCGATAGAGG TGATTTATAA AGAAGGATAT GGAAGACAAA GTAATGTAAG TGACAGCAGC
GAAGAAGAAA AGTAA
 
Protein sequence
MNILKELLKK KDGSLTVEAA ISLPLFMCVF LSIAFFMKVV YIHNNVQYAI NGAANEVATY 
SYLYSISGLQ KVNDAITETT DEYGTTASEH TKEILEAFDA LGDISQQSLE SFKGLAAGDT
TQIDKLKELY EEGKISVGTV QKVIGEVKEN PRKEFISVAS LFFSAGYEKI KSELSEPLIK
LFMRKYIDER IFNSKGGPGA YIVVKEGKDP LDAFSFNNRI FTDNKSIDIR VKYKIKTSLP
INILPEISIE QRATVRGWMD GDKSAPVKEE PKEESLWDKA PFEYGKVITE KELEKYPDKY
PNSGHIYEVR SINLDCETYK DIKKAKSSLK SSINKFSSKT KDVAEITSRT FIIVIPEGTL
TDEIKAMLEE LKSEAASGTP SIEVIYKEGY GRQSNVSDSS EEEK