Gene Cthe_2974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2974 
Symbol 
ID4810862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3493240 
End bp3494451 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content37% 
IMG OID640108396 
Producthypothetical protein 
Protein accessionYP_001039364 
Protein GI125975454 
COG category[S] Function unknown 
COG ID[COG5373] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAC AAAAAGGTAC TATTTTAAAG CTTAAAAACA ATTTGGCCAT TATCATGACC 
AGTGACTGCA AAATTGTTTC AATAAAGAGA CAGCCAGGCA TGTATGAGGG TTTGGAAATA
TCGTTCAATA AAAACGAAAT TATAAATAAA AAGAACAAAC TGGCTTTTTA TTCCCGAATT
GCCGCAGGAA TCGCCGCAAT ATTCATAATC ATGGTTATCT CTTTCAATTT ATTTAATAAT
AATGATGTAT ATGCTTATGT TGCCATAGAT TCCGATGCCA GCATAGAATT TGAACTGGAT
AAAAACAATA AAATAGTCAA AGTGAATTAC TATAATGATA ATACAAATAC TGTATTGGAT
GAATTAGATT TAAAGAATAA ACCCGTTGAT TTTGCAATAA AAGAGGTAAT AAAAAAACTG
GACTTAAATG AATCCGTTAT TTTGATATCA GCATGTTTGA AAGAACAAAA CACAAAAAAG
TCCTCCGCTT CCGATAATTA TGAGTCTGAA AAATTAAGTA AATTAATTGA TATTTGTAAA
AATGCCGTTG AGGTCAATGT AAGTGAAAAT GTTGAGTCAA AAGTGGTGGA AGTTTCCTAC
GATTATAAAA AACTGGCTGA AAAAAACAAA CTCTCCCTAG GTCGAAGCAT TGTCTATGAA
AAAGCCAAAG AGCAAGGGAT AGCTCTGAAT ATCGAAGACA TAAAAAACAA AAGCATTGGA
GAGACTTTAC AGAAGGTCAA AATTGACGAT GTCGGCGTTG TACACAACGT AAAAAAAGAG
GAACCAAAAA AGCCTATGCC GGAAAAGCCT GAACCTGGAA AGCCCGAACC GCAAAAACCA
GAACCCGGAA AACCTGACCC GGCAAAACCC GAACCGGGAA AACCCGGACC GGAAAAGCCC
GAGCCGGAAA AGCCTGAGCC GGCAAAGCCT GAGCCGGCAA AACCTGAGCC GCAACCACAA
ATAAATGATT TGCCAAAAGA TAAAACCATA CCGGAAGAGA AAACAATTCC GAATTCCGGA
GTTGAACCAA TGGCCGAGCC GATAGTTGAA CCAAAAGACA AACAGCAGGA AAAACCCAGG
CCCGATTCAA AGCTTAAACT TGAAGAAAAA CCTACGGTTG AACCAAAAGA CTCCTTGGAA
GAAAAACCCG TGACAAAACC AAAGGATGAC AAAAAGGAAA AAGCAAAGAA CAGCATTGAA
AAAATGCCAT AG
 
Protein sequence
MTKQKGTILK LKNNLAIIMT SDCKIVSIKR QPGMYEGLEI SFNKNEIINK KNKLAFYSRI 
AAGIAAIFII MVISFNLFNN NDVYAYVAID SDASIEFELD KNNKIVKVNY YNDNTNTVLD
ELDLKNKPVD FAIKEVIKKL DLNESVILIS ACLKEQNTKK SSASDNYESE KLSKLIDICK
NAVEVNVSEN VESKVVEVSY DYKKLAEKNK LSLGRSIVYE KAKEQGIALN IEDIKNKSIG
ETLQKVKIDD VGVVHNVKKE EPKKPMPEKP EPGKPEPQKP EPGKPDPAKP EPGKPGPEKP
EPEKPEPAKP EPAKPEPQPQ INDLPKDKTI PEEKTIPNSG VEPMAEPIVE PKDKQQEKPR
PDSKLKLEEK PTVEPKDSLE EKPVTKPKDD KKEKAKNSIE KMP