Gene Cthe_3214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3214 
Symbol 
ID4809516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3806501 
End bp3808486 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content35% 
IMG OID640108648 
Producthypothetical protein 
Protein accessionYP_001039602 
Protein GI125975692 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAATA ATGAAAATGT TGTGTATAAA TCAAAAGCAC CATATAACTT TATTGGATTG 
GAAGACTTCA TTTTGGATAA ATGTAGTGAT GAAGAAGAAC TTTTAATGCA TGAAAGGTAT
CATGAAGGTT TGAAGACAGG ATGCATTCAG TATGAAATCG AGGTAATAAC ACCTCTTCAT
ATTTCTGCAG GTAAAAAAGA ATCTGGTAAA GAGAAAGAAG ACAGTGAGGA AGAAAGTACC
AGAGAAGAAG AACTCTTTAA AAATCCTCTT GGGCAATATG TTATTCCGGG GAATACAATT
CGTGGCCTCA CAAGGTACAA CGCTTCCATA TTTTCTTTTG CGTCAGTGAT AAATGAGCCA
AAAGGAAAGA AAGATATTGA AAACAAAAGA TTTTTTTACA GAACTTTTGC ATCCAAAGAT
GCAAATATAA GAAAGTGGTA TTCTGATACT TTGGGAATGA GGCTGCGTCA AAGAGGAAGT
TTTAAATACA CTGTTCTTGA AAAGGTTCAT GCCGGTTACA TACGTAAAAG GGGAGAGGAA
TATGTTATTA CTCCTGCAGT TAAGATAGGA AACAAAAATG TGGACGATGA AAGCAGAATG
AAATCGTATA TGCCAATACA TGAGTGGGAA CTAAGAAATT TAAACGTTCA AGGGGTATAT
TATTTATACA ATGACAATCT CAAAAAAGAA GATTACAAAA GTGATTCGAC TTTAAGAGAA
AACCGTAACA GTCAGTACAG ACCATACTTT ATAGAAGTAA GATATAATGT TGCGGAGGGA
AAACCTAAAA TTGATATTAA CGGAAAGTTC AAAGGTATGC TTGCCAATTC AAATTATATA
AATAGCAAGA GGCACCACTA TTTAATATTT GAGGAAGATA AAAACAGTGC TGAAATAGTT
GTATCAAAAA AATTGGCAGA ACTTTACACA GATGATTTGA AGTACACGCA AGAGCGCAAT
GCTGCAAGCA AAGAGATTTA TAAAGAATAT TATGAACTTC CCAAAGAAGG AGAAGTAAAA
CCGGTTTTCT TTGTAAAAGA AGGTGACAGG CTTATTTTTG GTTTTACTCC GTATCTGAGA
ATTCCGGCCG AAGGAGATAT ATACGGCGGA ATACCGGAGG TTCACAAAAA CTATACCGGA
ACGGATTTTG TTGATGCCAT GTTTGGATGG AGAGATTTTA GAACGAAGCT TTCGTTTTTG
GATGCTGTTT GTGAAAGCCC AAATCCTGAA ATAACAAAAG AGTATGAAAT GATGTTGGCG
GAGCCAAAAC CGTCATGGTA TAAAGGTTAT CTGAAACAAA GAAAAGACAC TTTGGAATCG
TATAGCACAG AAGGGTACGA AATAAGAGGA CGGAAGTTTT ACTGGATGAA AGAAAGCCTT
GATATTAAGG GAATGGAAGA ACAAGAAAAG AAAGAAAGAC TTGTTACAAG GATGAAGTGC
TATGCTGAAA ATACTAAATT TATAGGAAAA GTTAAATTTG AAAATTTGAC AGATGAGGAA
TTGGGACTTT TAATATACTC ATTAAAACAA GGAGACACGG AAGGGTACTT TAACCTGGGT
AAAGGAAAAC CTTATGGATT CGGTAAATGC AGGATACGGA TTTTGGGACT TTTTGTGGAA
AATATAAAAG AAAAGTACAC TTCTTTTAAT GCTAATTATC TCAAGGAGGA AAAGTCAGAC
AAATATGTGG AAGCTTTTAG GAAATACATA ATTGAACATT ACAGAAAAAA GGTGAGCAAT
GTAAATGAAA TTATAAGTTA CAGGGAGTTT GAACTTAGTA AAAAGATTTT GAAAAATTCA
GAAACCAGAT ATATGAAGGT GGGAGAGTTT GCTCAAAGAG CTGAGCTGCC TATGTTGGAA
GATGCTGTCC GAGAGACCGG TAATGTAATA TCTACAAAAA TGAAAGGGGC AGAGAAGAAA
AATAGTAATC ACAAGGTAAA AAGTGACAAG CGGAAGAACC ATAATGAATC AAATAAAACA
ACATAA
 
Protein sequence
MGNNENVVYK SKAPYNFIGL EDFILDKCSD EEELLMHERY HEGLKTGCIQ YEIEVITPLH 
ISAGKKESGK EKEDSEEEST REEELFKNPL GQYVIPGNTI RGLTRYNASI FSFASVINEP
KGKKDIENKR FFYRTFASKD ANIRKWYSDT LGMRLRQRGS FKYTVLEKVH AGYIRKRGEE
YVITPAVKIG NKNVDDESRM KSYMPIHEWE LRNLNVQGVY YLYNDNLKKE DYKSDSTLRE
NRNSQYRPYF IEVRYNVAEG KPKIDINGKF KGMLANSNYI NSKRHHYLIF EEDKNSAEIV
VSKKLAELYT DDLKYTQERN AASKEIYKEY YELPKEGEVK PVFFVKEGDR LIFGFTPYLR
IPAEGDIYGG IPEVHKNYTG TDFVDAMFGW RDFRTKLSFL DAVCESPNPE ITKEYEMMLA
EPKPSWYKGY LKQRKDTLES YSTEGYEIRG RKFYWMKESL DIKGMEEQEK KERLVTRMKC
YAENTKFIGK VKFENLTDEE LGLLIYSLKQ GDTEGYFNLG KGKPYGFGKC RIRILGLFVE
NIKEKYTSFN ANYLKEEKSD KYVEAFRKYI IEHYRKKVSN VNEIISYREF ELSKKILKNS
ETRYMKVGEF AQRAELPMLE DAVRETGNVI STKMKGAEKK NSNHKVKSDK RKNHNESNKT
T