Gene Cthe_0833 details


Gene Information

Locus tag: Cthe_0833
Symbol:
ID: 4810451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1013569
End bp: 1014804
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 40%
IMG OID: 640106250
Product: exodeoxyribonuclease VII large subunit
Protein accession: YP_001037261
Protein GI: 125973351
COG category: [L] Replication, recombination and repair
COG ID: [COG1570] Exonuclease VII, large subunit
TIGRFAM ID: [TIGR00237] exodeoxyribonuclease VII, large subunit
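
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. All variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1013569
end_bp = 1014804
gene_length_bp = 1236
protein_length_aa = 411

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, so the protein has one residue
# fewer than the number of codons.
codons = span // 3

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1236 bp is 412 codons, or 411 residues plus one stop codon.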


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0015301
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGAGT TTTTTTCTGA CAATTGTGAC AACATATTAA CGGTTTCAGC AGTCAACAGA 
TATATCAAGG AAATCATGTC AAGGGATCTG ATCCTGTCAA ATCTTTGGGT TAGAGGCGAA
ATATCGAATT TTAAATATCA TTCTTCGGGT CATATGTATT TTACCCTTAA GGATGAGAAC
TGTTCACTAA AATGTGTGAT GTTCAGAACA TACAACTTGC ACCTTAAATT TATGCCTGAA
AACGGCATGA AGGTGATAGT AAAGGGCTAT ATTTCGGTAT TTGAAAGGGA CGGACAATAT
CAGCTCTATG CTGAGGAAAT GCAAAATGAC GGTATAGGAG ACCTTTATAT TGCTTTTGAA
CAGCTAAAGA GAAGACTTGC AAGCGAAGGT CTTTTTGATC CGGCACACAA GAAAAAGATA
CCGTTTATGC CGAGGACAAT AGGAGTGGTT ACCTCTGCCA CCGGTTCGGT TATCAGAGAT
ATTATGAATA TTTTGGACAG ACGGTTCTAT AATTCATATA TAAAGATATT TCCTGTCAGG
GTCCAGGGTG AAACCGCCGC TTTGGAAATA AGCCATGCGA TAAGCAAATT GAATGAAATC
GGCGGTGTGG ATGTCATTAT CCTTGCCAGA GGTGGAGGCT CTTTGGAGGA ATTATGGCCG
TTTAACGAGG AAATAGTGGC AAGAAGCATA TTTAATTCTT CCATACCGGT AATATCGGCC
GTGGGACATG AGACGGACTA TACAATAGCG GATTTTGTTG CAGATTTAAG GGCGCCCACT
CCATCAGCGG CCGCCGAATT GGTAATGCCT GAAAAAGTAA CTATTATAAA CAGAATAAGA
GAGCTTAATG TCAGGATGGT GGACGCACTT CAAAGAAATG TAAAGCAAAA AAGGGATATG
CTTAAAAAAC TTGCCGATTC AGTAGTTTTC AGGCAGCCAT ATGACAGAAT ATATCAGGAA
AGAATGAAGC TGGACATTTT AAACAGGGAC TTGAAAAAGA GCATGTTTGC TTCTTTAGAG
AGGGCAGGAT CAAAGCTTGG ATTTTTGATA GGAAAACTTG ACGCATTAAG CCCCCTTACT
ATATTATCAA GGGGATACGG AATTATAAAG TCGGAAGAAA AAGGGATTTT TGTAAAATCC
GTTAACGATG TGGATGTCGG AGAAGGAATT GAAGTGAGTG TGAAAGACGG AAGGCTTTAC
TGCACGGTAA GGAAGAAGGA ATTGAATGAT GATTAA
 
Protein sequence
MGEFFSDNCD NILTVSAVNR YIKEIMSRDL ILSNLWVRGE ISNFKYHSSG HMYFTLKDEN 
CSLKCVMFRT YNLHLKFMPE NGMKVIVKGY ISVFERDGQY QLYAEEMQND GIGDLYIAFE
QLKRRLASEG LFDPAHKKKI PFMPRTIGVV TSATGSVIRD IMNILDRRFY NSYIKIFPVR
VQGETAALEI SHAISKLNEI GGVDVIILAR GGGSLEELWP FNEEIVARSI FNSSIPVISA
VGHETDYTIA DFVADLRAPT PSAAAELVMP EKVTIINRIR ELNVRMVDAL QRNVKQKRDM
LKKLADSVVF RQPYDRIYQE RMKLDILNRD LKKSMFASLE RAGSKLGFLI GKLDALSPLT
ILSRGYGIIK SEEKGIFVKS VNDVDVGEGI EVSVKDGRLY CTVRKKELND D
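
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not part of the original record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two are understood to differ only in permitted start codons), and it builds the codon table from the conventional T, C, A, G ordering. The `translate` helper and the constant names are illustrative. Only the first and last lines of the gene sequence are translated here; the full sequence can be passed in the same way.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (start of the first sequence line).
first = translate("ATGGGGGAGT TTTTTTCTGA CAATTGTGAC")
# Final sequence line; it begins at base 1201, which is in frame.
last = translate("TGCACGGTAA GGAAGAAGGA ATTGAATGAT GATTAA")

print(first)  # MGEFFSDNCD
print(last)   # CTVRKKELNDD
```

Both fragments match the corresponding ends of the listed protein sequence, which starts with MGEFFSDNCD and ends with CTVRKKELNDD.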