Gene Cthe_0814 details


Gene Information

Locus tag: Cthe_0814
Symbol:
ID: 4810432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 983728
End bp: 985440
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 40%
IMG OID: 640106231
Product: DNA repair protein RecN
Protein accession: YP_001037242
Protein GI: 125973332
COG category: [L] Replication, recombination and repair
COG ID: [COG0497] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00634] DNA repair protein RecN
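
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(983728, 985440)
print(length)                  # 1713
print(protein_length(length))  # 570
```

Under those assumptions the computed values agree with the listed gene length (1713 bp) and protein length (570 aa).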


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCCAAC GGTTGGAAAT TCAGAATGTA GCAATAATTG ACAAGGTTGA AATTGAGTTG 
GGAGATGGGC TCAATGTACT GACCGGCGAA ACCGGAGCGG GAAAGTCCAT CATAATTGAC
TCAATAAATG CCATTTTAGG GCAAAGACTG TATAAAGACC TTATAAGAAC GGGCAGGGAC
AAAGCCATTG TTGAAGCCGT CTTTCAGGTG GATAAAAAAA GGGTGGAAGA TTTGCTGGAG
GATTTTGGAA TAGACTGGGA AGAAGACGGT ACTTTGGTTG TGTCCAGAGA GTTTACCACT
TCAGGGAAGA ATACTTGCAG GATTAACGGC AGAATTGCAA CGGTGTCAAT GCTAAAACAA
TTGGGAGAAA GGCTTATTGA TGTACATGGA CAGCATGACA ACCAATCCCT CTTAAGAACC
GAAAGCCACA TCGATCTTCT GGATTCTTTT GCGTCTTCCA GGCTTCAAAG CTTGAAAGAT
GAGTATTTAA AACATCTTGA AACATACCGG AAGATTAAAA GCAGGTTGAA GGAACTGACC
GGTGACAAAA ATGAAAGGGA GCGTAAAATA GATATTCTCA AGTATCAGAT TGATGAAATA
AAAAAGGCGA AGCTAAAGAC AGGTGAAGAA GAGGAACTTT CAAAACAGAG AGAACTTTTG
GTGAATTCTG AAAAAATTAC AAACACTCTT TCCAATGCCT ATGAACTTTT AGGAAGCGGA
GGCAAATTCG GAGAATCCGC ACTGGACATG ATAAACAAGG CTGCTTCGGA TTTTGGCGGT
ATAGAGGAGT TTGATGCAAA ATATGATGAA CTTAAAAAAA GGATTGAGGC GGTTGCGATT
GAACTTGATG ATATTGTCTC GGAAATCCGC AATTTGCGCG ATAATATGGA ATATGATCCA
GACCTTCTTA TGCAGATTGA AAGCAGACTT GATGTATTAT ACAGGCTTAA AAAGAAATAT
GGAGATTCGG TGGAAGAAAT CTTAGAGTAC AAGGATAAAA TAGAAAAGGA ACTGGATGAA
ATTTTAAATA ATGAAGAAAT TGTAAATAAG TTAAATGAAG AGCTTTTGGA AGAAGACGGG
AAGCTGTACC GACTGGCAAA GGAAATGAAC AATGAAAGGG TTAAGGCGTC AAAGCTTCTC
GAAGAAAAAA TCGGCGAGGA GCTTAAAGAC CTGGAAAAGA AAAACACCAG TTTCAAGGTG
AGAATAGATT TTGACGATTC AACGGAGAAT GGGGAAAGAA AATACAATAA CAACGGTCTT
GACAGAGTGG AGTTTATGAT ATCCACCAAC GCTGGAGAGC CTTTGAAACC TTTGGCAAAG
ATAGCTTCCG GCGGAGAAAT GTCGAGAGTG ATGCTTGCAA TAAAGACAAT TCTTGCAAAA
GTGGACAAGA TACCCACAAT GATATTCGAC GAGATTGATA TTGGAATAAG CGGTGTTGCG
GCTCAAAAAG TGGGAGAGAA GCTCTGTTAT ATTTCGAAAA ACCACCAGGT CATATCTGTA
ACCCACTTGG CACAAATAGC CTGTATGGCG GACAATAACT ATTATATTGA CAAGGTAACC
GAAAACGGCA ATACCAGGAC GGTGGTTAAA AAGCTTGATG AAAGGGGAAA GAGGGACGAA
ATAGCAAGGA TCCTCGGTGG AGCGAGTATT ACGGACATAA CATTAAAGCA TGCTGAAGAA
ATGCTTGACA AAGCAAAAGA ATTTAAGAAA TAA
 
Protein sequence
MLQRLEIQNV AIIDKVEIEL GDGLNVLTGE TGAGKSIIID SINAILGQRL YKDLIRTGRD 
KAIVEAVFQV DKKRVEDLLE DFGIDWEEDG TLVVSREFTT SGKNTCRING RIATVSMLKQ
LGERLIDVHG QHDNQSLLRT ESHIDLLDSF ASSRLQSLKD EYLKHLETYR KIKSRLKELT
GDKNERERKI DILKYQIDEI KKAKLKTGEE EELSKQRELL VNSEKITNTL SNAYELLGSG
GKFGESALDM INKAASDFGG IEEFDAKYDE LKKRIEAVAI ELDDIVSEIR NLRDNMEYDP
DLLMQIESRL DVLYRLKKKY GDSVEEILEY KDKIEKELDE ILNNEEIVNK LNEELLEEDG
KLYRLAKEMN NERVKASKLL EEKIGEELKD LEKKNTSFKV RIDFDDSTEN GERKYNNNGL
DRVEFMISTN AGEPLKPLAK IASGGEMSRV MLAIKTILAK VDKIPTMIFD EIDIGISGVA
AQKVGEKLCY ISKNHQVISV THLAQIACMA DNNYYIDKVT ENGNTRTVVK KLDERGKRDE
IARILGGASI TDITLKHAEE MLDKAKEFKK
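
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 bases of the gene. This is a minimal sketch, not the method the database uses: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which start codons are permitted).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (as in the 10-base blocks of the listing) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed in this record.
first_60 = ("ATGCTCCAAC GGTTGGAAAT TCAGAATGTA "
            "GCAATAATTG ACAAGGTTGA AATTGAGTTG")
print(translate(first_60))  # MLQRLEIQNVAIIDKVEIEL
```

The output matches the first 20 residues of the protein sequence above (MLQRLEIQNV AIIDKVEIEL). Applying the same function to the full gene sequence should stop at the terminal TAA codon.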