Gene Athe_1335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1335 
Symbol 
ID7408916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1420671 
End bp1422326 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content33% 
IMG OID643715700 
ProductDNA repair protein RecN 
Protein accessionYP_002573208 
Protein GI222529326 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAA GATTATTAAT CGAAAATATT GCTATAATTG ATAGACTTGA CATTGAATTT 
GACAAAGGTC TGACTATACT AACAGGTGAA ACTGGTGCAG GTAAATCTAT TATCATTGAT
TCTTTATCTT TGCTTTTTGG AACAAAATTC AAAAAAGAAA TTATAAGAAC AGGATGTACA
AAGGCCTGCG TATCTGCAGT TTTTGAAATC GAAAAAAAGA GTACTATTGA GAGATTGACT
CAGATGGGAA TTTCTCTTGA AGACAATTAT TTGATTGTTA GTCGTGAAGT GTACAGCAGT
GGCAAAAACA TCTGCAGGGT TAACAATCAG TTTGTATTAC TCTCAACATT AAGAGAAATA
ACTAAGCATA TATTTGAAAT TCATGGCCAG AATGAGACTC ATCTTTTAAA CGATAAAAGG
ATTCAACTTT TGTACATAGA TAGGTTCTGT GGGAGAGAAC TTGAAGAGTT AAAAGCTGAA
TATAAGGATT TGTACCATGA TTACCAAGAG AAAAAAAGGC TTTATGAACA GATAATAACA
AAAGAAGAAG AGCGGGAAAG ACAACTTGAT TTACTAAACT ACCAAATAAA TGAGATTGAA
AGCGTAAAAC CCCAAATAGG TGAGGATACA GAGCTTGAAA AAAGAAAAGA GATTATCCAA
AACAGCTGGA AACTCAAGCA CAACAGTGAA AAAATGCTTG ATACTATCAA TAATACAATT
ATAGACTCTC TTGAGATGTG TATCAGACTT GCTAACGAAA ATTCAAGGTT TGACAAGGAA
TTTGAGGCAA TATCTGAAAG ACTGAACAAC GTGTATTATG AAATAGAAGA CATCTCATTT
TCTATATCCA AAAAAAGCCA GAGCTACGAA TTGAATAAAG ATGAAATAGA ACAAATAGTG
GACAGACTTG ATAAAATAAA CAGATTAAAG AAGAAATATG GAAGCACAAT CGAAAAGATA
CTGGAGTACA GAAAAAATTT ATTAGAAGAG AGGGAAAAAA TTAGGAGTAG TAGCGAACAA
GCTTTTGAAC TAAAAGAGTA TTTGAGCAAA ACTAAAGAAA GACTTGAGGA AATTTCTAAG
AAGATGTCGA ATATCAGGCG GAGAAAATCT GAGGAGTTTG AAAAAAAGGT ATTAGAGATA
CTTTCCCAAC TTGAAATGAA GAATGTAAGT TTTTATATTA ATTTTCTTGA AAGAGAGCTT
TACGAAGAAG GAATTGACGA AGTAGAATTT TTGATATCAA CAAACGTTGG TCAGCAGCTA
AAGCCACTTT CTACAATTGC TTCAGGCGGG GAACTTTCAA GAATAATGCT TGCAATAAAA
TCTATTGTGG CAGAAAAAGA CGATATAGAG CTGATTATCT TTGATGAGAT AGATAGCGGA
CTGAGTGGAG TTGTTGCCAA CAGACTTGCA AAACTTTTAA AAGAACTATC AAAGAAACAC
CAAATTATAT GTATTACGCA TTTGCCCCAG GTTGCTGCTG CCGCAGATAC ACATTATTAC
GTATATAAAG AAGTTAAAGA TAATTTTACA ATCTCAAATA TCAAAAAGCT TGAAGGGAAT
GAACAGTTAA GAGAGATTGC CAGAATGTTT TCTGGAGAAA ATGTTACAGA AAGTTCTCTT
CTTCATGCAA AGCAGTTAAA ATCTCAGTTT ATTTGA
 
Protein sequence
MLKRLLIENI AIIDRLDIEF DKGLTILTGE TGAGKSIIID SLSLLFGTKF KKEIIRTGCT 
KACVSAVFEI EKKSTIERLT QMGISLEDNY LIVSREVYSS GKNICRVNNQ FVLLSTLREI
TKHIFEIHGQ NETHLLNDKR IQLLYIDRFC GRELEELKAE YKDLYHDYQE KKRLYEQIIT
KEEERERQLD LLNYQINEIE SVKPQIGEDT ELEKRKEIIQ NSWKLKHNSE KMLDTINNTI
IDSLEMCIRL ANENSRFDKE FEAISERLNN VYYEIEDISF SISKKSQSYE LNKDEIEQIV
DRLDKINRLK KKYGSTIEKI LEYRKNLLEE REKIRSSSEQ AFELKEYLSK TKERLEEISK
KMSNIRRRKS EEFEKKVLEI LSQLEMKNVS FYINFLEREL YEEGIDEVEF LISTNVGQQL
KPLSTIASGG ELSRIMLAIK SIVAEKDDIE LIIFDEIDSG LSGVVANRLA KLLKELSKKH
QIICITHLPQ VAAAADTHYY VYKEVKDNFT ISNIKKLEGN EQLREIARMF SGENVTESSL
LHAKQLKSQF I