Gene Athe_1604 details


Gene Information

Locus tag: Athe_1604
Symbol:
ID: 7409434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1703965
End bp: 1706022
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 32%
IMG OID: 643715973
Product: hypothetical protein
Protein accession: YP_002573471
Protein GI: 222529589
COG category: [L] Replication, recombination and repair
COG ID: [COG3593] Predicted ATP-dependent endonuclease of the OLD family
TIGRFAM ID:
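
The start, end, gene length, and protein length values above are mutually consistent, which the following minimal Python sketch checks. It assumes the coordinates are 1-based and inclusive (the record does not say so explicitly, but the listed gene length fits that reading) and that the final codon is an untranslated stop. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1703965, 1706022)
print(length_bp)                  # 2058
print(protein_length(length_bp))  # 685
```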


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTATATTG ATAAAATAAA AATCAGAAAT TTTAGATGCT TTGGTCCTGA GGAAACAGTA 
ATTGAATTTG ATAAGCTAAC TGCTTTAATA GGTGCAAATA GCTGTGGAAA AACAGCAGTT
TTACATGCAT TAATGAAAAT ATTTGGGAGT GACAATAAAG AGATTAAACG TTCTGATTTC
CATGTACCTA AAAGCATGGA TCCGCAAGAA ATTGAAAATA ACGACTTATA CATAGAGGTA
AAATTAAGTT TTCCAGAATT GCGGCAAGAG AATACTGAAC TCAATAGTAT ACCATCTTTT
TGGAATTATA TGGTTATTAG AGAACCTGGC GTTTTACCAT ATGTAAGGAT ATGTTTGAAA
TCTTCATGGC AAAAAAGTAA TACTCCTGAT GGAGATATTG AAACAGAAAT ATTATTTATT
AAAGCCCCGG AAGGTAGAGA AAAAGACAGC GATTACGAAA AAGCAAAACG AGAGCATTTA
TCAAGAATTC TATTTGTATA TGTTCCGGCT ATAAGAGATG TCTCTCCTCA ATTAAGAAAT
GTTTCGAACT CTATACTCTG GAAAATACTC AATAGCATAG AATGGGAGGA AGATTTCAAA
AGTAAGATAA GGGAGAAGAC GGAAGAGATT GACAAACTTT TTGCAAAAAA TCCAGGAGTA
TCTTTAGTAA AAGAAATTAT TAGTAATACG TGGAAGAAAT ATCATAGGGA TTATAGGTAT
AAAGAGGCAC ACATGCGTTT TAGTAGGGGA GACTTGGATA CTGTTTTAAA GAAAGTTGAA
ATAGAATTTT ATCCAACTTC TGAACCTAAG TCTTATACAG TAAATGAATT GGGCGATGGA
TTGCGCTCGC TATTTTATTT AACATTGGTA AATTCCCTTC TTGAATTTGA GAATAGAATC
CTACAGTCAA AACATTCTTC TAAAAGTCCT TTTAACAAAG AACCTTCGAC ACTTGTTTTG
CTTGCAATAG AAGAGCCAGA AAATCATGTT TGTCCTCATT TATTAGGACG TATTATGGAT
AACCTAAAAG ATATTTCCTC AAAGCAAAAT GCACAGGTTA TACTTACTTC ACATTCGGCA
AGTATTATAA GTAGAATTGA GCCTACAGAA ATACGTCACC TAAGAATAAA AAATGAGGAT
TTATGTACTA AAGTTTCAAG GATTATTTTA CCAGAACAAA TTGATGAAGC ATTTAAATAT
GTAAAGGAGG CTGTAAAAGC TTATCCTGAG ATATACTTTT CCAGGTTGGT AATATTAGGT
GAAGGTGATA GTGAAGAACT AATAATCAAG AAAATGATTG AAAAGAGCGG TTTGTCAGCT
GATAGCTGTG CAATAAGTAT TGTGCCACTT GGAGGAAGAT TTGTAAATCA TTTTTGGAGA
CTTCTGCAAG ATATTGGTAT TAATTATATA ACATTGCTTG ATATGGATAT CGAAAGAAAT
ACAGGGGGAT GGGAAAAGTT GCATTATATA ATGAATCAGC TTATACAAAG TGGCTATGAT
GAAAAAAGTG TTTTAGCAGA TTTATCTAAA GAAGAATTTG ACAATATGCC AAATTGGTCA
TATGACGAAT ATAGCAGAGA GAAAATAGAA AAGTTTGCAA AACATCTTCA ACAATTTGAT
ATATTTTTTT CATTTCCACT TGATATAGAT TTTTCAATGT TAAGTACTTA TGAGGAAGCA
TATAAAAAAC TTATACCTAA AAAAGGTGGT CCAAGAATTC CTGATAAAAT TACAAAAAAA
GAAGACTATG CACGTTATAT TGACAATGTT GTAAAAAGTG TTTTGAAATC AGATAATGCT
ACTGGTATAA CTTATTCAGA TAATGAAAAG GAGTTGATGG TCTGGTATAA ATATTTATTT
TTGGGTAGAG GTAAGCCAAA TACTCATTTT CAAGCTTTGA TAGAACTTGG GGATAGTATA
AAATCAAACA TGCCAGATGT TTTTAAAAAC CTTATAAACA GGGCTAAAGA ATTATTAGAA
AATGACCCCT ACTCTGATAT ATCTAATTTA GGAGATGAAA AATATGCTGA TACCAAAGGA
ACAATGGAAA CCTGCTGA
 
Protein sequence
MYIDKIKIRN FRCFGPEETV IEFDKLTALI GANSCGKTAV LHALMKIFGS DNKEIKRSDF 
HVPKSMDPQE IENNDLYIEV KLSFPELRQE NTELNSIPSF WNYMVIREPG VLPYVRICLK
SSWQKSNTPD GDIETEILFI KAPEGREKDS DYEKAKREHL SRILFVYVPA IRDVSPQLRN
VSNSILWKIL NSIEWEEDFK SKIREKTEEI DKLFAKNPGV SLVKEIISNT WKKYHRDYRY
KEAHMRFSRG DLDTVLKKVE IEFYPTSEPK SYTVNELGDG LRSLFYLTLV NSLLEFENRI
LQSKHSSKSP FNKEPSTLVL LAIEEPENHV CPHLLGRIMD NLKDISSKQN AQVILTSHSA
SIISRIEPTE IRHLRIKNED LCTKVSRIIL PEQIDEAFKY VKEAVKAYPE IYFSRLVILG
EGDSEELIIK KMIEKSGLSA DSCAISIVPL GGRFVNHFWR LLQDIGINYI TLLDMDIERN
TGGWEKLHYI MNQLIQSGYD EKSVLADLSK EEFDNMPNWS YDEYSREKIE KFAKHLQQFD
IFFSFPLDID FSMLSTYEEA YKKLIPKKGG PRIPDKITKK EDYARYIDNV VKSVLKSDNA
TGITYSDNEK ELMVWYKYLF LGRGKPNTHF QALIELGDSI KSNMPDVFKN LINRAKELLE
NDPYSDISNL GDEKYADTKG TMETC
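
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch in Python. It is not the database's own pipeline. It assumes that translation table 11 shares the standard codon-to-amino-acid assignments, and that an alternative start codon such as GTG (the first codon of this gene) is read as methionine at the start of the CDS. The `START_CODONS` set is a deliberately small, assumed subset, and `translate` is a hypothetical helper.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (standard assignments,
# assumed here to apply to translation table 11 as well).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed subset of start codons that initiate with methionine.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("GTGTATATTG ATAAAATAAA AATCAGAAAT"))             # MYIDKIKIRN
# Last 18 bases, which are not the start of the CDS
print(translate("ACAATGGAAA CCTGCTGA", is_cds_start=False))      # TMETC
```

The two printed fragments match the first ten and last five residues of the protein sequence listed above.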