Gene Athe_0891 details

Gene Information

Locus tag: Athe_0891
Symbol:
ID: 7407466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 993999
End bp: 995228
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 36%
IMG OID: 643715265
Product: transposase mutator type
Protein accession: YP_002572774
Protein GI: 222528892
COG category: [L] Replication, recombination and repair
COG ID: [COG3328] Transposase and inactivated derivatives
TIGRFAM ID:
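
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names gene_length and expected_protein_length are hypothetical helpers introduced for illustration; they are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(993999, 995228)
print(length)                           # gene length in bp
print(expected_protein_length(length))  # protein length in aa
```

With the values in this record, the sketch yields 1230 bp and 409 aa, which agrees with the Gene Length and Protein Length fields.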


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAA ACGAAATTCA CGAAACTGCT AAAAACATGG CTGTAGAGCA AGTATTAAAT 
ATGTATTGCT CCAAAGATGA TCCTAACCGC CCAGCTCTAA AACAACTCTT AGAAAACTTG
CTCGATTGCT TTATGTTATC GGAAAGATCA GTGTACCTTG CTAAAAATGA CAATGACAAA
GGCAATGGTT TTTACGATAG AAAACTTGCA ACACCTGTTG GCAGTCTTGA AATCTCTGTC
CCTCGCACAC GTACTGGTAA TTTCCGACCT TCTATCCTCC CTGACCGCTA CAAAAGAGTT
GATAGTTCAT ACACTGACCT GCTTATGTCT TTAGTCGTCA ATGGTTATTC CGAAAGTTCC
CTTGTCCAGA CTTTGAAAGC TTTGAATCTT CCATATTCCG AAAATGAAAT ACTAAAAATC
AAAGAAGACC TTAAAAATGA GCTTCAGTTA TTCAAACAAA GAGAACTACC AACAAGTGCT
TTTGCTCTCA TCATCGATGG TTATCATTGT GAAGTTAAGG ATAATTCTAA GGTTAAACAA
GCTACTTGTT ATGTTGTCCT CGGTATCGAC TTAGAAGGTA AAAAAGACAT TTTCGGTGTC
TACACTTTCT TCGGCAAAGA AAATAAGGCT GATTGGATGA AAGTATTTGA AGACTTAATT
ACAAGAGGGC TAAAAGAGAT TCTAATTGTC ATAAGTGATG ACTTCCCAGG TATTATAGAT
GCTGTCAAAC TTGCTTATCC TCTTGCTGAC CATCAACTGT GTTTTGTCCA CCTCCAACGT
AATGTCAGAA AACATATGAC AAAAGAGGAT GCTTCAGCTT TTAACAAGAG CTTGGACAAA
ATCAAAACCT TTTCTCCTGA TTTCGATGAA GCTGTATTGA AATTTAAAGA ACTTTGTGAT
GAATACCTTG CAAAATATCC TCGATTTATT AAAGCAATAT CAGAAAAAGC AGAGTTTTAT
CTTGCCCATA TGAAATACCC CGAGGAATTA AGAAAGCATA TCTATACCAC AAACGCCGTT
GAAAGTGTAA ACAGCATGAT TGAAAAGATT AGAGTAAATT CAGGTGGATA CTTTCAGACT
GCCAAAGTCT TAGAAATTAA TATTTACTTA CAGCGAGAGA ACTTACGCCG TACAAAATGG
AAAAATGGAG TTCCCAGTAT TAGAAAATGC ATCAATAACA TAACCCAACT TTACAACTTG
CGTTATAAAT TGGAAACACA AAATTCTTGA
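
The GC content field in the record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is supplied as text in the 10-base blocks shown here, so whitespace is stripped before counting. The function name gc_content is a hypothetical helper, and how the database rounds its reported percentage is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a DNA sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage: the first 10-base block of the gene sequence above.
print(gc_content("ATGAATAAAA"))
```

To check the whole gene, paste all sequence lines into one string and pass it to the function; the result can then be compared with the reported GC content.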
 
Protein sequence
MNKNEIHETA KNMAVEQVLN MYCSKDDPNR PALKQLLENL LDCFMLSERS VYLAKNDNDK 
GNGFYDRKLA TPVGSLEISV PRTRTGNFRP SILPDRYKRV DSSYTDLLMS LVVNGYSESS
LVQTLKALNL PYSENEILKI KEDLKNELQL FKQRELPTSA FALIIDGYHC EVKDNSKVKQ
ATCYVVLGID LEGKKDIFGV YTFFGKENKA DWMKVFEDLI TRGLKEILIV ISDDFPGIID
AVKLAYPLAD HQLCFVHLQR NVRKHMTKED ASAFNKSLDK IKTFSPDFDE AVLKFKELCD
EYLAKYPRFI KAISEKAEFY LAHMKYPEEL RKHIYTTNAV ESVNSMIEKI RVNSGGYFQT
AKVLEINIYL QRENLRRTKW KNGVPSIRKC INNITQLYNL RYKLETQNS
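
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11 (the table named in the record), which match the standard genetic code. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names CODON_TABLE and translate are hypothetical and introduced only for illustration.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# Usage: the first three 10-base blocks of the gene sequence above.
print(translate("ATGAATAAAA ACGAAATTCA CGAAACTGCT"))
```

For the three blocks used here, the result is MNKNEIHETA, the first ten residues of the protein sequence above.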