Gene Athe_1926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1926 
Symbol 
ID7407339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2032851 
End bp2034098 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content36% 
IMG OID643716298 
ProductRadical SAM domain protein 
Protein accessionYP_002573787 
Protein GI222529905 
COG category[R] General function prediction only 
COG ID[COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000517647 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATAC TTGAAAAACT TGAGGTTCTT GGTGCTGCTG CAAAGTATGA TGTTTCGTGC 
GCGTCAAGCG GAAGCAAAAG GGAAAAACAT TTTGGAATTG GCTCAACATT TTCTGCAGGT
ATTTGCCACA GCTGGACAGA TGATGGGCGT TGTATCTCCC TTTTGAAAGT TCTATTTACA
AATGAATGTA TTTTTGACTG TGCTTACTGT ATTAACAGAA GGAGCAATGA TATAAAAAGA
GCAACATTTA CACCCCAGGA AATTGCTGAG CTGACCATAA ACTTTTATAA GAGAAACTAT
ATAGAGGGTC TTTTTCTGAG CTCTGCCATC AAAAATTCTC CTGACTGGAC AATGGAGATG
CTTTTTCGAA CAGTTTGGCT TTTGAGATAT AAATATCAGT TCAATGGATA TATACACGTT
AAAGCTATTC CCTATGCATC ACTTGATTTA ATAAAAAAAA CTGGATTTTT GGTTGACAGA
ATGAGTGTCA ATATTGAACT TCCATCTGAA AAGAGCTTAA GGCTTTTGTG TCCCAATAAA
ACAAAAGAGA GTATATTAAA GCCTATGGAG TTTATAACAA AAGTAGCTGA AGAAGAGAAA
AATTTTGTAT CGGCTGGTCA AAGCACTCAG ATGATAATTG GTGCAACAGA TGACAGTGAT
TATAAAATAA TCAACTTAAG CCAGCACCTT TACAAAAAGT TTAAGCTCAA AAGGGTATAC
TATTCTGCAT ACACGCCTGT AAATCACGAT CCAAGGATAT TGAAAGTGGA CAGTCCACCG
CTTTTGCGTG AACACAGACT TTATCAGGCA GACTGGCTGA TTAGAGTTTA CAACTTTTCT
GCTGATGAGC TTTTTAAAAG CAAGGATGAG AACCTTGACC TTGAGGTTGA CCCAAAAGTG
ATGTGGGCGC TGAGAAACCT TGATAAATTC CCAATTGAGA TAAATAAAGC AAGTTATGAA
CAGCTGATAA GAGTGCCTGG CATTGGAATA AAAAGTGCAA AGAGAATAAT TAAAAACAGG
GTATTTCATT CTTTGGATTT TGAGGATTTA AAAAAGATGG GTGTTGTTCT AAAAAGAGCA
AAATATTTTA TAACTTGCAA CGGCAAGGCA TTTGAAAGGT TTTTAATTGA TTTGCCACCT
GAGAAAATAA GGCAAAAACT TACAGATAAA AATTCTTTGC AAAAACCTCA GCAGCTTTCG
TTTTTTGATA GGGAAGTATT TACATCTGTG ATAACAGGGG AGATTTAA
 
Protein sequence
MDILEKLEVL GAAAKYDVSC ASSGSKREKH FGIGSTFSAG ICHSWTDDGR CISLLKVLFT 
NECIFDCAYC INRRSNDIKR ATFTPQEIAE LTINFYKRNY IEGLFLSSAI KNSPDWTMEM
LFRTVWLLRY KYQFNGYIHV KAIPYASLDL IKKTGFLVDR MSVNIELPSE KSLRLLCPNK
TKESILKPME FITKVAEEEK NFVSAGQSTQ MIIGATDDSD YKIINLSQHL YKKFKLKRVY
YSAYTPVNHD PRILKVDSPP LLREHRLYQA DWLIRVYNFS ADELFKSKDE NLDLEVDPKV
MWALRNLDKF PIEINKASYE QLIRVPGIGI KSAKRIIKNR VFHSLDFEDL KKMGVVLKRA
KYFITCNGKA FERFLIDLPP EKIRQKLTDK NSLQKPQQLS FFDREVFTSV ITGEI