Gene Athe_1112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1112 
Symbol 
ID7408694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1206355 
End bp1208568 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content28% 
IMG OID643715478 
ProductABC transporter related 
Protein accessionYP_002572986 
Protein GI222529104 
COG category[V] Defense mechanisms 
COG ID[COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAGCTA AAAAGAAGCA CAAAAGAAAG AAAAAAGTTC CTTATGTAGA ACAACTACAG 
CAAGCTGAGT GTGGATTATG CTGCGTAGCA ATGATTTTGA GGTACTATGG AAGTTATTTT
ACATTAAATA ATTTGAGGGA ATATTTAGAT ATTGGTAGAG ATGGAACCAC AATACGACAG
TTAGTAAATT TAATGAATAA ACTAAACTTA AAAACTAAAC TATATGAGTG CTCGACTGAA
GGTTTGTACT ACATAGAATT GCCTGCGATA CTTTTTTGGG AGCAGAGGCA CTTTGTTATA
TTAGAAAGGA TTGATGAGAA ATATGCTTAT ATTGTTGATC CTGCGTGTGG TAGAAGAAAA
TTGACAGTAT CTGAATTATC GAGTTTGTAT TCTAATTATG CTATATATGC CTATCCAAAT
GAAAAATTTG TTCCTAAAAG AAAAAGTGAA AATATATGGT TATATTTCTT ACCAATAATA
TTTAAAGGAA AAAGAAGCCA TTATTTTCAA ATTATAATTT ATTCTATTAT GACCTATTTT
TTAACAACTT TTTGTATTCC AATCTTTATC CAAAAATTAA TTGACAATAG TTTGAATAAG
AAAGATATTG CTTACATTAG AGAATCAATT ATGTATTTAT TATTACTATT GCTGATTTAT
TTTTTGTCTA ATTTTATAAA AGGGTTACGT CTGGTTAAAC TTAAGGCTTT TGTTGACGAG
AATTTAAACA AAAAGGTTGT AGAAGATGTT TTGAAACTTC CTTACAAATT TTTTGATCTT
CGAAACAAAG CAGATATTTT GTTTAGTGTA AATAGTTGTT ATATCATACG AGAATTGTTT
ATAAATCAAA TGATAAACGG AATTATTGAC TGTGGAGCAG CGCTTTTTAT TATTTGCTAT
ATGTTTTCAC AATCTGTACA TTTAACATTT GTAGTAATTA TTTTATTTTT GTTCAATTTA
TTAGTAGTTA AATTTTCTCA ACCTGTTATA TTTGAAAATG GTAAATATTT ACTTAATGAA
CAAAGCAAAG CTCAAAGTGT AATTTCAGAA GCTATATTTT CTATTTTAGG TATAAAAATG
CATGCAATTG AAAATGAAAT TTATGCAATA TGGGAAAATA AATATAAAGA CTACATGAGT
AGGTATATTA ATTGGCAAAA AAGAAATAAT CTTATATTTA GTATTCAGTC TTTCATTCAA
ATGATATCGC CTATAGCTGT GTTATCTATA GGAGTGATCT TTACTATAAA ATCCCTGCTG
AGTGTAGGAC AGGTGATAGC ATTTTATTCT TTAAGCAATA CTTTTTTTTC ACTATCCCAG
TCGGTAGTTG ACACATGGCT TAGTTTTGTA AATAGTAGCC TATACTTAGA GAGACTAAGT
GATATAGTCA GATATGAAAA GGAAAGTGAG TCTGAAGAGT CTGTTAAAAT TAATGTGAAG
GGTAATATTG AATTAAGAAA TGTGTCTTTT TCGTATACAA AACATTCGGC TAAAGTAATA
AATAATATTT CGCTCAAAAT TGAACAAGGC AAGATGATAG CAATTGTTGG TAAGTCGGGG
GCAGGGAAGA GTACATTAGC AAAATTATTA GCGGGTTTAT ATAGTCCGTC TGAAGGTGAA
ATTTTATATG ATGGTATAGA CCTCAGAAGG TTAGATAAGA AATATATAAA AAAACAAATA
GGAATAGTTC CTCAGGATAT TATGCTTTTT AATAGAACAA TTTATGAAAA TATTGTTATG
AATAGAAAAG ATGTTACTCT TGAGGAAGTA AAAAAAGTTT GTCAAATCGC ACAAATTGAT
GATGATATTC AAAAGATGCC CATGGGTTAT TATACTATTA TTACCGAAAT GGGAGTAAAC
TTATCAGCTG GACAGCGACA AAGAATAGCT CTTGCAAGAG CACTTTTGAA TAAACCTAAA
ATAATTATTT TGGACGAAGC TACTAGCTCA TTAGATCCCA TTAATGAGAA GAAAATATTA
GATTATTTTA AAAATATGGG ATGTACCAGA ATTATTATTA CTCATAGGCT TTCATCTATT
ACTGATGCAG ATATTATTGT TGTATTAGAT GAAGGAAGAA TTGTTGAACA AGGAACACAC
GAAGAGTTAT TACGTAAAAA TGGAATGTAT ACAATTCTTT ATTACAATTA TAATAAGAAT
CAAGCTTATG CTGATTTAGA AAATACTAAA TCCAAAGGTT TAAACTGTAT ATGA
 
Protein sequence
MKAKKKHKRK KKVPYVEQLQ QAECGLCCVA MILRYYGSYF TLNNLREYLD IGRDGTTIRQ 
LVNLMNKLNL KTKLYECSTE GLYYIELPAI LFWEQRHFVI LERIDEKYAY IVDPACGRRK
LTVSELSSLY SNYAIYAYPN EKFVPKRKSE NIWLYFLPII FKGKRSHYFQ IIIYSIMTYF
LTTFCIPIFI QKLIDNSLNK KDIAYIRESI MYLLLLLLIY FLSNFIKGLR LVKLKAFVDE
NLNKKVVEDV LKLPYKFFDL RNKADILFSV NSCYIIRELF INQMINGIID CGAALFIICY
MFSQSVHLTF VVIILFLFNL LVVKFSQPVI FENGKYLLNE QSKAQSVISE AIFSILGIKM
HAIENEIYAI WENKYKDYMS RYINWQKRNN LIFSIQSFIQ MISPIAVLSI GVIFTIKSLL
SVGQVIAFYS LSNTFFSLSQ SVVDTWLSFV NSSLYLERLS DIVRYEKESE SEESVKINVK
GNIELRNVSF SYTKHSAKVI NNISLKIEQG KMIAIVGKSG AGKSTLAKLL AGLYSPSEGE
ILYDGIDLRR LDKKYIKKQI GIVPQDIMLF NRTIYENIVM NRKDVTLEEV KKVCQIAQID
DDIQKMPMGY YTIITEMGVN LSAGQRQRIA LARALLNKPK IIILDEATSS LDPINEKKIL
DYFKNMGCTR IIITHRLSSI TDADIIVVLD EGRIVEQGTH EELLRKNGMY TILYYNYNKN
QAYADLENTK SKGLNCI