Gene Athe_1399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1399 
Symbol 
ID7409142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1480215 
End bp1481780 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content39% 
IMG OID643715762 
ProductABC transporter related 
Protein accessionYP_002573270 
Protein GI222529388 
COG category[V] Defense mechanisms 
COG ID[COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 
TIGRFAM ID[TIGR01193] ABC-type bacteriocin transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000528778 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAAA GATACTATTG TGTAAAGCAG CATGATATAA CAGATTGTGC AGCAGCAAGC 
TTAGCAACAA TTTGTTTGCA ATATGGAAAA GAAGTATCGA TAGCCAGAAT AAGGCAGATG
GCAGGGACAG ACAGGTTTGG CACCACAGCA TATGGAGTTG TAAAAGCGGC AGAAAAGCTT
GGATTTGAAG CAAAAGCAGT CAGGGCAGAA GCAAAAGAAG CAATATTTGA AAAAATACCA
CTTCCGTGTA TAGCACATGT ACTGATAGAT GGCAAGCTAT TTCATTATGT TGTAATACAT
GAAATAAGAA AAGAAAGGAT AGTAATAGCA GACCCGGCAA AAGGAATAGT TAAGCTAAAT
CCAGAAGAGT TTTTTAAGAT ATGGACAGGC ATTTTGATAC TTCATGGGAT AGAAGTAGTA
AAGGCGTTTA ATATAGAAGA GGATGTAAAT TTTAAGACAG AGAGCAAGTT TGTAAAGCTA
TTAAAAGATG TGTTTAAGGT AGCGAATTTA AACAATTTGC AGAGTAATAT AAGCAGCGCA
ATAGCAGCGG TAGGAGTTAT GGTAATACTA TGGGTTGGTG CACACAAAGT AATAAATGGG
CAGATGAGCA TAGGGGAGTT GTTTACGTTC AATGCGCTGC TTGCATACTT TGTAGACCCG
ATAAAGAACT TGATAGGATT GCAGCCGATG TTGCAGACGG CGATAGTTGC AGCCGAAAGG
CTGAGCGAAA TTTTGGAACT TGAGAGTGAG TTTCAAGATG ACGAAGAAAG GAAATTATCA
CCCAGTTTGA AGGGTGATAT AGAGATAGAG GGTTTGAACT TCAGATACGG TACAAGACAG
CTTGTGCTGA GGGATATAAA TCTTAAGATA AGAAGAGGAG AGAAGATAGC GATAGTAGGT
GAGAGTGGTT CAGGTAAGAC CACGTTAGCA AAATTGCTTT TAGGATTTTA TGATTATGAA
AGCGGAGAGA TAAGAATAAA TGGCTATAAC TTAAAAGATA TAAACAAGAA GTATTTGAGG
GAGAAGATAG CATATATATC TCAAGACATC TTTTTGTTCA GTGGTACGAT ATTTGAGAAT
TTGGTATTAG GCAACAGGAA TATAAAAATG GAAGATATAA TTGAAATAAG CAGATTAACC
ACACTGGATG AGTTTGTATC AAAACTTCCT TTGAGATACA ATACCATGAT AGAAGAGAAT
GGAGCTAATT TGTCAGGCGG GCAGAAGCAG CTAATAGCAA TAACCAGGGC ACTTTTAAAA
AATCCTGAGA TAGTGATAAT GGATGAAGCG ACCAGTAATC TTGATTCTGT AACCGAGCAG
GCGATAGGAA AGGTAGTAGA GAAGGTATGT GAGGGGATAA CCACTATAAT AATAGCGCAC
AGGCTATCCA CTATTTTGAA ATGCGACAGG GTTGTGGTAA TGCATGAAGG CAGGATAGTA
GAGGTGGGAA CGCATGAGGA GCTGATGAGA AAGAAAGGGT ATTATTATAA CCTGTGGAGA
GAGCAACTGA TGGGACTTGA ACAAAAAGGG CTATGGGATT TGGTTGGGAG TGCAGCTGGA
GGATGA
 
Protein sequence
MRKRYYCVKQ HDITDCAAAS LATICLQYGK EVSIARIRQM AGTDRFGTTA YGVVKAAEKL 
GFEAKAVRAE AKEAIFEKIP LPCIAHVLID GKLFHYVVIH EIRKERIVIA DPAKGIVKLN
PEEFFKIWTG ILILHGIEVV KAFNIEEDVN FKTESKFVKL LKDVFKVANL NNLQSNISSA
IAAVGVMVIL WVGAHKVING QMSIGELFTF NALLAYFVDP IKNLIGLQPM LQTAIVAAER
LSEILELESE FQDDEERKLS PSLKGDIEIE GLNFRYGTRQ LVLRDINLKI RRGEKIAIVG
ESGSGKTTLA KLLLGFYDYE SGEIRINGYN LKDINKKYLR EKIAYISQDI FLFSGTIFEN
LVLGNRNIKM EDIIEISRLT TLDEFVSKLP LRYNTMIEEN GANLSGGQKQ LIAITRALLK
NPEIVIMDEA TSNLDSVTEQ AIGKVVEKVC EGITTIIIAH RLSTILKCDR VVVMHEGRIV
EVGTHEELMR KKGYYYNLWR EQLMGLEQKG LWDLVGSAAG G