Gene Athe_0251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0251 
Symbol 
ID7407568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp302429 
End bp304633 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content39% 
IMG OID643714651 
ProductABC-type bacteriocin transporter 
Protein accessionYP_002572174 
Protein GI222528292 
COG category[V] Defense mechanisms 
COG ID[COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 
TIGRFAM ID[TIGR01193] ABC-type bacteriocin transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000911417 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAAA GATACTATTG TGTAAAGCAG CATGATATAA CAGATTGTGC AGCAGCAAGC 
TTAGCAACAA TTTGTTTGCA ATATGGAAAA GAAGTATCGA TAGCCAGAAT AAGGCAGATG
GCAGGTACAG ACAGGTTTGG CACCACAGCA TATGGAGTTG TAAAAGCGGC AGAAAAGCTT
GGATTTGAAG CAAAAGCAGT CAGGGCAGAA GCAAAAGAAG CAATATTTGA AAAAATACCA
CTTCCGTGTA TAGCACATGT ACTGATAGAT GGCAAGCTAT TTCATTATGT TGTAATACAT
GAAATAAGAA AAGAAAGGAT AGTAATAGCA GACCCGGCAA AAGGAATAGT TAAGCTAAAT
CCAGAAGAGT TTTTTAAGAT ATGGACAGGC ATTTTGATAC TTCTTGTACC AAATGAGAGG
TTCAAGAAAG GAAAGCAAGA GGGAGTTTTG AAAAAGTTTT TCAAGTTATT GAGACCACAG
AAGGACTTGA TATTGAATAT TTTTGCAGTG TCAGTTGTAT ATACCTTGCT TGGGATAGCG
GCAGCATTTT ATTACAAGTT TTTGATGGAT GATGTAATAC CGAATCTTTT GAAGAATACA
CTTCACATTA TAGCAGCAGG TGCGATACTG ATTACAATAT TCAAGGTGAT ATTAGGGGCA
TTCAGGGTAA GGCTTTTGAT ACATTTAAGT CAAAGATTAG ATATTAAGCT GATGCTGGGA
TATTATGAAC ATGTGATAGA GCTTCCGATG AGTTTTTTTG GTAGCAGGAA GATAGGGGAG
ATAATATCGA GGTTTATGGA CGCATCAAAG ATAAGGGATG CAGTATCTGG TGCAACTTTG
ACGCTAATGA TAGACAGCAT AATGGCAGTG GCAGGCGCAA GTATTTTATA CCTGCAAAAT
TCCACACTGT TTTTTATAGC GCTTGTGATG GTTTTGCTAT ACGCAGCAGT AGTGTTTGGA
TTTAACAGGG TATTGAAGGA AGCGAACAGG CAGGAGATGG AAGACAATGC AATTTTGACA
TCGTATTTGG TAGAGTCACT GAATGGGATA GAAGTAGTAA AGGCATTTAA TATAGAAGAG
GATGTAAATT TTAAGACAGA GAGCAAGTTT GTAAAGCTAT TAAAAGATGT GTTTAAGGTA
GCGAATTTAA ACAATTTGCA GAGTAATATA AGCAGCGCAA TAGCAGCGGT AGGAGTTATG
GTAATACTAT GGGTTGGTGC ACACAAAGTA ATAAATGGGC AGATGAGCAT AGGGGAGTTG
TTTACGTTCA ATGCACTGCT TGCATACTTT GTAGACCCGA TAAAGAACTT GATAGGATTG
CAGCCGATGT TGCAGACGGC GATAGTTGCA GCCGAAAGGC TGAGCGAAAT TTTGGAACTT
GAGAGTGAGT TTCAAGATGA CGAAGAAAGG AAATTATCAC CCAGTTTGAA GGGTGATATA
GAGATAGAGG GTTTGAACTT CAGATACGGT ACAAGACAGC TTGTGCTGAG GGATATAAAT
CTTAAGATAA GAAGAGGAGA GAAGATAGCG ATAGTAGATG AAAGTGGTTC AGGTAAGACC
ACGTTAGCAA AATTGCTTTT AGGATTTTAT GATTATGAAA GCGGAGAGAT AAGAATAAAT
GGCTATAACT TAAAAGATAT AAACAAGAAG CATTTGAGGG AGAAGATAGC ATATATATCT
CAAGACATCT TTTTGTTCAG TGGTACGATA TTTGAGAATT TGGTATTAGG CAACAGGAAT
ATAAAAATGG AAGATATAAT TGAAATAAGC AGATTAACCA CACTGGATGA GTTTGTATCA
AAACTTCCTT TGAGATACAA TACCATGATA GAAGAGAATG GAGCTAATTT GTCAGGCGGG
CAGAAGCAGC TAATAGCAAT AACCAGGGCA CTTTTAAAAA ATCCTGAGAT AGTGATAATG
GATGAAGCGA CCAGTAATCT TGATTCTGTA ACCGAGCAGG CGATAGGAAA GGTAATAGAG
AAGGTATGTG AGGGGATAAC CACTATAATA ATAGCGCACA GGCTATCCAC TATTTTGAAA
TGCGACAGGG TTGTGGTAAT GCATGAAGGC AGGATAGTAG AGGTGGGAAC GCATGAGGAG
CTGATGAGAA AGAAAGGGTA TTATTATAAC CTGTGGAGAG AGCAACTGAT GGGACTTGAG
CAAAAAGGGC TATGGGATTT GGTTGGGAGT GCAGCTGGAG GATGA
 
Protein sequence
MRKRYYCVKQ HDITDCAAAS LATICLQYGK EVSIARIRQM AGTDRFGTTA YGVVKAAEKL 
GFEAKAVRAE AKEAIFEKIP LPCIAHVLID GKLFHYVVIH EIRKERIVIA DPAKGIVKLN
PEEFFKIWTG ILILLVPNER FKKGKQEGVL KKFFKLLRPQ KDLILNIFAV SVVYTLLGIA
AAFYYKFLMD DVIPNLLKNT LHIIAAGAIL ITIFKVILGA FRVRLLIHLS QRLDIKLMLG
YYEHVIELPM SFFGSRKIGE IISRFMDASK IRDAVSGATL TLMIDSIMAV AGASILYLQN
STLFFIALVM VLLYAAVVFG FNRVLKEANR QEMEDNAILT SYLVESLNGI EVVKAFNIEE
DVNFKTESKF VKLLKDVFKV ANLNNLQSNI SSAIAAVGVM VILWVGAHKV INGQMSIGEL
FTFNALLAYF VDPIKNLIGL QPMLQTAIVA AERLSEILEL ESEFQDDEER KLSPSLKGDI
EIEGLNFRYG TRQLVLRDIN LKIRRGEKIA IVDESGSGKT TLAKLLLGFY DYESGEIRIN
GYNLKDINKK HLREKIAYIS QDIFLFSGTI FENLVLGNRN IKMEDIIEIS RLTTLDEFVS
KLPLRYNTMI EENGANLSGG QKQLIAITRA LLKNPEIVIM DEATSNLDSV TEQAIGKVIE
KVCEGITTII IAHRLSTILK CDRVVVMHEG RIVEVGTHEE LMRKKGYYYN LWREQLMGLE
QKGLWDLVGS AAGG