Gene Athe_0051 details

Gene Information

Locus tag: Athe_0051
Symbol:
ID: 7407288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 67900
End bp: 70107
Gene Length: 2208 bp
Protein Length: 735 aa
Translation table: 11
GC content: 37%
IMG OID: 643714463
Product: O-antigen polymerase
Protein accession: YP_002571986
Protein GI: 222528104
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
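
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(67900, 70107)
print(length)                  # 2208
print(protein_length(length))  # 735
```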


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000539528
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA AAAGTGAGAA GAAAAGTATA TATCAAAGTG CAAATAAATT TGGTGAAGGT 
GGAACTTCAA TTTTTCCAGG CAAAACACTT GCGGCATACA AAGCATTTGT GCTTTTTGTC
TTTTGTGTGC TTGTTTTGAT GAGTCCATAT TATAGAGGAC TTTATTTTGA TTATGAACTA
AGTGTATTTC AAGCAGTTAT GGCTGGGATA TTTATTCTTT TTGCAATATA TCTTTATCTT
TCAAAAGAGG GTTTTTTAAT AAATTCAAAA CTTGAACTTA TGCTGCTTCT TTTTATGGTT
GCATATATTG TTCCCTACTT TTTTGCAGCA AACAGAAGGC TTGCTCTTGG AGAATTTTTC
AAGTATGCAT TTTACTTTGC AGTTTTTTAT GTTGCGTCAA GAATTTCAAA AGGCAAAGCA
GAAAAGTTTG CAATTTTGAA TACTCTTTTT CTCTCAACAG TAGGTGTTGC ATTTTTTGGG
TATCAAGCAG CGGTAAAGTT AATTCCAGAA ACTGCCCGCC CTCTTGGCAT GGCCATGAAC
GGGCTTTGGG TTGGAAATAT GATAAACTCA ACACTGCAGT ATCACAACAC AGCAGGAACT
GTGCTGGCGT TCGGATTTAT AATCTCTTTG ATGCTGGCAA TATATAGTAG AAATAAGCTG
CTTAAAAGCT TCTATTTTGC CTTTTCAAGC TTTATATTTA CAGCGTTTTT CTTTACATAC
TCAAGAGGCT CATATATTAC CCTCTTGCTT GCTCTTTTAG TGTTTTTCTT GCTTTTGCCG
AGAGAAAAAA GAATTTCGCT CATTTTTAAC ATAGCGATTG TTGGCGCTTT TGTTATTACT
TTTTTGAATA AGGTTGGGGC AAACCTAAAC GAACATGGGA AAGTAAAACT TTGGCTTGTC
TTGCTCTTCC AGATGCTTCT GGTTTTTGCC CTGACATATG CTTTTGGATT TGTGGAGAGA
AGACTTTATG GTATTAGCAA CAACATTTAT ATAGTGGCTG CAGGCGTTGT TGGCATTTTG
GCTATCATTG GTTTTGCCAT TGCTCTAAAG ATGCATTTGA TTCCTTCAGA CATGGTCGAG
AAAATAAAAT CCATAGCTAT GTTCTGGAAA GAGAGAAACT TTGTTGAAAG AATGGTGTTT
TACAGAGATG GTTTAAAGAT ATTCTTAAAA AGTCCTGTAT TCGGTTATGG TGGTGGGGCA
TGGGTATCGC TGTATTTTAT GTACCAGTCT TATTTATATT TTACAACCCA GTCTCACAAC
TATTTTTTGC AGGTGCTTCT TGACACGGGG ATTGTTGGAT TTAGTATACT TTTAGTGTTT
TTATGGCTTT TATTTTCTGC TTCGCTCAAG GCATGGGATA AAAAAGAACA AAAAGAGAAT
GTTATTATTG CTGGGCTTGT GGCTGCAGCT ATACAGCTTT ATTCTCACTC AGTGCTTGAC
TTTGACTTTT CGCTCGCATC TGTGCAAGTT CTGCTATTTG CAGCTTTAGG GGTATTAGTT
TCAACCTCTT TACAAATTCT TCAGAAGCAT AAGCAAGAAA AGGTGATTTA CACGAGCAGA
AAGACAAATT TTGTACCTGT TTTGCTGGCA ATATTTTATC TGTTTGTGAT AGTAATTTCA
TTGAATTTCA GACTTGGAAA TTACTATGCT AACATTGGTC AGCAGGCGCT GCAAGCAGGG
AATTTGTCTG CTGCATATTC GTTTTTGTCA AAAGCTGTCA CATACGACTC ACTCAATTCC
AATGCGCTTT CAGACTATGC GGTTGCTCTA TACAGAATAG GTGACCAGAA CAAAGATGCA
AACCTGATTG CGAAGGCAGA CGGTTATTTC AGACAGGCAA TTGTAAATGA CAGGTTCAAT
CCAAAGATAA GGTTTAAATA TGCCGTATAT CTTCTTTCTC ATGGAGCAAT AGATAGCGGA
CTTTCACAGA TAGAAGAGGG GATAAAGCTT CAGCCTCTTC AGCCAGCAAA CTATGAGCTG
AAGGCTGATG CATATGCAAA GGTTGGAGAT TATTACCTTG GAAAAGGTGA TAAAGAAAAA
GCGAAGAAGT ATTTTGAAGT TGTGTTAAAG ATTCCTGAGG AAATTGAGAG ATTGAAAAAG
TACAGAGAAC ACATTCCAAA AGAGCTAATT GGCCAAGAAA AAATTGTGCC GTTTGCGATG
ACACAAAGAA CTCAGCAAAT AATTGAAGAA GTCAAGAAAA AGATATAG
 
Protein sequence
MAKKSEKKSI YQSANKFGEG GTSIFPGKTL AAYKAFVLFV FCVLVLMSPY YRGLYFDYEL 
SVFQAVMAGI FILFAIYLYL SKEGFLINSK LELMLLLFMV AYIVPYFFAA NRRLALGEFF
KYAFYFAVFY VASRISKGKA EKFAILNTLF LSTVGVAFFG YQAAVKLIPE TARPLGMAMN
GLWVGNMINS TLQYHNTAGT VLAFGFIISL MLAIYSRNKL LKSFYFAFSS FIFTAFFFTY
SRGSYITLLL ALLVFFLLLP REKRISLIFN IAIVGAFVIT FLNKVGANLN EHGKVKLWLV
LLFQMLLVFA LTYAFGFVER RLYGISNNIY IVAAGVVGIL AIIGFAIALK MHLIPSDMVE
KIKSIAMFWK ERNFVERMVF YRDGLKIFLK SPVFGYGGGA WVSLYFMYQS YLYFTTQSHN
YFLQVLLDTG IVGFSILLVF LWLLFSASLK AWDKKEQKEN VIIAGLVAAA IQLYSHSVLD
FDFSLASVQV LLFAALGVLV STSLQILQKH KQEKVIYTSR KTNFVPVLLA IFYLFVIVIS
LNFRLGNYYA NIGQQALQAG NLSAAYSFLS KAVTYDSLNS NALSDYAVAL YRIGDQNKDA
NLIAKADGYF RQAIVNDRFN PKIRFKYAVY LLSHGAIDSG LSQIEEGIKL QPLQPANYEL
KADAYAKVGD YYLGKGDKEK AKKYFEVVLK IPEEIERLKK YREHIPKELI GQEKIVPFAM
TQRTQQIIEE VKKKI
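
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch in plain Python, not the method used to generate this record. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons), and it does not special-case alternative start codons; this gene begins with ATG, so that does not matter here. The `translate` helper is hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# are the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence in this record
print(translate("ATGGCAAAAA AAAGTGAGAA GAAAAGTATA"))  # MAKKSEKKSI
print(translate("ACACAAAGAA CTCAGCAAAT AATTGAAGAA GTCAAGAAAA AGATATAG"))  # TQRTQQIIEEVKKKI
```

Each full line of the gene sequence holds 60 bases, a multiple of three, so every line starts in frame and can be translated on its own, as in the two calls above.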