Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0051 |
Symbol | |
ID | 7407288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 67900 |
End bp | 70107 |
Gene Length | 2208 bp |
Protein Length | 735 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714463 |
Product | O-antigen polymerase |
Protein accession | YP_002571986 |
Protein GI | 222528104 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000539528 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAA AAAGTGAGAA GAAAAGTATA TATCAAAGTG CAAATAAATT TGGTGAAGGT GGAACTTCAA TTTTTCCAGG CAAAACACTT GCGGCATACA AAGCATTTGT GCTTTTTGTC TTTTGTGTGC TTGTTTTGAT GAGTCCATAT TATAGAGGAC TTTATTTTGA TTATGAACTA AGTGTATTTC AAGCAGTTAT GGCTGGGATA TTTATTCTTT TTGCAATATA TCTTTATCTT TCAAAAGAGG GTTTTTTAAT AAATTCAAAA CTTGAACTTA TGCTGCTTCT TTTTATGGTT GCATATATTG TTCCCTACTT TTTTGCAGCA AACAGAAGGC TTGCTCTTGG AGAATTTTTC AAGTATGCAT TTTACTTTGC AGTTTTTTAT GTTGCGTCAA GAATTTCAAA AGGCAAAGCA GAAAAGTTTG CAATTTTGAA TACTCTTTTT CTCTCAACAG TAGGTGTTGC ATTTTTTGGG TATCAAGCAG CGGTAAAGTT AATTCCAGAA ACTGCCCGCC CTCTTGGCAT GGCCATGAAC GGGCTTTGGG TTGGAAATAT GATAAACTCA ACACTGCAGT ATCACAACAC AGCAGGAACT GTGCTGGCGT TCGGATTTAT AATCTCTTTG ATGCTGGCAA TATATAGTAG AAATAAGCTG CTTAAAAGCT TCTATTTTGC CTTTTCAAGC TTTATATTTA CAGCGTTTTT CTTTACATAC TCAAGAGGCT CATATATTAC CCTCTTGCTT GCTCTTTTAG TGTTTTTCTT GCTTTTGCCG AGAGAAAAAA GAATTTCGCT CATTTTTAAC ATAGCGATTG TTGGCGCTTT TGTTATTACT TTTTTGAATA AGGTTGGGGC AAACCTAAAC GAACATGGGA AAGTAAAACT TTGGCTTGTC TTGCTCTTCC AGATGCTTCT GGTTTTTGCC CTGACATATG CTTTTGGATT TGTGGAGAGA AGACTTTATG GTATTAGCAA CAACATTTAT ATAGTGGCTG CAGGCGTTGT TGGCATTTTG GCTATCATTG GTTTTGCCAT TGCTCTAAAG ATGCATTTGA TTCCTTCAGA CATGGTCGAG AAAATAAAAT CCATAGCTAT GTTCTGGAAA GAGAGAAACT TTGTTGAAAG AATGGTGTTT TACAGAGATG GTTTAAAGAT ATTCTTAAAA AGTCCTGTAT TCGGTTATGG TGGTGGGGCA TGGGTATCGC TGTATTTTAT GTACCAGTCT TATTTATATT TTACAACCCA GTCTCACAAC TATTTTTTGC AGGTGCTTCT TGACACGGGG ATTGTTGGAT TTAGTATACT TTTAGTGTTT TTATGGCTTT TATTTTCTGC TTCGCTCAAG GCATGGGATA AAAAAGAACA AAAAGAGAAT GTTATTATTG CTGGGCTTGT GGCTGCAGCT ATACAGCTTT ATTCTCACTC AGTGCTTGAC TTTGACTTTT CGCTCGCATC TGTGCAAGTT CTGCTATTTG CAGCTTTAGG GGTATTAGTT TCAACCTCTT TACAAATTCT TCAGAAGCAT AAGCAAGAAA AGGTGATTTA CACGAGCAGA AAGACAAATT TTGTACCTGT TTTGCTGGCA ATATTTTATC TGTTTGTGAT AGTAATTTCA TTGAATTTCA GACTTGGAAA TTACTATGCT AACATTGGTC AGCAGGCGCT GCAAGCAGGG AATTTGTCTG CTGCATATTC GTTTTTGTCA AAAGCTGTCA CATACGACTC ACTCAATTCC AATGCGCTTT CAGACTATGC GGTTGCTCTA TACAGAATAG GTGACCAGAA CAAAGATGCA AACCTGATTG CGAAGGCAGA CGGTTATTTC AGACAGGCAA TTGTAAATGA CAGGTTCAAT CCAAAGATAA GGTTTAAATA TGCCGTATAT CTTCTTTCTC ATGGAGCAAT AGATAGCGGA CTTTCACAGA TAGAAGAGGG GATAAAGCTT CAGCCTCTTC AGCCAGCAAA CTATGAGCTG AAGGCTGATG CATATGCAAA GGTTGGAGAT TATTACCTTG GAAAAGGTGA TAAAGAAAAA GCGAAGAAGT ATTTTGAAGT TGTGTTAAAG ATTCCTGAGG AAATTGAGAG ATTGAAAAAG TACAGAGAAC ACATTCCAAA AGAGCTAATT GGCCAAGAAA AAATTGTGCC GTTTGCGATG ACACAAAGAA CTCAGCAAAT AATTGAAGAA GTCAAGAAAA AGATATAG
|
Protein sequence | MAKKSEKKSI YQSANKFGEG GTSIFPGKTL AAYKAFVLFV FCVLVLMSPY YRGLYFDYEL SVFQAVMAGI FILFAIYLYL SKEGFLINSK LELMLLLFMV AYIVPYFFAA NRRLALGEFF KYAFYFAVFY VASRISKGKA EKFAILNTLF LSTVGVAFFG YQAAVKLIPE TARPLGMAMN GLWVGNMINS TLQYHNTAGT VLAFGFIISL MLAIYSRNKL LKSFYFAFSS FIFTAFFFTY SRGSYITLLL ALLVFFLLLP REKRISLIFN IAIVGAFVIT FLNKVGANLN EHGKVKLWLV LLFQMLLVFA LTYAFGFVER RLYGISNNIY IVAAGVVGIL AIIGFAIALK MHLIPSDMVE KIKSIAMFWK ERNFVERMVF YRDGLKIFLK SPVFGYGGGA WVSLYFMYQS YLYFTTQSHN YFLQVLLDTG IVGFSILLVF LWLLFSASLK AWDKKEQKEN VIIAGLVAAA IQLYSHSVLD FDFSLASVQV LLFAALGVLV STSLQILQKH KQEKVIYTSR KTNFVPVLLA IFYLFVIVIS LNFRLGNYYA NIGQQALQAG NLSAAYSFLS KAVTYDSLNS NALSDYAVAL YRIGDQNKDA NLIAKADGYF RQAIVNDRFN PKIRFKYAVY LLSHGAIDSG LSQIEEGIKL QPLQPANYEL KADAYAKVGD YYLGKGDKEK AKKYFEVVLK IPEEIERLKK YREHIPKELI GQEKIVPFAM TQRTQQIIEE VKKKI
|
| |