Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1253 |
Symbol | |
ID | 3833048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1294752 |
End bp | 1297250 |
Gene Length | 2499 bp |
Protein Length | 832 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829189 |
Product | O-antigen polymerase |
Protein accession | YP_430110 |
Protein GI | 83590101 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0517425 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACATA AAAAGAAAAT TAAAGACAAC GGCAGGGGAA AGGTTATTTC CCTGGAGAGT GCCGTCAATA AAGGGCGAAA AACCAAGACG GATAATAAGG CGCACGGGGA AAGTGACGCT GGTAGCCAGG TGTCCCCCTG GCGCCGCCGG ACGGAAGATC AGGTCGCTGA AGAGACCTTC TGGCCGGATA AGGGTAATCG AACTTTATTT GCCCTGGCCT TTAGCGGGCT AATTTTGCTT CTTTTCTACC CGCCCTTCTT TCGCGGCCTG TTCTTCCCGG TGGAACAGCG CTGGACGCTG ATACTGGCTG CCGTCATCTT TTTCTTCGCT TATCTGTGGA AGTTTTCCCG CCGGGAGATC GCCTTTCTCA ACCGGCCCCT TGATTTCGCG GCGGCTGCCC TGGTAGTGGT CTATATCCTG GCGGCCATAA AACCGGCTAG CCGGGGCCTG GCCATGGCCG AGATCGCGAA GGTGCTGCTC TATTTTTTGA CCTTCTGGCT TGTTTCGCGC CTGGGAGGGC AGCGCCGAAC CCTTTACCTG CTGCATGCCC TTTACCTGGC CGGTGTTGGT GCGGCTCTGG CCGCCCTCCT CGCGGCTACC GATCTGGTTT ACATTAAAGA CGGGTTTGTT GGCGGTCGTT TTTTCTCCAC CCTGCAATAT CCCAACGCCC TGGCCAGTTA TGTCGGCGCC GCCAGCATCA TCGGTTTTTA TCTCTGGGCA TGGTCAGGAA ACCGGTGGCG CTATGGTTAC GCTGCCGCCA ACTACCTCTT GCTCATGGTC TTCCTGGGGA CTGGCTCCCG TGGGGCCTAC CTGATTTTCC CGGCGGTGGT TTTCCTGTAC TGGTTGCTGG CGCCGGGCGG CTACCGGCTG AATACCCTGG CCCACCTGGT GGCCTGCGGC GCCGCCGCCC TGTTGGGCGT TGCCCGTTTT ATCCCCCTGG CCTTAGCCAA GGCCCACGGG CAGGCCTGGG GCTGGTTTAC CCTGGGGCTG ATGGTGGCCC TAGTTGGCCA GTTGCTTATC CAGGGGGCCG GAAGGGTTTT AACGACCCCC CGGGCCAGGA TGGCAGCCGG TTTGGTTATA CTGGTCATCC TGGTAGGAGG CGCTGCCGTC TTTGTCCTGC ACCAGCCGGG AATTAGCGCG GCATCTACCG GGGGACAGGT CCCCGGCGTC CTGGGCAAAA TCCTGCCGCC CCAGGTTGTA TCTCGCCTCA AGGATATTAA CCTTAAGACC AGGAGCAGCC GGGAGAGGAT TATCTGGACC CAGGATGCCC TGCAGATGGT GCGGCAGCGC CCCATCCTGG GCTTTGGCGG CGGCGGCTGG GAAGCAGCCT ATCGCCAGTA CCAGCGGTAC TACTACAACT CCACCCAGGT GCATAACGAT TATGCCCAGG TGGCCGTAGA AGCAGGTCTG GTCGGTCTGG TCGTCCTGGC GGCGGTGTGG TTGTTGTTCC TCCTGGTCAC GGCCGGCAAC TACCGACATA GCCAGGGTCA AGGTCGCCTC CAGGCCCTGG CCGTCGGCGC AGCTGCCGTC AACCTGGGAC TTCACGCGGC CATTGACTTT GACCTGGCCC TGGGGGCGGT GTCGATAATG CTCTGGGCCT GTTTCGGCCT GGCCCGTAGC CTTGAAGGCC AGCGTTTGGA GCCGGAGCCG GCCCTCCCAC CGTTAATTTT TAAGAACCGG CAATTGCCCT GTATAACAGC CGTCTCCCTG GCAACCCTGG TGCTGATTCT ATTTGCCGGC TCGTACCTGG CCGGGGTTTC CAGTTACCGC CAGGCTACTG CCGCCCTGCA GCAAAACAAC CTCCCAGCTG CGGCAGCTTA CCTGGAAGAG GCCAGCCGTT ACGATCCCTT TACAGCTTCT TATAACAGCG ATCTGGCAGG TATTTATCTA CAAGAAGGAA AAACTAAGGA GGCCCTGAAC CAGGCCCTGG CTGCCAGCGC CAAAGAACCT TATAATCTGG CAATCTTAAA CCGCCTGGCT GAAGTCTACT GGCAGGAGGG GTCGGCTCAG GAGGCCGTGG CCACCATGGA AAGGGCCCGG CAACTGGCCC CCTGGGTCGG CGCTGTCTGG GAAAACCTGG GTCAGGTTTA CGATGCCGCC GGCATCAGTT ACCTTCAGGC CGGACAGAAG GATAGGGCCC GGCAAATGTT CCAGGAAGCG GCGGCCCTGC CGGAGTCTAT CCAAGCCAAG GTGGATACCC TGGGAGATTT TAAAGACCTC CATCAACCCG GCGGAGTAGC CCTCAGTCCC GCCATTCAGC TCCGCGCAGG GATCGCCCAG TACTTCCTGG GTCAGGAAAA AGAAGCCGCT ATTAATCTTG AGGCGGCAGC AAGGGACGCC AACCTGCAGG CCGAGGCCCG GCTATGGCAG GCCGTCATGG CCTTTCACCA TGGTGACGGC CTCCTGTCTA GCCGGTTGCT GGCCGAAGTC CAGAAGACCA ACACCAGCCT GGCTAAGGAA TACGACCAGT TAAAAACTCT GCCTGTTCTC TCTAAATAG
|
Protein sequence | MAHKKKIKDN GRGKVISLES AVNKGRKTKT DNKAHGESDA GSQVSPWRRR TEDQVAEETF WPDKGNRTLF ALAFSGLILL LFYPPFFRGL FFPVEQRWTL ILAAVIFFFA YLWKFSRREI AFLNRPLDFA AAALVVVYIL AAIKPASRGL AMAEIAKVLL YFLTFWLVSR LGGQRRTLYL LHALYLAGVG AALAALLAAT DLVYIKDGFV GGRFFSTLQY PNALASYVGA ASIIGFYLWA WSGNRWRYGY AAANYLLLMV FLGTGSRGAY LIFPAVVFLY WLLAPGGYRL NTLAHLVACG AAALLGVARF IPLALAKAHG QAWGWFTLGL MVALVGQLLI QGAGRVLTTP RARMAAGLVI LVILVGGAAV FVLHQPGISA ASTGGQVPGV LGKILPPQVV SRLKDINLKT RSSRERIIWT QDALQMVRQR PILGFGGGGW EAAYRQYQRY YYNSTQVHND YAQVAVEAGL VGLVVLAAVW LLFLLVTAGN YRHSQGQGRL QALAVGAAAV NLGLHAAIDF DLALGAVSIM LWACFGLARS LEGQRLEPEP ALPPLIFKNR QLPCITAVSL ATLVLILFAG SYLAGVSSYR QATAALQQNN LPAAAAYLEE ASRYDPFTAS YNSDLAGIYL QEGKTKEALN QALAASAKEP YNLAILNRLA EVYWQEGSAQ EAVATMERAR QLAPWVGAVW ENLGQVYDAA GISYLQAGQK DRARQMFQEA AALPESIQAK VDTLGDFKDL HQPGGVALSP AIQLRAGIAQ YFLGQEKEAA INLEAAARDA NLQAEARLWQ AVMAFHHGDG LLSSRLLAEV QKTNTSLAKE YDQLKTLPVL SK
|
| |