Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1143 |
Symbol | |
ID | 3833242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1175268 |
End bp | 1176812 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637829074 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_430000 |
Protein GI | 83589991 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | [TIGR02900] stage V sporulation protein B |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGGG TATCCATCTG GCAAGGGACA GTTATCCTGA TGGTGGCCAG CTTATTGAAC CGCATCCTGA GCTTTGGCTA CCGAATGCTG GTTGTGCGCT ACATAGGGGC CGAAGGTATG GGCCTCTATG AAATGGTTTT CCCTTTTTAT AGCCTGGTGC TTATGGTCAC TACCGCCGGC ATACCCGTTG CCCTGGCTAA ACTAACAGCC GAAAGGATAG CCCTGGCCCG CTGGGGACAG GTACGCAGCG TCTTTCGCCT ATCCCTTATT TTTTTGACCC TTAGCGGCTT ACTGGCCGCC TTGGTGTTGT GGCGGCTGGC TCCCTTTTTA ACCGGCAGGA TGTTTGCCGA TACCCGGGTT TACCAGGCCT TTGTGGTGAT GATCCTGGCC CTGCCGGTAG TCTGTATTTG TTCGGCCTTT CGGGGCTACT TCCAGGGCTG GCAGTTGATG CGACCGGTGG CTCTGGCCCA GGTTGTTGAA CAGGTTGTGC GGGTGAGTGC CGGGTTTTTC CTGGGCATTT ATTTGCTCCC CTACGGCGTG GCCATGGCTG CGGCCGGCCT GGCGGCAGGT ATGGTTTTGG GAGAACTGGC AGGCCTGGGG ATTAGCGTCT TTATCTTTAA CCTGGCCCGG CCGTATTACG ATATCGCTGC CGACCAGACA GGCTCGTTGA AAGCAGATAT TCTTCCCCTG GTCCGTTTGG CCATCCCGGT GATGCTGGCC CGTATGGCCG GAGGTATTAT GTTAACTATT GAAGCCTTGT TAATCCCTCG CCAGCTCCAG GCCTGGGGGG TGACCATGCG GGAGGCCACA ACCATTTACG GCCAGTATGC AGGCATTGCC TTTACTTTGA TATACCTGCC CATGGTTATC ACTGTGGCCC TGGCCATGAC CATGGTGCCC GCCATTTCTG AGGCTAGGGC AGTTGGGGAT TGCGATTTAT TAAATAAACG CTGCCGGCAG TCTTTAAAGA TGACTATTTA TAGCAGCCTG CCCTTTGCCA TAACCTTTTA CCTCTTTGCC GCCCCCATTT GCGGCCTTAT CTTCGCCACT CCTGAGGCCG GTATACCCCT GAAAATCCTC GCCTGGGGGA GTATCTTTAT CTACCTGGAG CAGACTACCG TGGGCATCCT GAATGGCCTG GGGTCCATGT CTACCATCCT CTGGACGACT GTCGCCGGAG GTATTGTCGA CCTCCTGGGG ATCTATTACC TGACGCCGGT GCTGGGTATT GCCGGCGCCG CCCTTGGGGT AAACCTGGGA ACTGCCGTTA CCGCCATCCT GAACCTTCTG GCCCTGATCC GGCAGACCGG TTTTCACCTG GACTTCCGGA CCTTTGTTTA CTGGCCAGCC GTAGCCGGGG CGGGAATGTT CCTTGGGGCT TCCCTTCTCT GGCGGCTGCT GGTAGCTACA CCGGAACCAT GGCGCCTGTT CCAGACCCTG GCCGGTAGCA GTTTGTTTTA CCTGCTGATT CTCCTGGTAG CGGGAGAAAT ATCCCCCGGG CACTTTTACC TATTCCCCTG GCCGGGCCAG AGGAACGACA AATAA
|
Protein sequence | MAGVSIWQGT VILMVASLLN RILSFGYRML VVRYIGAEGM GLYEMVFPFY SLVLMVTTAG IPVALAKLTA ERIALARWGQ VRSVFRLSLI FLTLSGLLAA LVLWRLAPFL TGRMFADTRV YQAFVVMILA LPVVCICSAF RGYFQGWQLM RPVALAQVVE QVVRVSAGFF LGIYLLPYGV AMAAAGLAAG MVLGELAGLG ISVFIFNLAR PYYDIAADQT GSLKADILPL VRLAIPVMLA RMAGGIMLTI EALLIPRQLQ AWGVTMREAT TIYGQYAGIA FTLIYLPMVI TVALAMTMVP AISEARAVGD CDLLNKRCRQ SLKMTIYSSL PFAITFYLFA APICGLIFAT PEAGIPLKIL AWGSIFIYLE QTTVGILNGL GSMSTILWTT VAGGIVDLLG IYYLTPVLGI AGAALGVNLG TAVTAILNLL ALIRQTGFHL DFRTFVYWPA VAGAGMFLGA SLLWRLLVAT PEPWRLFQTL AGSSLFYLLI LLVAGEISPG HFYLFPWPGQ RNDK
|
| |