Gene Information |
Locus tag | Moth_0700 |
Symbol | |
ID | 3832701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 730282 |
End bp | 731760 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637828632 |
Product | ABC transporter related |
Protein accession | YP_429562 |
Protein GI | 83589553 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
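The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. Neither convention is stated in the record, but both agree with the values listed. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 730282
END_BP = 731760
GENE_LENGTH_BP = 1479
PROTEIN_LENGTH_AA = 492


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def record_is_consistent() -> bool:
    """True when the listed lengths match those derived from the coordinates."""
    return (
        span_length(START_BP, END_BP) == GENE_LENGTH_BP
        and expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    )
```

Under these assumptions, 731760 - 730282 + 1 gives 1479 bp, and 1479 / 3 - 1 gives 492 aa, matching the Gene Length and Protein Length fields.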
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000766615 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000015345 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAAAAG TTTTGGAATT AAAAAATATC GTTAAAAGAT ATGGCCAGGT GCCGGTTTTA AAAGGTGTAG ATTTTGATCT TTATGCCGGA GAGGTACATG CCATTGTTGG TCAGAATGGG GCTGGAAAGA GTACTTTAAT GAAAATTTTA GCTGGCGTTA TCACTGATTA TGAAGGAACT GAGGTTTTAA AAGGGAAACC TGTCAGGTTT CGTTCAGCTG GCGAAGCTCA GGTCCATGGC ATTGGCATGG TACATCAGGA ACTGAGCGTT ATTCTAAAAC TTTCTGTAGC TGAGAATTTA TTTATTGGTA CACCCGGTAG TAAAAAAACC TTTGTAAACT GGTGGGAAAT GGAGAATAAA GCTCGACAAT TACTAAAGGA TTTTGGCCTG GAAAGGATTA ATGTAAAACG TCCCCTGGGT AGTTACCTCT TGGGTATTCA ACAAATGATT GAAATCATTC GTACTATCCA TTCTGGCGCT AAAATCATCA TAATGGATGA ACCGACCTCG GCTCTATCAC CACCCGAGGT AAAGCGGTTG TTTGAACTTA TCGGCCGGTT GAAACAGGCC GGTACTAGCA TTATCTTTAT TTCCCATTTT CTTGATGACG TCTTGGACAT CGCTGATCGG ATTACCGTTC TCCGGGATGG ACGTAAGATA ACAACCTTGG AGAATAAGGA TATTAATAAG GCGGAACTTA TCAGATTAAT GCTTGGTAGC AGTGAGGGTA TAAGTGAAAA CACGGAGATC GAGCTATCAG CTAGTGAAAA GGAACCTGTT TTGGAGATCA AGGATCTGAG TTGTAGGCGT TTATTTAGGG ACGTAGCCTT TAGCGTTGGG AAAGGTGAAG TGGTAGGACT TTTTGGCTAT ATGGGTGCCG GCCATATGGA ATTACCACGA GTGTTATTTG GTCTTGAAGT ACCGGAAAAA GGACGGGTGA TTCTACAAGG AAAAGAAGTC AAAATTAAAT CTCCGGGTCA TGCCAGAAGT TTGGGGTTAG CTTATGCTCC GGAAAGCAGA AAAAAGGCCC TGTGCCTTAC AAAACCTATT TACGCTAATA TAACTCTACC TTTCCTGGCA ACTATCGGCA GGTTTGTTAA TAATCGTACG CGGGAACTAG AGATCAGCCG CCAATTAATC GAACGCACTG CCCTTAGACC ACCAAAACCC CTTTTAAATG TCGGTAATCT CAGTGGAGGT AATCAGCAGA AAGTCTCAGT CTCCCGTTGG TTACCTACTC ATCCTATCGT TTTTATTCTC AGCGAACCTA CCAGGGGCAT GGATGTCGGA GCCAAAGAAG AGATAATTAA TCTAGTCCGT GACCTTAAGG CCCAAGGTAT GGGAATCATT GTTGCTTCCT CAGAACCAGA AACGATTTTT GCTCTGGCCG ATCGCATATT GGTGTTCTCG AAAGGTAAAA TTGTGCATGA GTTTAAGCAA GGTAAAGTCA ATAAAGAGCT TTTATTTCAG TATGCTTAA
|
Protein sequence | MEKVLELKNI VKRYGQVPVL KGVDFDLYAG EVHAIVGQNG AGKSTLMKIL AGVITDYEGT EVLKGKPVRF RSAGEAQVHG IGMVHQELSV ILKLSVAENL FIGTPGSKKT FVNWWEMENK ARQLLKDFGL ERINVKRPLG SYLLGIQQMI EIIRTIHSGA KIIIMDEPTS ALSPPEVKRL FELIGRLKQA GTSIIFISHF LDDVLDIADR ITVLRDGRKI TTLENKDINK AELIRLMLGS SEGISENTEI ELSASEKEPV LEIKDLSCRR LFRDVAFSVG KGEVVGLFGY MGAGHMELPR VLFGLEVPEK GRVILQGKEV KIKSPGHARS LGLAYAPESR KKALCLTKPI YANITLPFLA TIGRFVNNRT RELEISRQLI ERTALRPPKP LLNVGNLSGG NQQKVSVSRW LPTHPIVFIL SEPTRGMDVG AKEEIINLVR DLKAQGMGII VASSEPETIF ALADRILVFS KGKIVHEFKQ GKVNKELLFQ YA
|
| |