Gene Information |
Locus tag | Moth_0241 |
Symbol | |
ID | 3832569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 239756 |
End bp | 243187 |
Gene Length | 3432 bp |
Protein Length | 1143 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828177 |
Product | BNR repeat-containing glycosyl hydrolase |
Protein accession | YP_429119 |
Protein GI | 83589110 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
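The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(239756, 243187)
print(length)                  # 3432
print(protein_length(length))  # 1143
```

Both values agree with the Gene Length and Protein Length fields of this record.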
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000382571 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTGCGC AATCTTTGAC TAAAAGTTTT AAAAAGGCGC GCAGGGCCGT AGCCGTAGTG GCGGTCCTCG CCCTGCTCGC GGCGCTTTTG CCGGCCGTAC CGGCGTGGGC GGCGGACGTG ACTACGATCA ACATAAGCAA TCCAACGTCT ACGTCGCCCA AATACGTCAA GGCCGGGGAG ACTTTTACTG TCCAATACAC TACTGATGGC CAGGGGACCG GCTATGTGCG GTTCTCTTTA GTGAATGCAA CAGGAACAAG GGAGTTGGCC GTTGATCAGG TCACGTTTCC TGTGACCAGC GGCTCCAAGG TTCTCACCGT TCCAGCGGAC ATGGCGGATG GAACGTATGA CCTGAAAGTC GAGGCCAGGG AGCAATATCA GGCCAACTGG TACCCCACGA CTGTGAATGG CGCCGTTATA GTGGACAGCA CACCCCCGAG CATTACGGCA AGCACCCTAA CCAGCCCCAA CGGCGGCGAA AATTGGAAGG TTGGTACGTC GCAAAACATA ACCTGGCGCT CCACCGATAT ATCCGATGAT AATTTGAAAG GCATCAACCT CTACTACTCC ACCGACGGCG GCGCTAACTG GACGCTCATC GAGGGCAATA AAACCAATGA TGGAACCTAC TCCTGGACTG TGCCGGGCGC CATAAGCACT AACTGTAAGG TGAAGATCGA GGCCGAGGAC ATGGCCGGCA ACAAGGTCTC GGACGTCAGC GACGGCACTT TCACCATCTA CGGTGTCGAC AATTCCGCCC CGACTGTCAG CCTGACTGCG CCCGCGAATA ACGCGTGGGT CGGTGGGACC TACTCCGTCA CCGCTAATGC CAGCGACTCT GAGTCTGGCA TCGCCAGCGT GAGGTTCGAG TACTCCACCG ACAACGGCAG CAACTGGACC ACCATCGGAA CAGCTACTTC TTCTCCCTAC TCCGTGAGCT GGAGCACCGC CAGCATTGCC GACGGTGCCA CGGTGTTGGT GCGGGCCACG GCTACTAACG GTGTTGGCGT TACCAAGACC AGCGACCCGG TGACCGTCCG GATCGACAAT TCCGCTCCGA CTGTCAGTCT GACTGCGCCC GCGAATAATG CGTGGGTCGG TGGGACCTAT ACCGTCACCG CTGACGCCAG CGACTCTGAG TCTGGCATCG CCAGCGTGAG GTTCGAGTAT TCCACCGACA ACGGCAGCAA CTGGACCACC ATCGGAACAG CTACTTCTTC CCCCTACTCC GTGAGCTGGA GCACCGCCAG CATTGCCGAC GGTACCACGG TGCTGGTGCG GGCCACGGCT ACCAACAATG CCGGTGTCGA GGCCAGCGAC ACCAGGACCG TCACGGTGGA CAACTCGGCA CCTTCTAGCA CGGGCGTGAC CGCCCCCGCG GCTGGCGCCG TGTGGAAGAT CGGCACAACG CAGGAGATCA AGTGGAATCC ATTCACTGAC GCCAAGAACA ACCTGTTGCC GAACCCGATT ACGATAAAGC TCTCCACGGA CGGCGGGACT AATTGGACCA CCATCGCCGC GAACGAGGCC AACGACGGAA CTTATACCTG GGTCGTGTCA GGGGTTCCTT CGCAAAATTG CAAGATAAAG ATTGAGGCTA CCGACACCGC CGGGAACACC GGTGAAGCGG TGAGCGATGA ATTCACCATC TGGGGCCAGG ATACCAGCGG CCCGACGGTG GCGCTGTCGG GAGTGAACAA CAACGATAAA GTAAAGGGCA CTGTTACCCT GAACGCCACG GCCAGCGACT CTGAGTCTGG CATCGCCAGG GTGAAGTTCG AGTACTCCAC CGACAACGGC 
AGCAACTGGA CTAACATCGG CACGGATACT TCTTCCCCCT ACTCCGTGAA CTGGGACACC ACCAGCGGCA TTGCCAACGG CACTACGGTG ATGGTACGCG CGGTGGCCAC GAACGGTGTT GGGGCTGAGG TTGCCGATGT CAGGACGGGC ATTCTGGTGG ACAACTCTCC ACCAAGCGTG ACTCTACAGC AAATACCCAA CAATCTGATT GGCGCCACTT GCACCATCAA GGCCGATGCC AGCGATAACG AGTCCGGCAT CGCCAGCGTG AAGTTCCAGT ACTCCACCGA CAACGGCAAC ACCTGGAGTG ACATTGGAAC GGCGACCGCG CCGAACGCCG AGGGCTACTA TCATGTAACC TGGAATGTGC CTCTGGCGGA CGGCACCACC GGCGTGCAGG TGAGAGCCAC GGCTGTTAAC GGTGTAGGCC TAGAATCTGC TCCATCAACC GCGACCGGCC TGACAGTGGA CAAGACGCCA CCTTCGATCA AGACCAACAC ACTGGCCTCG CCCAACGGAA ACGAGCGCTG GGCCAGGGGT TCGAAGCAGT CTATAACCTG GACGAGCGGT GATATTACTG ATAACCACCT GGGTGCGACC CCGGTCAGCC TGTACTATTC CACCGACGGC GGCGCTAACT GGACGCCCAT CGCGGCCGAC GAGACCAACG ACGGCACGTA CGACTGGGTC GTCCCCGCGG TGATCAGCGA TAAGTGCCGC GTCAAGATCG AGGTGCGCGA CCAGGCGGGC AACGTGGCCA GCGACATCAG CGATGCCGAC TTCACCATCT ACGCGGTGGA GCAGAGCGCG CCTGATGTGA CCGTCAACAG CCCGAACGGC GGCGAGAGCT ACACGCCCGG ATCGTCCGTG GTGATCACCT GGACGGCTGC CGACGACACC ACGCCACGGG CTGACCTTAA GGTGGACCTC TACTACTCCA CCGACGGCGG CGCGACCTGG ACTGCCATCG CTACCAACGA GGCCAACGAC GGTGCCTACG CCTGGACGGT GCCCAGCGTT AGCAGCAGCA ATTGCCTGGT GAAAGTGGAG GCCCGGGACG CCGTGGGCAA CGTGGGATAT GATATTAGCG ACCGCACTTT CAGCATCAAC GCCCCCGCGA CCCCGCCAGC AGAAGTTAAC ACGGTAGCGC TTAAGGCCGG CTGGAACCTG GTATCGCTGC CTCTCATCCC GAAGGAGCCG GCCATTGACA CCGTCCTGGC TGGCGTCTAC GGCAATCTCG ATACGGTCTG GGGTTACGAC ACAACAACCG GCACCTGGTC CTCCTACGCG CCGGGCGCGC CGAGCACCCT GACGCAGATG CGAGACGGCC GCGGCTACTG GGTGAAGGTT AACAATGACT GCTCTTTTGC CGTTGACGGT GTGGTGCTGC CCATGCCACC CCAGGTACCG CCGAGCTACG AGCTGGTGCC GGGCTGGAAC CTGATCGGCT TCAAGTCCAC GGTGCCGAAG AAGGCCAGCG AGTACCTGGC GGCGATGGCG GGCAAGTACA CTGTTATCTA TGGCTTCGAC GCTGCCACGC AGCGGTTCCA ACAGGTGCTG GCCGACGACA ACCTGCAGCC CGGTAAAGGT TATTGGGTTG CGGTGACCGA CTCCGGCCGC ATCTATCCGT AA
|
Protein sequence | MSAQSLTKSF KKARRAVAVV AVLALLAALL PAVPAWAADV TTINISNPTS TSPKYVKAGE TFTVQYTTDG QGTGYVRFSL VNATGTRELA VDQVTFPVTS GSKVLTVPAD MADGTYDLKV EAREQYQANW YPTTVNGAVI VDSTPPSITA STLTSPNGGE NWKVGTSQNI TWRSTDISDD NLKGINLYYS TDGGANWTLI EGNKTNDGTY SWTVPGAIST NCKVKIEAED MAGNKVSDVS DGTFTIYGVD NSAPTVSLTA PANNAWVGGT YSVTANASDS ESGIASVRFE YSTDNGSNWT TIGTATSSPY SVSWSTASIA DGATVLVRAT ATNGVGVTKT SDPVTVRIDN SAPTVSLTAP ANNAWVGGTY TVTADASDSE SGIASVRFEY STDNGSNWTT IGTATSSPYS VSWSTASIAD GTTVLVRATA TNNAGVEASD TRTVTVDNSA PSSTGVTAPA AGAVWKIGTT QEIKWNPFTD AKNNLLPNPI TIKLSTDGGT NWTTIAANEA NDGTYTWVVS GVPSQNCKIK IEATDTAGNT GEAVSDEFTI WGQDTSGPTV ALSGVNNNDK VKGTVTLNAT ASDSESGIAR VKFEYSTDNG SNWTNIGTDT SSPYSVNWDT TSGIANGTTV MVRAVATNGV GAEVADVRTG ILVDNSPPSV TLQQIPNNLI GATCTIKADA SDNESGIASV KFQYSTDNGN TWSDIGTATA PNAEGYYHVT WNVPLADGTT GVQVRATAVN GVGLESAPST ATGLTVDKTP PSIKTNTLAS PNGNERWARG SKQSITWTSG DITDNHLGAT PVSLYYSTDG GANWTPIAAD ETNDGTYDWV VPAVISDKCR VKIEVRDQAG NVASDISDAD FTIYAVEQSA PDVTVNSPNG GESYTPGSSV VITWTAADDT TPRADLKVDL YYSTDGGATW TAIATNEAND GAYAWTVPSV SSSNCLVKVE ARDAVGNVGY DISDRTFSIN APATPPAEVN TVALKAGWNL VSLPLIPKEP AIDTVLAGVY GNLDTVWGYD TTTGTWSSYA PGAPSTLTQM RDGRGYWVKV NNDCSFAVDG VVLPMPPQVP PSYELVPGWN LIGFKSTVPK KASEYLAAMA GKYTVIYGFD AATQRFQQVL ADDNLQPGKG YWVAVTDSGR IYP
|
| |
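The sequences above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how such a grouped string might be cleaned before computing a GC fraction or splitting it into codons. The helper names are hypothetical, and only the first two blocks of the gene sequence are used here, so the printed GC fraction describes that 20 bp prefix and not the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split seq into complete codons, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two display blocks of the gene sequence in this record.
prefix = clean_sequence("GTGAGTGCGC AATCTTTGAC")
print(len(prefix))         # 20
print(gc_fraction(prefix)) # 0.5
print(codons(prefix)[:3])  # ['GTG', 'AGT', 'GCG']
```

The first codon is GTG, which translation table 11 accepts as a start codon, and the protein sequence accordingly begins with M. Applying gc_fraction to the full cleaned gene sequence would be the way to compare against the GC content field.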