Gene Moth_0241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0241 
Symbol 
ID3832569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp239756 
End bp243187 
Gene Length3432 bp 
Protein Length1143 aa 
Translation table11 
GC content60% 
IMG OID637828177 
ProductBNR repeat-containing glycosyl hydrolase 
Protein accessionYP_429119 
Protein GI83589110 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000382571 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGCGC AATCTTTGAC TAAAAGTTTT AAAAAGGCGC GCAGGGCCGT AGCCGTAGTG 
GCGGTCCTCG CCCTGCTCGC GGCGCTTTTG CCGGCCGTAC CGGCGTGGGC GGCGGACGTG
ACTACGATCA ACATAAGCAA TCCAACGTCT ACGTCGCCCA AATACGTCAA GGCCGGGGAG
ACTTTTACTG TCCAATACAC TACTGATGGC CAGGGGACCG GCTATGTGCG GTTCTCTTTA
GTGAATGCAA CAGGAACAAG GGAGTTGGCC GTTGATCAGG TCACGTTTCC TGTGACCAGC
GGCTCCAAGG TTCTCACCGT TCCAGCGGAC ATGGCGGATG GAACGTATGA CCTGAAAGTC
GAGGCCAGGG AGCAATATCA GGCCAACTGG TACCCCACGA CTGTGAATGG CGCCGTTATA
GTGGACAGCA CACCCCCGAG CATTACGGCA AGCACCCTAA CCAGCCCCAA CGGCGGCGAA
AATTGGAAGG TTGGTACGTC GCAAAACATA ACCTGGCGCT CCACCGATAT ATCCGATGAT
AATTTGAAAG GCATCAACCT CTACTACTCC ACCGACGGCG GCGCTAACTG GACGCTCATC
GAGGGCAATA AAACCAATGA TGGAACCTAC TCCTGGACTG TGCCGGGCGC CATAAGCACT
AACTGTAAGG TGAAGATCGA GGCCGAGGAC ATGGCCGGCA ACAAGGTCTC GGACGTCAGC
GACGGCACTT TCACCATCTA CGGTGTCGAC AATTCCGCCC CGACTGTCAG CCTGACTGCG
CCCGCGAATA ACGCGTGGGT CGGTGGGACC TACTCCGTCA CCGCTAATGC CAGCGACTCT
GAGTCTGGCA TCGCCAGCGT GAGGTTCGAG TACTCCACCG ACAACGGCAG CAACTGGACC
ACCATCGGAA CAGCTACTTC TTCTCCCTAC TCCGTGAGCT GGAGCACCGC CAGCATTGCC
GACGGTGCCA CGGTGTTGGT GCGGGCCACG GCTACTAACG GTGTTGGCGT TACCAAGACC
AGCGACCCGG TGACCGTCCG GATCGACAAT TCCGCTCCGA CTGTCAGTCT GACTGCGCCC
GCGAATAATG CGTGGGTCGG TGGGACCTAT ACCGTCACCG CTGACGCCAG CGACTCTGAG
TCTGGCATCG CCAGCGTGAG GTTCGAGTAT TCCACCGACA ACGGCAGCAA CTGGACCACC
ATCGGAACAG CTACTTCTTC CCCCTACTCC GTGAGCTGGA GCACCGCCAG CATTGCCGAC
GGTACCACGG TGCTGGTGCG GGCCACGGCT ACCAACAATG CCGGTGTCGA GGCCAGCGAC
ACCAGGACCG TCACGGTGGA CAACTCGGCA CCTTCTAGCA CGGGCGTGAC CGCCCCCGCG
GCTGGCGCCG TGTGGAAGAT CGGCACAACG CAGGAGATCA AGTGGAATCC ATTCACTGAC
GCCAAGAACA ACCTGTTGCC GAACCCGATT ACGATAAAGC TCTCCACGGA CGGCGGGACT
AATTGGACCA CCATCGCCGC GAACGAGGCC AACGACGGAA CTTATACCTG GGTCGTGTCA
GGGGTTCCTT CGCAAAATTG CAAGATAAAG ATTGAGGCTA CCGACACCGC CGGGAACACC
GGTGAAGCGG TGAGCGATGA ATTCACCATC TGGGGCCAGG ATACCAGCGG CCCGACGGTG
GCGCTGTCGG GAGTGAACAA CAACGATAAA GTAAAGGGCA CTGTTACCCT GAACGCCACG
GCCAGCGACT CTGAGTCTGG CATCGCCAGG GTGAAGTTCG AGTACTCCAC CGACAACGGC
AGCAACTGGA CTAACATCGG CACGGATACT TCTTCCCCCT ACTCCGTGAA CTGGGACACC
ACCAGCGGCA TTGCCAACGG CACTACGGTG ATGGTACGCG CGGTGGCCAC GAACGGTGTT
GGGGCTGAGG TTGCCGATGT CAGGACGGGC ATTCTGGTGG ACAACTCTCC ACCAAGCGTG
ACTCTACAGC AAATACCCAA CAATCTGATT GGCGCCACTT GCACCATCAA GGCCGATGCC
AGCGATAACG AGTCCGGCAT CGCCAGCGTG AAGTTCCAGT ACTCCACCGA CAACGGCAAC
ACCTGGAGTG ACATTGGAAC GGCGACCGCG CCGAACGCCG AGGGCTACTA TCATGTAACC
TGGAATGTGC CTCTGGCGGA CGGCACCACC GGCGTGCAGG TGAGAGCCAC GGCTGTTAAC
GGTGTAGGCC TAGAATCTGC TCCATCAACC GCGACCGGCC TGACAGTGGA CAAGACGCCA
CCTTCGATCA AGACCAACAC ACTGGCCTCG CCCAACGGAA ACGAGCGCTG GGCCAGGGGT
TCGAAGCAGT CTATAACCTG GACGAGCGGT GATATTACTG ATAACCACCT GGGTGCGACC
CCGGTCAGCC TGTACTATTC CACCGACGGC GGCGCTAACT GGACGCCCAT CGCGGCCGAC
GAGACCAACG ACGGCACGTA CGACTGGGTC GTCCCCGCGG TGATCAGCGA TAAGTGCCGC
GTCAAGATCG AGGTGCGCGA CCAGGCGGGC AACGTGGCCA GCGACATCAG CGATGCCGAC
TTCACCATCT ACGCGGTGGA GCAGAGCGCG CCTGATGTGA CCGTCAACAG CCCGAACGGC
GGCGAGAGCT ACACGCCCGG ATCGTCCGTG GTGATCACCT GGACGGCTGC CGACGACACC
ACGCCACGGG CTGACCTTAA GGTGGACCTC TACTACTCCA CCGACGGCGG CGCGACCTGG
ACTGCCATCG CTACCAACGA GGCCAACGAC GGTGCCTACG CCTGGACGGT GCCCAGCGTT
AGCAGCAGCA ATTGCCTGGT GAAAGTGGAG GCCCGGGACG CCGTGGGCAA CGTGGGATAT
GATATTAGCG ACCGCACTTT CAGCATCAAC GCCCCCGCGA CCCCGCCAGC AGAAGTTAAC
ACGGTAGCGC TTAAGGCCGG CTGGAACCTG GTATCGCTGC CTCTCATCCC GAAGGAGCCG
GCCATTGACA CCGTCCTGGC TGGCGTCTAC GGCAATCTCG ATACGGTCTG GGGTTACGAC
ACAACAACCG GCACCTGGTC CTCCTACGCG CCGGGCGCGC CGAGCACCCT GACGCAGATG
CGAGACGGCC GCGGCTACTG GGTGAAGGTT AACAATGACT GCTCTTTTGC CGTTGACGGT
GTGGTGCTGC CCATGCCACC CCAGGTACCG CCGAGCTACG AGCTGGTGCC GGGCTGGAAC
CTGATCGGCT TCAAGTCCAC GGTGCCGAAG AAGGCCAGCG AGTACCTGGC GGCGATGGCG
GGCAAGTACA CTGTTATCTA TGGCTTCGAC GCTGCCACGC AGCGGTTCCA ACAGGTGCTG
GCCGACGACA ACCTGCAGCC CGGTAAAGGT TATTGGGTTG CGGTGACCGA CTCCGGCCGC
ATCTATCCGT AA
 
Protein sequence
MSAQSLTKSF KKARRAVAVV AVLALLAALL PAVPAWAADV TTINISNPTS TSPKYVKAGE 
TFTVQYTTDG QGTGYVRFSL VNATGTRELA VDQVTFPVTS GSKVLTVPAD MADGTYDLKV
EAREQYQANW YPTTVNGAVI VDSTPPSITA STLTSPNGGE NWKVGTSQNI TWRSTDISDD
NLKGINLYYS TDGGANWTLI EGNKTNDGTY SWTVPGAIST NCKVKIEAED MAGNKVSDVS
DGTFTIYGVD NSAPTVSLTA PANNAWVGGT YSVTANASDS ESGIASVRFE YSTDNGSNWT
TIGTATSSPY SVSWSTASIA DGATVLVRAT ATNGVGVTKT SDPVTVRIDN SAPTVSLTAP
ANNAWVGGTY TVTADASDSE SGIASVRFEY STDNGSNWTT IGTATSSPYS VSWSTASIAD
GTTVLVRATA TNNAGVEASD TRTVTVDNSA PSSTGVTAPA AGAVWKIGTT QEIKWNPFTD
AKNNLLPNPI TIKLSTDGGT NWTTIAANEA NDGTYTWVVS GVPSQNCKIK IEATDTAGNT
GEAVSDEFTI WGQDTSGPTV ALSGVNNNDK VKGTVTLNAT ASDSESGIAR VKFEYSTDNG
SNWTNIGTDT SSPYSVNWDT TSGIANGTTV MVRAVATNGV GAEVADVRTG ILVDNSPPSV
TLQQIPNNLI GATCTIKADA SDNESGIASV KFQYSTDNGN TWSDIGTATA PNAEGYYHVT
WNVPLADGTT GVQVRATAVN GVGLESAPST ATGLTVDKTP PSIKTNTLAS PNGNERWARG
SKQSITWTSG DITDNHLGAT PVSLYYSTDG GANWTPIAAD ETNDGTYDWV VPAVISDKCR
VKIEVRDQAG NVASDISDAD FTIYAVEQSA PDVTVNSPNG GESYTPGSSV VITWTAADDT
TPRADLKVDL YYSTDGGATW TAIATNEAND GAYAWTVPSV SSSNCLVKVE ARDAVGNVGY
DISDRTFSIN APATPPAEVN TVALKAGWNL VSLPLIPKEP AIDTVLAGVY GNLDTVWGYD
TTTGTWSSYA PGAPSTLTQM RDGRGYWVKV NNDCSFAVDG VVLPMPPQVP PSYELVPGWN
LIGFKSTVPK KASEYLAAMA GKYTVIYGFD AATQRFQQVL ADDNLQPGKG YWVAVTDSGR
IYP