Gene Moth_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2139 
Symbol 
ID3833139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2238704 
End bp2240263 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content54% 
IMG OID637830064 
ProductBeta-mannanase-like 
Protein accessionYP_430974 
Protein GI83590965 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4124] Beta-mannanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.926185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCAGAT GGACCTGGGT TGTCTTAACC GCCATCCTCC TGGTGGTACT GGTAGTGACC 
GGGCGCCTGG ACAAGCAGGA TAGACAGCTT TCCTTCTTGA TCCCGGGGGT GGCGGGGAAC
TCAATCTCGG GTAGTGATGA TAAAAGTTGG AAAGAATACC AGAATAATTA TGACGGTTTT
AGCCTGGCTT ATCCGGAGGA CTGGCAGCAA GAAGTCATTC CCGGAGTGGC TACTATCCTG
ACGAAGGCCG GTTATGTCAA GCTTACCGTC CTGGCCCAGC CCCTGGGGAA AATTAGCGCC
GAAGAGTACG TCCTTTACAG TAACCGCAGC CTCCAGGAGG GATGGGGCGG TATCAAACTT
CAGGACCAGA AAAAGTTGAA TCTAAGGGGT TACAGCGCCT GGCAGTTTGA CTGGACCCGC
CCGCAGCTGG GACCCGGGGA TTTAAACTAC TACCGCGAAT ACGACCTGGT GGACGGGAAA
ACGGTTTATA CCTTTATGTT AAAGAGCAAC CAGGCTAATT TTGCGGCGGC AACAAAGGAT
TTAAACCGGA TTATCCTGAC CTTTAAGCCC CTGCCGGCTG CGGGAGCCCC TCCCTTGCCG
CCGGCACCGC CGGTGCGGCG GGAGGTTGCC ATCGAAGGCC AGCACCACCG CCTGGTAATC
CCAGCCGACA AGACCCTGTG GGGGATTCTC AATCCCCACA AGCTGGGCCA GATGCAATAC
TTTGATAAGT TGCTGCCCCT GGAGAAACAA CTGGACTTCA AGTTCCAGTT CTTGATTACC
TATGCCGCCT TTGATACCAA ATTCAACACA GCGGAATTAC AAAAAATCTA TAACGACAAC
CGCGTCCTTA TGGTTGCCCT GCAGCCCTGG TGGTACGGTA AGAAGAACGA TACTTCCTTG
ATTGGTCTCG TCCAGGGCAA ATACGACTCG ATTTTACGGG TGTGGGCGCG GCAGTTTAAG
AACCTGGGCG ACCCGGTCTT TGTCCGTTTC GGCAACGAGA TGAACGGCGA CTGGTCTACC
TGGTCGTCCT GGTACACCGG CAAGGATACG GATATTTATA AAATGGCCTG GGAACATGTC
TATAACCTCT TCAAGGCTGA AGGGGCGAAC AATGTAATCT TTGTTTTCAA CCCCCACGAC
CGCTCTTTCC CCAACTTTAA GTGGAATAAC TACCTGCTTT ACTATCCCGG GGATAAAACT
GTGGACTGGA TCGGCTTGAC GGGTTATAAC AACGGTACCA GCTATGCCGC CGACGTCTGG
CGGGACTTCG ACGCCATCTA TGACCCCCTT TACCGGGAGT ACATGCACTA CTTCCCGGAC
AAGCCCTTTA TGATTACGGA GTTCTCGAGC AACGAGGAGG GCGGCGATAA AGCAGCCTGG
ATTCGCCAGT GCTTCCAGCA CATGGCCACC CGCTACCCAA ACATCAAGAT CGCCGTATGG
TTTAACCAGG TCGATGGCAA GTGGCTGTAC AACCTGGATT CTTCGCCCCG GAGTTTTGCC
GCCTTTAAGG AAGGCCTGAA GGATCCCCAC TACCAATTCC AGGCGGTAGT CCCGCGGTAG
 
Protein sequence
MRRWTWVVLT AILLVVLVVT GRLDKQDRQL SFLIPGVAGN SISGSDDKSW KEYQNNYDGF 
SLAYPEDWQQ EVIPGVATIL TKAGYVKLTV LAQPLGKISA EEYVLYSNRS LQEGWGGIKL
QDQKKLNLRG YSAWQFDWTR PQLGPGDLNY YREYDLVDGK TVYTFMLKSN QANFAAATKD
LNRIILTFKP LPAAGAPPLP PAPPVRREVA IEGQHHRLVI PADKTLWGIL NPHKLGQMQY
FDKLLPLEKQ LDFKFQFLIT YAAFDTKFNT AELQKIYNDN RVLMVALQPW WYGKKNDTSL
IGLVQGKYDS ILRVWARQFK NLGDPVFVRF GNEMNGDWST WSSWYTGKDT DIYKMAWEHV
YNLFKAEGAN NVIFVFNPHD RSFPNFKWNN YLLYYPGDKT VDWIGLTGYN NGTSYAADVW
RDFDAIYDPL YREYMHYFPD KPFMITEFSS NEEGGDKAAW IRQCFQHMAT RYPNIKIAVW
FNQVDGKWLY NLDSSPRSFA AFKEGLKDPH YQFQAVVPR