Gene Moth_0651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0651 
Symbol 
ID3832047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp682435 
End bp684243 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content61% 
IMG OID637828592 
Productglycoside hydrolase 15-like protein 
Protein accessionYP_429522 
Protein GI83589513 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0253524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000870842 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAGAGATA TCTTGAAGCA CCGGCCTTAT TACGGGTTTA TCGGCAATGG CGAGACGGCC 
GCCCTTGTTG CTCCGGATTT TGCCATCACC TGGCTGTGCG TGCCCCGTTT CGACAGTTTT
CCCCTTTTCG CCGCCGCCCT GTCCCCGGAA CGGGGCGGCA GCCTGCGTCT GGATATCGAA
CCGGCTGTAA ATCCTGTCAG CCAGCGCTAC CTGCCGGATA GTAACGTCCT GGAGACTGTT
GGCAGGGGCC AGGGCCTGCA GGTGACGGTA CTGGACTTTA TGCCCTGGGG CTACCATCAT
TTAACTAGAC TGATTATTTT AGAAAATACC GGCGTGGATA CTCTAAAACC CCGGGTGCGC
TGGCAGGCCA GCCCCATTAT TACCGGCGCC CATACCTTTA AAGTCCAGTC CCGCGGTTCT
TTCCAGTTTA TTTATGGCCC CGGCGGCGCG GCCTGCATCG GCATGGCCGG TGCCCCGGGA
CCTACTTCGA CCCTGACTTT AAAACCCGGG GAAAAGCACC GCCTCTGGCT GGTCCTGGCT
TACGGCACTA ATCTAACCCT GGCCCGCCAG CACTGGACGG TGGGCTTCTA CAGTTCCCTG
GAAGAAAACC TCGCCTGGTG GCGCCGGTGG TTTAAAAAAG CGCGCCGGCC CAACACGACC
AGCGCGGAAG TTATGGAAGC GTATTACCGC AGCCTGATGG TTTTAAAGCT CCTTACTTAC
GAGCGTACCG GGGCCATAAT CGCGGCGCCG ACCACCTCTT TTCCGGCGGT GCCCGGCGGT
AACGACAACT GGGATTACCG TTACTGCTGG CTGCGAGATG GCTATTTTAC CGCCCTGGCC
TTCGATGCCG CCGGCTACCA TGAGGAGGCC CGCAGCTTTT ATGACTTCGC CCTTTCCCTG
CAGCAGCCCG ACGGGGGCTG GTACATGCCC CTGGTGCCGG TGGAGGGCCG GGGCGGTAAG
GAGTATATCG CCGCTGACCT GGCCGGTCCC CATGGGGAAA AACCCATTCG CTTCGGCAAC
GCCGCCATGC ACCAGATCCA GCTGGACAAT GCCGGTAATG TCCTGGACGG CCTCTGGAAT
CACTACCTGG CCACCCGGGA CCGGGAGTAT ATCCGCTCCC GCTGGGACGC CATCCGCCGG
GCGGCCCTCT GGCTGGAAAA CTACTGGGAC CGGCCGGAAA ACGGCATCTG GGAAATCCGC
GAGCGCAAGG ATCACTGGCT CTATGGCAAG ATCCTCTGCT ACGCCGGTTT GACGGCCGCC
TCCCATTTGT CCATCGAAAT GGGGCGGTTG CAGTGGGCCG GGCGGTGGCA CCGGGCGGCC
AGCCGCGTCC GGCGCCAGCT GCTGATCCAG GGCTGGTCGG CGGAGCGCCA GGCCTACCTG
CAGCATTACG GTCCCGACGC GCCCCTGGAT ATTTCCGTGC TGGCCCTGGA GTTTTACGGT
TTACTGGCAG CCAACCATCC CCGCCTGCTG AAAACGGTGG CCGCCATAGA GCAACCTTCT
CCTGTTGCTA AAGGCGAACC TGCAACCCGG GGCGGGCTCA ATATGTGGGG CGGCATTGCC
CGCTTCGAGC AGGCGGCTAT TCCCTTTTAC TTGCCCACCC TGTGGCTGGG GCGCTATTAC
CTCCATGCCG GCAATTACGA GCGCGCCCGT GAACTGTTGC AGGTCTGCCT GGATAATGCT
ACCGACCTTT ACTTGATGGC CGAACACTTC GACCCCCGGA CCGGCGAGCA ATGGGGTAAT
TTTCCCCAGG GGTTCAGCCA CCAGGAAATA GTCCGTTTCC TGCTGGATTA CGCCTACCGG
GAGGATTAG
 
Protein sequence
MRDILKHRPY YGFIGNGETA ALVAPDFAIT WLCVPRFDSF PLFAAALSPE RGGSLRLDIE 
PAVNPVSQRY LPDSNVLETV GRGQGLQVTV LDFMPWGYHH LTRLIILENT GVDTLKPRVR
WQASPIITGA HTFKVQSRGS FQFIYGPGGA ACIGMAGAPG PTSTLTLKPG EKHRLWLVLA
YGTNLTLARQ HWTVGFYSSL EENLAWWRRW FKKARRPNTT SAEVMEAYYR SLMVLKLLTY
ERTGAIIAAP TTSFPAVPGG NDNWDYRYCW LRDGYFTALA FDAAGYHEEA RSFYDFALSL
QQPDGGWYMP LVPVEGRGGK EYIAADLAGP HGEKPIRFGN AAMHQIQLDN AGNVLDGLWN
HYLATRDREY IRSRWDAIRR AALWLENYWD RPENGIWEIR ERKDHWLYGK ILCYAGLTAA
SHLSIEMGRL QWAGRWHRAA SRVRRQLLIQ GWSAERQAYL QHYGPDAPLD ISVLALEFYG
LLAANHPRLL KTVAAIEQPS PVAKGEPATR GGLNMWGGIA RFEQAAIPFY LPTLWLGRYY
LHAGNYERAR ELLQVCLDNA TDLYLMAEHF DPRTGEQWGN FPQGFSHQEI VRFLLDYAYR
ED