Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0651 |
Symbol | |
ID | 3832047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 682435 |
End bp | 684243 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637828592 |
Product | glycoside hydrolase 15-like protein |
Protein accession | YP_429522 |
Protein GI | 83589513 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0253524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000870842 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAGAGATA TCTTGAAGCA CCGGCCTTAT TACGGGTTTA TCGGCAATGG CGAGACGGCC GCCCTTGTTG CTCCGGATTT TGCCATCACC TGGCTGTGCG TGCCCCGTTT CGACAGTTTT CCCCTTTTCG CCGCCGCCCT GTCCCCGGAA CGGGGCGGCA GCCTGCGTCT GGATATCGAA CCGGCTGTAA ATCCTGTCAG CCAGCGCTAC CTGCCGGATA GTAACGTCCT GGAGACTGTT GGCAGGGGCC AGGGCCTGCA GGTGACGGTA CTGGACTTTA TGCCCTGGGG CTACCATCAT TTAACTAGAC TGATTATTTT AGAAAATACC GGCGTGGATA CTCTAAAACC CCGGGTGCGC TGGCAGGCCA GCCCCATTAT TACCGGCGCC CATACCTTTA AAGTCCAGTC CCGCGGTTCT TTCCAGTTTA TTTATGGCCC CGGCGGCGCG GCCTGCATCG GCATGGCCGG TGCCCCGGGA CCTACTTCGA CCCTGACTTT AAAACCCGGG GAAAAGCACC GCCTCTGGCT GGTCCTGGCT TACGGCACTA ATCTAACCCT GGCCCGCCAG CACTGGACGG TGGGCTTCTA CAGTTCCCTG GAAGAAAACC TCGCCTGGTG GCGCCGGTGG TTTAAAAAAG CGCGCCGGCC CAACACGACC AGCGCGGAAG TTATGGAAGC GTATTACCGC AGCCTGATGG TTTTAAAGCT CCTTACTTAC GAGCGTACCG GGGCCATAAT CGCGGCGCCG ACCACCTCTT TTCCGGCGGT GCCCGGCGGT AACGACAACT GGGATTACCG TTACTGCTGG CTGCGAGATG GCTATTTTAC CGCCCTGGCC TTCGATGCCG CCGGCTACCA TGAGGAGGCC CGCAGCTTTT ATGACTTCGC CCTTTCCCTG CAGCAGCCCG ACGGGGGCTG GTACATGCCC CTGGTGCCGG TGGAGGGCCG GGGCGGTAAG GAGTATATCG CCGCTGACCT GGCCGGTCCC CATGGGGAAA AACCCATTCG CTTCGGCAAC GCCGCCATGC ACCAGATCCA GCTGGACAAT GCCGGTAATG TCCTGGACGG CCTCTGGAAT CACTACCTGG CCACCCGGGA CCGGGAGTAT ATCCGCTCCC GCTGGGACGC CATCCGCCGG GCGGCCCTCT GGCTGGAAAA CTACTGGGAC CGGCCGGAAA ACGGCATCTG GGAAATCCGC GAGCGCAAGG ATCACTGGCT CTATGGCAAG ATCCTCTGCT ACGCCGGTTT GACGGCCGCC TCCCATTTGT CCATCGAAAT GGGGCGGTTG CAGTGGGCCG GGCGGTGGCA CCGGGCGGCC AGCCGCGTCC GGCGCCAGCT GCTGATCCAG GGCTGGTCGG CGGAGCGCCA GGCCTACCTG CAGCATTACG GTCCCGACGC GCCCCTGGAT ATTTCCGTGC TGGCCCTGGA GTTTTACGGT TTACTGGCAG CCAACCATCC CCGCCTGCTG AAAACGGTGG CCGCCATAGA GCAACCTTCT CCTGTTGCTA AAGGCGAACC TGCAACCCGG GGCGGGCTCA ATATGTGGGG CGGCATTGCC CGCTTCGAGC AGGCGGCTAT TCCCTTTTAC TTGCCCACCC TGTGGCTGGG GCGCTATTAC CTCCATGCCG GCAATTACGA GCGCGCCCGT GAACTGTTGC AGGTCTGCCT GGATAATGCT ACCGACCTTT ACTTGATGGC CGAACACTTC GACCCCCGGA CCGGCGAGCA ATGGGGTAAT TTTCCCCAGG GGTTCAGCCA CCAGGAAATA GTCCGTTTCC TGCTGGATTA CGCCTACCGG GAGGATTAG
|
Protein sequence | MRDILKHRPY YGFIGNGETA ALVAPDFAIT WLCVPRFDSF PLFAAALSPE RGGSLRLDIE PAVNPVSQRY LPDSNVLETV GRGQGLQVTV LDFMPWGYHH LTRLIILENT GVDTLKPRVR WQASPIITGA HTFKVQSRGS FQFIYGPGGA ACIGMAGAPG PTSTLTLKPG EKHRLWLVLA YGTNLTLARQ HWTVGFYSSL EENLAWWRRW FKKARRPNTT SAEVMEAYYR SLMVLKLLTY ERTGAIIAAP TTSFPAVPGG NDNWDYRYCW LRDGYFTALA FDAAGYHEEA RSFYDFALSL QQPDGGWYMP LVPVEGRGGK EYIAADLAGP HGEKPIRFGN AAMHQIQLDN AGNVLDGLWN HYLATRDREY IRSRWDAIRR AALWLENYWD RPENGIWEIR ERKDHWLYGK ILCYAGLTAA SHLSIEMGRL QWAGRWHRAA SRVRRQLLIQ GWSAERQAYL QHYGPDAPLD ISVLALEFYG LLAANHPRLL KTVAAIEQPS PVAKGEPATR GGLNMWGGIA RFEQAAIPFY LPTLWLGRYY LHAGNYERAR ELLQVCLDNA TDLYLMAEHF DPRTGEQWGN FPQGFSHQEI VRFLLDYAYR ED
|
| |