Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2048 |
Symbol | |
ID | 7083808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2317393 |
End bp | 2319111 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643699075 |
Product | alpha amylase catalytic region |
Protein accession | YP_002355692 |
Protein GI | 217970458 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.105277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCCCT CTGCAGCCCC CTCCGTCGGC CCGCAGCCGG TTTCGAGCCC CGCTGCCCCC TGGTGGAAAT CCGCCGTCGC CTACCAGCTC TACCCGCGCA GCTTCATGGA CAGCGACGGC GACGGCATCG GCGACCTGCG CGGGATCCTC GCGCGGCTCG ACTACCTGCA ATGGCTGGGC GTCGATGTGA TCTGGATTTG CCCGATCTTC GACTCCCCGA ACGACGACAA CGGCTACGAT ATCCGCGACT ACCGCAAGAT CCTGTCGACC TTCGGCACCA TGGAGGATTT CGACCGCCTG CTCGCCGAAG TCCACCGCCG CGGCATGAGG CTGATCCTCG ACCTGGTGCT CAACCACACC AGCGACGAGC ACCCCTGGTT CCTCGAGTCG CGCGCCTCGC CCGACAACCC CAAGCGCGAC TGGTACATCT GGCGCGAGGG CCGCGACGGG CATCCGCCCA ACAACTGGGA GAGCATCTTC CGCGGCCCCG CGTGGGCCCG GGACCCGGCC ACCGGGCAGT ACTACCTGCA CCTGTTCACC CGCCGCCAGC CCGACCTCGA CTGGACCAAT CCGGAGATGC GCGCGGCCTT CCACGACATC GTGCGCTGGT GGCTGGACAA GGGCATCGAC GGCTTCCGCC TCGATGCGGT GAGCCACATC CGCAAGATGC CGGGCCTGCC CGACCTGCCC AATCCGCGCG GCCTGCCCTG GGTGCCGTCC TTCGCCATGC ACATGAACGT CGACGGCGTG CTCGACACCA TCGGCGAGCT CTGCCGCGAG ACCTTCCGCC GCTACGACGT GATGACCGTG GGCGAGGCCA ACGGCGTCGG CCCGCGCGAG GCGGTGGAGT GGGTGGGCAG CGACCGCGGC CGGCTCGACA TGATCTTCCA GTTCGAGCAC CTGGCGCTGT GGTCGCGCGC GCCCGGCGCG CCGCTCGACG TGGTCGCGCT CAAGAAGGTG CTGTCGCGCT GGCAGCATGC GCTGCACGGC AAGGGCTGGA ACGCGCTCTT CCTGGAGAAC CACGACATCC CCCGCGTCGT CTCCAAGTGG GGCGACACCG GCGCGCTGTG GCGCGAGAGC GCGACCGCGC TGGCGACGAT GTACTTCCTG ATGGAGGGCA CGCCCTTCAT CTACCAGGGC CAGGAGATCG GCATGACCAA CGGCGTCTTC GAGCGCATCG AGGACTTCGA CGACGTGCTG GCGAAGAACG ACTACGCGCA GCGCCGCCTG GCCGGCGAGG AGGAGGGCGC GATCGTCGCC GACCTCGTGC TCACCGGCCG CGACCCGGCG CGCACGCCGA TGCAGTGGGA CGACGGCCCT CAGGCGGGCT TCACGCGCGG ACGGCCGTGG CTGGCGGTCA ATCCCAACCA TCGCCGCATC AACGTCGCCC AGCAGCGCCA CGACCCGGCT TCGGTGCTCA ACCACTACCG CCGCCTGATC GCGCTGCGCA AGGCCGAGCC GGCGCTGGTG CTCGGCGACT ATCGCCTGCT GATGAAGGAC GACGCGCAGA TCTACGCCTA CCAGCGCCGG CTGGACGGCG AGCGCATTGC GGTGATCGTC AACCTGAGCC CTCGCCCGGC GCGCTTCGAT CATCCCGGCG TGGTGCTCCG CCACGAACGC CTGCTGCTCG CCAACCGCGC CGTCGAAGCG CACCCTGCCG CCCACGCGCT CGAGCTCGCA CCCTACGAGG CCCGCGTATA CCGCGTGGGA AAAATGTAG
|
Protein sequence | MPPSAAPSVG PQPVSSPAAP WWKSAVAYQL YPRSFMDSDG DGIGDLRGIL ARLDYLQWLG VDVIWICPIF DSPNDDNGYD IRDYRKILST FGTMEDFDRL LAEVHRRGMR LILDLVLNHT SDEHPWFLES RASPDNPKRD WYIWREGRDG HPPNNWESIF RGPAWARDPA TGQYYLHLFT RRQPDLDWTN PEMRAAFHDI VRWWLDKGID GFRLDAVSHI RKMPGLPDLP NPRGLPWVPS FAMHMNVDGV LDTIGELCRE TFRRYDVMTV GEANGVGPRE AVEWVGSDRG RLDMIFQFEH LALWSRAPGA PLDVVALKKV LSRWQHALHG KGWNALFLEN HDIPRVVSKW GDTGALWRES ATALATMYFL MEGTPFIYQG QEIGMTNGVF ERIEDFDDVL AKNDYAQRRL AGEEEGAIVA DLVLTGRDPA RTPMQWDDGP QAGFTRGRPW LAVNPNHRRI NVAQQRHDPA SVLNHYRRLI ALRKAEPALV LGDYRLLMKD DAQIYAYQRR LDGERIAVIV NLSPRPARFD HPGVVLRHER LLLANRAVEA HPAAHALELA PYEARVYRVG KM
|
| |