Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0497 |
Symbol | |
ID | 7085008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 558560 |
End bp | 559540 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643697526 |
Product | BAAT/Acyl-CoA thioester hydrolase |
Protein accession | YP_002354168 |
Protein GI | 217968934 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.702531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGCAC CCGGACTGAT CGCAGCCTTC ACCGCCGGCG CCGCCACGCT CGCCGGCGGG CGGGCGCTGG TGCATTGGGG GATCCGCAAG GGCCTGGCGG CGCCGCGGGT GCCGCATCAC ACCGATCCGG GCGCGCTCGG GCTGGCGTTC GAGACGCTGC GCATCGGCAC CGAGCATGGG AAGTCGCTGC ATGCGTGGTT CATTCCGGCG CCGGGGAGCG CCTGTGATGG ATCGGACCAC GCTGCAAGCC CGGGCGCGGA CCTTGCGGAC GGCGAGCGCG ATGACGCTCG TGCCGAGTTC GAATCCGCAC CGGCCGTGGT GGTGATGCAC GGCTGGGGCG GCAATGCGGC GCTGATGCTG CCGCTCGCCC GCCCGCTGCA CGAGGCCGGC TACGCGATGC TGTTCGTCGA CGCGCGCTGC CACGGCGCAA GCGACGACGA CAGCTTCGCC TCGCTGCCGC GCTTCGCCGA GGACGCCGAG CACGCCTTCG CGTGGCTGGC GGCGCAGCCC GGGGTGGATC CGGCGCGCAT CGCGCTGCTC GGCCATTCGG TCGGCGCCGG CGCCGTGCTG TTCGCGGCCT TGCGCACGCC GCAGGTGGCG GCGGTGGTGA GCGTGGCGGC GTTCTCGCAT CCGGCGGCGA TGATGCGGCG CTGGCTGGCG GGCAAGCGCA TCCCCGAGAA GCCGTTGGGG CGCTACATCC TCGACTACGT GCAGAAGACG ATCGGCCACC GCTTCGACGA CATCGCGCCG GTGAACACCA TCGCCCGCAT TCGCCGCCCG GTGCTGCTGG TGCATGGCGC GGACGACGAG GTGGTGCCGA TCGACGAGGC CATGCAGATC TACGCGATGC GTGGCGATAC GCCGGTCGAG CTGATGACGT TGTCGGGCGA CCACGAATCC TTCGTCGACC TCGAGCACCA CGTCGGGCGG CTGGTGGAGT TCCTCGGGCG GGTGCTCGCG CAGGGCGGTG CGCGGGAATA G
|
Protein sequence | MGAPGLIAAF TAGAATLAGG RALVHWGIRK GLAAPRVPHH TDPGALGLAF ETLRIGTEHG KSLHAWFIPA PGSACDGSDH AASPGADLAD GERDDARAEF ESAPAVVVMH GWGGNAALML PLARPLHEAG YAMLFVDARC HGASDDDSFA SLPRFAEDAE HAFAWLAAQP GVDPARIALL GHSVGAGAVL FAALRTPQVA AVVSVAAFSH PAAMMRRWLA GKRIPEKPLG RYILDYVQKT IGHRFDDIAP VNTIARIRRP VLLVHGADDE VVPIDEAMQI YAMRGDTPVE LMTLSGDHES FVDLEHHVGR LVEFLGRVLA QGGARE
|
| |