Gene Information |
Locus tag | Tmz1t_1513 |
Symbol | |
ID | 7083595 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1690223 |
End bp | 1691890 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643698530 |
Product | phenylacetic acid degradation protein paaN |
Protein accession | YP_002355167 |
Protein GI | 217969933 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02288] phenylacetic acid degradation protein paaN |
|
|
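The coordinate and length fields above are mutually consistent, and a little arithmetic confirms it. The sketch below assumes the start and end positions are 1-based and inclusive (which the listed gene length implies) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1690223, 1691890)
print(length)                  # 1668
print(protein_length(length))  # 555
```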
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.166964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCACC CCTTGCTCGA AAAGCACCGC GCCACCCTCG AAAGCGCCCT CGACGCCATC GCCACGCGCG GCTACTGGTC TGCCTTCCCC GAGATGCCGA GCCCCAAGCT CTATGGCGAG GCGGCGCCGG ACGAGGGCAA GCGCGCCTTC GAGGCGCACC TGGGCAAGCC GTTCGAGCTC GGGCAGCCGG GCCAGACGGG CTGGCACGGC GGCGAGAGCT CGCCCTATGG CGTGGCGCTC GACGTGCGCT ATCCGGTGTG CGACCCGGAG ACGCTGATCG CCGCCGGGCT CGAGGCAATG AAGGGCTGGC AGGCGGCGGG CGCCGACGGC CGCACCGGCA TCTGCCTGGA AATCCTCCAG CGCCTGAACA AGCAGAGCTT CGAGATCGCC CACGCGGTCA TGATGACCAC CGGCCAGGGC TGGATGATGG CCTTCCAGGC CGGCGGCCCG CACGCCCAGG ATCGCGGCCT CGAGGCCGTC ACCTACGCCT GGCGCGAGCA GAGCTTCGTG CCGGCCGAGA CCACCTGGGA GAAGCCCCAG GGCAAGAACC CGGCGCTGGT CATGAAGAAG CACTTCCAGA TCGTCGGCCG CGGCGTGGGC CTGGTGATCG GCTGCGGCAC CTTCCCGACC TGGAACACCT ACCCGGGCCT GTTCGCCGCG CTCGCCACCG GCAACGCGGT CATCGTCAAG GCGCACAGCA ATGCCATCCT GCCGGCGGCG ATCACCGTGC GCACGATCCG CACCGTGCTC GCCGAGAACG GCATCGACCC CAACCTGGTG AGCCTGTGCG TGGCCACCCA GCGCAGCGTC ACCCAGGCGC TCGCCACCCA CCCGGCGGTG CAGTCGGTCG ACTTCACCGG CAGCAACGTG TTCGGCCAGT GGCTGATCGA CAACTGCCGC CAGGCCCAGG TCTATGCAGA GCTCGCCGGC GTGAACAACA TCGTCATCGA CTCGACGGAC TCCTACAAGG CGATGCTGGG CAACCTCGCC TTCACGCTCT CGCTGTATTC CGGCCAGATG TGCACCACCT CGCAGGCGAT CTTCGTGCCC GCGGGCGGGA TCGACACCGA GGACGGCCAC AAGAGCTACG ACGAGGTCTG CGCCGACCTC GCCCGTGCGG TCGAGCGCTT CCTGTCCAAA CCCGAGGTCG CCCACGCCGT GCTCGGCGCG ATCCAGTCCG CCGACACCGC CGAACGCATC GACATCGCCA ACAGCGGCGC GCTGGGCAAG GTCGTGCTGG CCTCGCAGAA GCTCGACAAC CCCGAGTTCC CGGGCGCCAA GGTGCGCACC CCGGTGCTGC TCGCCTGCGA CGCCGCCGAC GAGAAGGCCT ACATGGAAGA GCGCTTCGGC CCGATCAGCT TCATCGTCAA GGTCGCCGAC ACCGCCGCAG GGATCGCGCT CTCCGAGCGT GTGGTCAGGA CCCACGGCGC GCTCACCGTC GGGCTCTACT CCACGAGGCA GGACGTCATC GACGCGATGA CCGAAGCCAC CTGGCGCGGC AAGGTCGCGC TGTCGATCAA CCTCACCGGC GGCGTGTTCG TGAACCAGTC GGCGGCCTTC TCCGATTACC ACGGCACCGG CGGCAACCCG TCGGCCAACG CCTCGTATTC GGATTCCGCC TTCGTCGCCA ACCGCTTCCG CGTCGTCCAG CGCCGCTACC ACGTCTGA
|
Protein sequence | MPHPLLEKHR ATLESALDAI ATRGYWSAFP EMPSPKLYGE AAPDEGKRAF EAHLGKPFEL GQPGQTGWHG GESSPYGVAL DVRYPVCDPE TLIAAGLEAM KGWQAAGADG RTGICLEILQ RLNKQSFEIA HAVMMTTGQG WMMAFQAGGP HAQDRGLEAV TYAWREQSFV PAETTWEKPQ GKNPALVMKK HFQIVGRGVG LVIGCGTFPT WNTYPGLFAA LATGNAVIVK AHSNAILPAA ITVRTIRTVL AENGIDPNLV SLCVATQRSV TQALATHPAV QSVDFTGSNV FGQWLIDNCR QAQVYAELAG VNNIVIDSTD SYKAMLGNLA FTLSLYSGQM CTTSQAIFVP AGGIDTEDGH KSYDEVCADL ARAVERFLSK PEVAHAVLGA IQSADTAERI DIANSGALGK VVLASQKLDN PEFPGAKVRT PVLLACDAAD EKAYMEERFG PISFIVKVAD TAAGIALSER VVRTHGALTV GLYSTRQDVI DAMTEATWRG KVALSINLTG GVFVNQSAAF SDYHGTGGNP SANASYSDSA FVANRFRVVQ RRYHV
|
| |
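The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction helper and a simple translation routine. It assumes the listed gene sequence is already in coding orientation: despite the minus strand, it starts with ATG and its first codons match the listed protein. It also relies on translation table 11 using the same codon-to-residue assignments as the standard code. All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Residues for all 64 codons in TCAG order. Table 11 shares these assignments
# with the standard code; it differs in the set of permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate whole codons from the first base, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence listed above.
prefix = clean("ATGCCCCACC CCTTGCTCGA")
print(translate(prefix))  # MPHPLL
```

Passing the full gene sequence through `clean` and then `gc_content` or `translate` would allow the GC content and protein sequence fields to be cross-checked against the record.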