Gene Tmz1t_1513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1513 
Symbol 
ID7083595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1690223 
End bp1691890 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content69% 
IMG OID643698530 
Productphenylacetic acid degradation protein paaN 
Protein accessionYP_002355167 
Protein GI217969933 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02288] phenylacetic acid degradation protein paaN 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.166964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCACC CCTTGCTCGA AAAGCACCGC GCCACCCTCG AAAGCGCCCT CGACGCCATC 
GCCACGCGCG GCTACTGGTC TGCCTTCCCC GAGATGCCGA GCCCCAAGCT CTATGGCGAG
GCGGCGCCGG ACGAGGGCAA GCGCGCCTTC GAGGCGCACC TGGGCAAGCC GTTCGAGCTC
GGGCAGCCGG GCCAGACGGG CTGGCACGGC GGCGAGAGCT CGCCCTATGG CGTGGCGCTC
GACGTGCGCT ATCCGGTGTG CGACCCGGAG ACGCTGATCG CCGCCGGGCT CGAGGCAATG
AAGGGCTGGC AGGCGGCGGG CGCCGACGGC CGCACCGGCA TCTGCCTGGA AATCCTCCAG
CGCCTGAACA AGCAGAGCTT CGAGATCGCC CACGCGGTCA TGATGACCAC CGGCCAGGGC
TGGATGATGG CCTTCCAGGC CGGCGGCCCG CACGCCCAGG ATCGCGGCCT CGAGGCCGTC
ACCTACGCCT GGCGCGAGCA GAGCTTCGTG CCGGCCGAGA CCACCTGGGA GAAGCCCCAG
GGCAAGAACC CGGCGCTGGT CATGAAGAAG CACTTCCAGA TCGTCGGCCG CGGCGTGGGC
CTGGTGATCG GCTGCGGCAC CTTCCCGACC TGGAACACCT ACCCGGGCCT GTTCGCCGCG
CTCGCCACCG GCAACGCGGT CATCGTCAAG GCGCACAGCA ATGCCATCCT GCCGGCGGCG
ATCACCGTGC GCACGATCCG CACCGTGCTC GCCGAGAACG GCATCGACCC CAACCTGGTG
AGCCTGTGCG TGGCCACCCA GCGCAGCGTC ACCCAGGCGC TCGCCACCCA CCCGGCGGTG
CAGTCGGTCG ACTTCACCGG CAGCAACGTG TTCGGCCAGT GGCTGATCGA CAACTGCCGC
CAGGCCCAGG TCTATGCAGA GCTCGCCGGC GTGAACAACA TCGTCATCGA CTCGACGGAC
TCCTACAAGG CGATGCTGGG CAACCTCGCC TTCACGCTCT CGCTGTATTC CGGCCAGATG
TGCACCACCT CGCAGGCGAT CTTCGTGCCC GCGGGCGGGA TCGACACCGA GGACGGCCAC
AAGAGCTACG ACGAGGTCTG CGCCGACCTC GCCCGTGCGG TCGAGCGCTT CCTGTCCAAA
CCCGAGGTCG CCCACGCCGT GCTCGGCGCG ATCCAGTCCG CCGACACCGC CGAACGCATC
GACATCGCCA ACAGCGGCGC GCTGGGCAAG GTCGTGCTGG CCTCGCAGAA GCTCGACAAC
CCCGAGTTCC CGGGCGCCAA GGTGCGCACC CCGGTGCTGC TCGCCTGCGA CGCCGCCGAC
GAGAAGGCCT ACATGGAAGA GCGCTTCGGC CCGATCAGCT TCATCGTCAA GGTCGCCGAC
ACCGCCGCAG GGATCGCGCT CTCCGAGCGT GTGGTCAGGA CCCACGGCGC GCTCACCGTC
GGGCTCTACT CCACGAGGCA GGACGTCATC GACGCGATGA CCGAAGCCAC CTGGCGCGGC
AAGGTCGCGC TGTCGATCAA CCTCACCGGC GGCGTGTTCG TGAACCAGTC GGCGGCCTTC
TCCGATTACC ACGGCACCGG CGGCAACCCG TCGGCCAACG CCTCGTATTC GGATTCCGCC
TTCGTCGCCA ACCGCTTCCG CGTCGTCCAG CGCCGCTACC ACGTCTGA
 
Protein sequence
MPHPLLEKHR ATLESALDAI ATRGYWSAFP EMPSPKLYGE AAPDEGKRAF EAHLGKPFEL 
GQPGQTGWHG GESSPYGVAL DVRYPVCDPE TLIAAGLEAM KGWQAAGADG RTGICLEILQ
RLNKQSFEIA HAVMMTTGQG WMMAFQAGGP HAQDRGLEAV TYAWREQSFV PAETTWEKPQ
GKNPALVMKK HFQIVGRGVG LVIGCGTFPT WNTYPGLFAA LATGNAVIVK AHSNAILPAA
ITVRTIRTVL AENGIDPNLV SLCVATQRSV TQALATHPAV QSVDFTGSNV FGQWLIDNCR
QAQVYAELAG VNNIVIDSTD SYKAMLGNLA FTLSLYSGQM CTTSQAIFVP AGGIDTEDGH
KSYDEVCADL ARAVERFLSK PEVAHAVLGA IQSADTAERI DIANSGALGK VVLASQKLDN
PEFPGAKVRT PVLLACDAAD EKAYMEERFG PISFIVKVAD TAAGIALSER VVRTHGALTV
GLYSTRQDVI DAMTEATWRG KVALSINLTG GVFVNQSAAF SDYHGTGGNP SANASYSDSA
FVANRFRVVQ RRYHV