Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2504 |
Symbol | |
ID | 7873943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2706507 |
End bp | 2707520 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699426 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_002889483 |
Protein GI | 237653169 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.917927 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGTT TTCTCCCCCG CCTGCTCCTG CTGCTCGCCG TGCTTGCCAT CCTTGCCGCG ATGATCGCGG CGACGGGCTG GTGGTATGCG CACCGGCCGC TCGCGCTCGC CGCCGAGCGG GTGGATTTCA CCGTGGCCCG CGGCATGGGC ATGCGCCAGG CCGCCGCCGC CATCGAGCGC GCCGGCGTGG GCGTGGATGC GCGCCTGCTC GCGCTGCTCG CGCGTCTGAC GAAGCGCGAC GCCCGCATCA AGGCCGGCAG CTACGAGGTG CACGCCGGCA TCACGCCCTG GCAGCTCATC CTCAAGCTCT CCGACGGCGA CGTCACGCAG GGCGAGCTGT TGCTGGTCGA GGGCTGGACC TTCCGCCAGG TGCGCCAGGC GCTGGAGTCC CATCCGGATC TGGAAGCCGA CACCGCCGGG CTGGGCGAGG CGGAGATCCT CGCGCGCATC GGCGCGAGCG CGCAGAACGC CGAGGGCCTC TTCTTCCCCG ATACCTATCT GTTCGACAAG CGCTCGGGCG CGCTCGCCGT GCTGCGACGC GCGCACGAGG CCATGCAGGC CCGCCTCGAC AAGGCCTGGG CCGAGCGCGA CCCGGCCACG CCGCTGGCCT CGCCCTACGA GGCGCTGATC CTCGCCTCGA TCGTCGAGAA GGAGACCGGC CGCCCCGAGG ACCGCGCCCT GGTCGCCTCG GTGTTCGCCA ACCGGCTGCG CATCGGCATG CGTCTGCAGA CCGACCCCAC GGTGATCTAC GGCCTCGGCC CCGAGTTCGA CGGCCGCCTG CGCCGGGCGC ATCTCGATGC CGACCACCCG TGGAACACCT ACACCCGTGC CGGCCTGCCG CCGACGCCGA TCGCGATGCC GGGCGAGGCC GCGCTGCGCG CTGCGCTCAA ACCCGAGAAG AGCGACTTCC TTTATTTCGT CGCGCGCGGC GACGGCAGCA GCGAGTTTTC GCGCGACCTC GCGGCGCACA ATCGCGCCGT CGATAAATAC ATCCGCAACG GAGGTGGGGG ATGA
|
Protein sequence | MKRFLPRLLL LLAVLAILAA MIAATGWWYA HRPLALAAER VDFTVARGMG MRQAAAAIER AGVGVDARLL ALLARLTKRD ARIKAGSYEV HAGITPWQLI LKLSDGDVTQ GELLLVEGWT FRQVRQALES HPDLEADTAG LGEAEILARI GASAQNAEGL FFPDTYLFDK RSGALAVLRR AHEAMQARLD KAWAERDPAT PLASPYEALI LASIVEKETG RPEDRALVAS VFANRLRIGM RLQTDPTVIY GLGPEFDGRL RRAHLDADHP WNTYTRAGLP PTPIAMPGEA ALRAALKPEK SDFLYFVARG DGSSEFSRDL AAHNRAVDKY IRNGGGG
|
| |