Gene Information |
Locus tag | Tmz1t_2844 |
Symbol | |
ID | 7873252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3078910 |
End bp | 3080931 |
Gene Length | 2022 bp |
Protein Length | 673 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 643699765 |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_002889820 |
Protein GI | 237653506 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
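The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based and inclusive (the reported Gene Length is consistent with that reading) and that the Protein Length excludes one terminal stop codon. The function names, including `extract_cds`, are hypothetical helpers for illustration and are not part of any database API. `extract_cds` further assumes that a minus-strand gene is reported in coding orientation, i.e. as the reverse complement of the replicon's forward strand.

```python
# Values copied from the Gene Information table.
START_BP = 3078910
END_BP = 3080931
STRAND = "-"


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def extract_cds(replicon: str, start_bp: int, end_bp: int, strand: str) -> str:
    """Slice a feature out of a replicon held as a plain Python string.

    Hypothetical helper: assumes `replicon` is the forward strand and that
    minus-strand features should be returned in coding orientation.
    """
    region = replicon[start_bp - 1:end_bp]  # 1-based inclusive -> 0-based slice
    return reverse_complement(region) if strand == "-" else region


length = gene_length(START_BP, END_BP)
print(length, "bp")                  # compare with the Gene Length field
print(protein_length(length), "aa")  # compare with the Protein Length field
```

With the table's values, the two printed numbers should agree with the Gene Length and Protein Length fields. To use `extract_cds`, the full NC_011662 sequence would have to be loaded separately.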
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0929877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCTC CGCCCGCCGT CCCACCGCTC GACCCCGCCT GCTTCGCGCT GCTCGACGAC TGCCATGCCA CCGGCGCTTC GCCGTCCAGC CGCCTTTATC GTGGCCTGGT GCGCGCGCAT CGCTGCGCGG ACCCGTCCGC GCTGGAGGCG ACGTGGGCCG CGGTGGACGC CGACCTGCGC GCCGGCCTGC ACGCGGTGGT GCTGGCCGAC TACGAGTGGG GCGTGCGGCT GAATGGCGTC GGGCAGGAGG ACGTCGGGCG GGAGAGTAGG GCGCCGGAGG CTTGCGGCCG CTTGGCCGTG CTGATGTTCG CCACGCTGCA GCACCTTTCG GCCGTCGCGG TCGAGGCCTG GCTGGCGGCG GCCGAGGCCG GCGGGAGCGA GACGGGCAAG CTCGAGGTGT GTGAGGCGCA GCGCCAGCGC CGGCCTGCGA TCGGCACATC GGCGACGGCG CCCGATCCCG ACACCGGTCC GGCTCCCGCC GGCGTGCTCG GCCTGCGCGC CTCGGTCGAT CGCGCGGCCT TCGAGGACGC GATCGCGCGC ATCCACGCCG CCATCGCCGA GGGCGAGACC TACCAGGTCA ACTACAGCTA CCGCCTGGAT TTCGACACCT TCGGCTCGCC GCTCGCGCTC TACCGCCGCC TGCGTGCGCG CCAGCCGGTG GCCTTCGGCG CGCTGATCCG CCTGCCCGCC GACGCCGACG AGGGCGGCCC GGACTGGGTG CTGTCGTGCT CGCCGGAGCT CTTCCTGCGC CACCACGACG GCGTGCTGCA GGCGCGCCCG ATGAAGGGCA CTGCGGCGCG CAGCGGCGAC GACGCGGCCG ATGCGCTCGC CGCGCGCAGG CTCGCGGCCG ACGCCAAGAA CCGCGCCGAA AACCTCATGA TCGTCGACCT GCTGCGCAAC GACCTCGGCC GCGTCGCGAC CACCGGCAGC GTGCGCGTGC CGGCGCTGTT CGAGGTCGAG CGCTACCCGA CCGTGCTGCA GATGACCTCC ACGGTCGCGG CGGAGCTGCC CGCGGACGTC GGCTTTCCCG CGCTGCTGCG CGCGCTGTTT CCCTGCGGCT CGATCACCGG CGCGCCCAAG CACCGCACGA TGCAGCTCAT CGCCGAGCTG GAGACCGAGC CGCGCGGCCT CTACACCGGC AGCCTCGGCT GGATCGACGC GCCGCGCCCG GGGCACGCCT GCGGCGACTT CTGCCTGTCG GTGGCGATCC GCACGCTGAC GCTGGAGCGC ATCGGGCAGG GTGCGGATGC GCTCGCCGGC GCGCGGCATC GCGGCCGCAT GGGCGTGGGC GCCGGCATCG TGATCGACAG CGCGGCCGCC GACGAGTACG CGGAGTGCCG GCTCAAGGCG CGCTTCCTCA GCGCGCTCGA CCCCGGCTTC GCGCTCTTCG AGACCCTGTA CGCGACCCGC GAGGCCGGCG TGCGCAATCT CGACCGCCAC CTCGCCCGCC TCGAAGCCAG CGCCGCCGCG CTCGACTTCG TGTTCGAGCG CGCACGCATC GAGGTCGCGC TGGGCGCCCA GCTCGCCGCG CTGCCGCCGG CCACGCCCTC GCGCCTGCGC CTGGCGCTGC ACAAGGACGG CCGCCTCGAG CTCGCCGCCG CCGCGCTCGA CGCGCTGCCG CCCGGCGCGG TCACGGTGCT GCTCGCCGAG CGCCCGCTCG ACGACCCGCA GGGCCTCGGC GCGCACAAGA CCACGCTGCG CGCGCGCTAC GACGAGGGCC TACGCGCCGC GCTGGCCGCG GGCGCCTTCG ACACGCTGTT CTTCGACTCG GCGGGCCGCC TCACCGAGGG CGCGCGCAGC 
AACGTCTTCC TGCTCCTCGA CGGCGAGTGG CGCACCCCGC CCGCCGGCCG CGGCGTGCTG CCCGGCACGA TGCGCGCCGC GCTGCTGGAG GATCCGTCCT GGGCCGCGCG CGAGGCCGAG CTGCGCGTGG AGGACCTGCT GCGCGCGCAG CGCATCGTGC TGACGAACGC GCTGCGCGGG GTGGTGGAGG CGAGGCTGGA GCGCGCCGTC GGCGGTCGCT GA
|
Protein sequence | MSAPPAVPPL DPACFALLDD CHATGASPSS RLYRGLVRAH RCADPSALEA TWAAVDADLR AGLHAVVLAD YEWGVRLNGV GQEDVGRESR APEACGRLAV LMFATLQHLS AVAVEAWLAA AEAGGSETGK LEVCEAQRQR RPAIGTSATA PDPDTGPAPA GVLGLRASVD RAAFEDAIAR IHAAIAEGET YQVNYSYRLD FDTFGSPLAL YRRLRARQPV AFGALIRLPA DADEGGPDWV LSCSPELFLR HHDGVLQARP MKGTAARSGD DAADALAARR LAADAKNRAE NLMIVDLLRN DLGRVATTGS VRVPALFEVE RYPTVLQMTS TVAAELPADV GFPALLRALF PCGSITGAPK HRTMQLIAEL ETEPRGLYTG SLGWIDAPRP GHACGDFCLS VAIRTLTLER IGQGADALAG ARHRGRMGVG AGIVIDSAAA DEYAECRLKA RFLSALDPGF ALFETLYATR EAGVRNLDRH LARLEASAAA LDFVFERARI EVALGAQLAA LPPATPSRLR LALHKDGRLE LAAAALDALP PGAVTVLLAE RPLDDPQGLG AHKTTLRARY DEGLRAALAA GAFDTLFFDS AGRLTEGARS NVFLLLDGEW RTPPAGRGVL PGTMRAALLE DPSWAAREAE LRVEDLLRAQ RIVLTNALRG VVEARLERAV GGR
|
| |
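The sequences above are printed in space-separated blocks of 10 residues, so the spacing has to be removed before any analysis. The Python sketch below shows one way to do that and then compute GC content and a conceptual translation. The codon table uses the standard assignments, which translation table 11 shares for elongation codons. The sketch does not model alternative start codons, and all function names are illustrative. Only the first three and the last two blocks of the gene sequence are used, to keep the example short.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three and last two blocks of the gene sequence listed above.
first_blocks = "ATGTCCGCTC CGCCCGCCGT CCCACCGCTC"
last_blocks = "GGCGGTCGCT GA"

print(translate(first_blocks))
print(translate(last_blocks))
print(f"GC content of the first 30 bases: {gc_fraction(first_blocks):.1%}")
```

The translation of the first ten codons reproduces the first block of the protein sequence, and the final codon is a stop. Applying `gc_fraction` to the complete gene sequence gives a value to compare with the GC content field.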