Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0926 |
Symbol | |
ID | 7085029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1016188 |
End bp | 1017660 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643697949 |
Product | anthranilate synthase component I |
Protein accession | YP_002354589 |
Protein GI | 217969355 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGAAC ACGAATTCAA CGCCCTCGCC GCCGAGGGCT ACAACCGCAT CCCGCTCACC CTCGAGACCT TCGCCGACCT CGACACCCCG CTCTCCATCT ACCTGAAGCT CGCCAACGAG CCCTACACCT ACCTGCTCGA ATCGGTGCAG GGCGGCGAGC GCTTCGGGCG CTATTCCTTC ATCGGCCTGT CCTCGCCCAC CCGCATCGAG GTCTATGGCC GCTCCGCCCT GCTGCTCACC GGCAACCGCC TGGTCGAGCG TCGCGACTAC GGCGACCCGC TCAACTTCGT CGCCGAGTTC ATGAACCGCA TCAAGGTGCC GCCGCGCCAG CACCTGCCGC GCTTCGCCGG CGGTCTGGTG GGCTGTTTCG GCTACGACAC CGTGCGCTAC ATCGAGCCGC GCCTGGCCAA GACCGAGAAA ACCGACACCG TCGGCACGCC CGACATCCTG CTGCTGCTGT CGGAAGAGGT CGCGATCGTC GACAATCTAA GCGGCAAGCT CACCCTCGTC GTCTATGCCG AGCCCGAGGT GCCGGGCGCG TACAAGCGCG CGCAGAGGCG CCTGCGCGAG CTCCTCGCCC GCCTGCGCGA GGCGGTGGAG ATCCCCGAGG AGTTCCGCGG CGAGTCCGCC GAGCCGGTGT CGAGCTTCGG CGAGGAAGCC TTCAAGGACG CGGTGCGCCG CGCCAAGCAG TACATCGTCG ACGGCGACCT CATGCAGGTC GTGCTGTCGC AGCGCATGAG CAAGCCCTAC GCCGCCAGCC CGATGGCGCT GTACCGCGCG ATCCGCTCGC TCAACCCCTC GCCCTACCTG TTCTACTTCA ACCTCGAGGA CTTCCACGTC GTCGGCGCCT CGCCCGAGAT CCTCACCCGG CTCGAGGACG ACGTCGTCAC CGTGCGTCCG ATCGCCGGCA CCCGCAAGCG CGGCGCCACC CCGGCCGAGG ACCTTGCGCT CGAACAGGAA CTGCTCGCCG ATCAGAAGGA GATCGCCGAG CACGTGCAAC TGCTCGACCT CGGCCGCAAC GACGCCGGCC GCGTCTCCGA GGTCGGCACG GTCAGGGTCA CCGAGCGCTT CACCGTCGAG CGCTACTCGC ACGTGATGCA CATCGTCTCC AACGTCGAGG GCAAGCTGCG CGAGGGCCTC GACGCGCTCG CCGTGCTGCG CGCCACCTTC CCCGCGGGCA CCGTGTCGGG CGCGCCCAAG GTGCGCGCGA TGGAGATCAT CGACGAGCTC GAGCCGGTCA AGCGTGGCAT CTACGCCGGC GCGGTGGGCT ACCTCGGCTT CCATGGCGAC ATGGACCTGG CGATCGCGAT CCGTACCGCG GTGGTGAAGG ACGGCCAGAT CCACGTCCAG GCCGGCGCCG GCATCGTCGC CGACTCCGAC CCCGACGCCG AGTGGAACGA GACCCAGAGC AAGGCGCGCG CGATGCTGCG CGCCGCCGAG ATGGCTGAAG GCGGACTCGA CACCCGCTTC TGA
|
Protein sequence | MLEHEFNALA AEGYNRIPLT LETFADLDTP LSIYLKLANE PYTYLLESVQ GGERFGRYSF IGLSSPTRIE VYGRSALLLT GNRLVERRDY GDPLNFVAEF MNRIKVPPRQ HLPRFAGGLV GCFGYDTVRY IEPRLAKTEK TDTVGTPDIL LLLSEEVAIV DNLSGKLTLV VYAEPEVPGA YKRAQRRLRE LLARLREAVE IPEEFRGESA EPVSSFGEEA FKDAVRRAKQ YIVDGDLMQV VLSQRMSKPY AASPMALYRA IRSLNPSPYL FYFNLEDFHV VGASPEILTR LEDDVVTVRP IAGTRKRGAT PAEDLALEQE LLADQKEIAE HVQLLDLGRN DAGRVSEVGT VRVTERFTVE RYSHVMHIVS NVEGKLREGL DALAVLRATF PAGTVSGAPK VRAMEIIDEL EPVKRGIYAG AVGYLGFHGD MDLAIAIRTA VVKDGQIHVQ AGAGIVADSD PDAEWNETQS KARAMLRAAE MAEGGLDTRF
|
| |