Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1541 |
Symbol | |
ID | 4462523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 1672182 |
End bp | 1673618 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639700564 |
Product | anthranilate synthase component I |
Protein accession | YP_843953 |
Protein GI | 116754835 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR01820] anthranilate synthase component I, archaeal clade |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACATATC CAGTCCTAAT AGACGTCACA GACAAGATCA GCGTGGATGC GCCGCTATCG CTCTACCTCT CCCTGAGAGG TAGACGCTAT CCATACCTCC TGGAGTCTGT GGAGAAATCG GGCCAGAGGG CCAGGTTCTC GTTCGTCGGC GCAGACCCCT CAGCGGTTGT GAGATTGAAG AACCGGGAGA TAGAGGTAGA GGTATTCAAC GGCGGGGAGG AGTTCCTGAG CCGGAGGCTC TCGCGGTGCG CGGAGATCGA GGAGTTCAGC CACGGAATAA AAGGAAGGCT GCTTCCGGAG TTCGATATGT TTGACGCCCT CAGAGCCGCG ATACCATGCC CGAGATCCGA TGAGAGGAAC TTCTTCGGCA GACAGATCTT CCTTGGAGGC GGCATCGGGT ACATCGCATA TGATATGGTC AAGGAACGGC TCGGAAAGGT CTCGACCTCA GAAACTCCTG ATGCACAGTT CGCAATAGTC GAGAGCACAT TCATCTTCGA TCACCTCATG AGACGGGTCT ACTTCGCTGT AGTCCCGATG CTCCCCGGGG CGAAGACAGA TCTGATCGAG AGGGTCGAGG GCGTGGACGA TTACCCTGAA AACTCCGAGC TCCGCGGGAG GGTCGTGCGA TGCGGAGATC CTGAGGAGTA CATGGAGGCT GTCGTTGCTG CAAAGAGGCA CATAATCGAT GGAGACATAT TCCAGGTGGT GCTCGCCAGA TCCACAGACG TAGAATGCAG TGATACCATA GCGCTCTACA GAAACCTCCG GAGGATCAAT CCTAGCCCGT ATACATACCT CTTCGAGTTC GGGGGTCTCT CAATAGTGGG CGCATCTCCT GAGACGCTCT TCAACACCTA CGCCGGAATA CTCAAAGTCA ATCCGATTGC TGGGACTTGT CCGCGGGGGA GAACCCCTGA GGAGGATGAG GCCCTGGCCA GGGCGATGCT GAACGACGAG AAGGAGAGGG CCGAGCATGT GATGCTCGTG GATCTCGGAA GGAACGACGT GAGGAGCGTC TGCAGGGCGG GAAGTGTGAA GGTCGAGGAC TTCATGTCGG TTCTGAGATA CTCTCACGTC CAGCACATAG AGACCACGGT CTCGGGCGTG CTGAGGGAGG AGTGCGATCA GTTCGATGCC GCACGCGCGA TCTTTCCCGC AGGAACTCTG TCAGGCGCGC CGAAGATGAG AGCGATGGAG ATCATCGATG AGCTCGAGAA AGAGCCCAGG GGGATATACG GAGGAGGCAT AGGATACTTC TCAGCTGATG GTAGTGCGGA CTTCGCGATA GCGATCAGGA GTATCATTCT GAAGGATAAT ATCGCAAGGG TGCAGGCTGG AGCTGGAATA GTTGCGGACT CTGACCCTGA GAGGGAGCTG GCCGAGACCG AGAGGAAGAT GGGCGCGATG AAGCGTGCTC TGGGGGTGAT CGATTGA
|
Protein sequence | MTYPVLIDVT DKISVDAPLS LYLSLRGRRY PYLLESVEKS GQRARFSFVG ADPSAVVRLK NREIEVEVFN GGEEFLSRRL SRCAEIEEFS HGIKGRLLPE FDMFDALRAA IPCPRSDERN FFGRQIFLGG GIGYIAYDMV KERLGKVSTS ETPDAQFAIV ESTFIFDHLM RRVYFAVVPM LPGAKTDLIE RVEGVDDYPE NSELRGRVVR CGDPEEYMEA VVAAKRHIID GDIFQVVLAR STDVECSDTI ALYRNLRRIN PSPYTYLFEF GGLSIVGASP ETLFNTYAGI LKVNPIAGTC PRGRTPEEDE ALARAMLNDE KERAEHVMLV DLGRNDVRSV CRAGSVKVED FMSVLRYSHV QHIETTVSGV LREECDQFDA ARAIFPAGTL SGAPKMRAME IIDELEKEPR GIYGGGIGYF SADGSADFAI AIRSIILKDN IARVQAGAGI VADSDPEREL AETERKMGAM KRALGVID
|
| |