Gene Mthe_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1541 
Symbol 
ID4462523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1672182 
End bp1673618 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content57% 
IMG OID639700564 
Productanthranilate synthase component I 
Protein accessionYP_843953 
Protein GI116754835 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR01820] anthranilate synthase component I, archaeal clade 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACATATC CAGTCCTAAT AGACGTCACA GACAAGATCA GCGTGGATGC GCCGCTATCG 
CTCTACCTCT CCCTGAGAGG TAGACGCTAT CCATACCTCC TGGAGTCTGT GGAGAAATCG
GGCCAGAGGG CCAGGTTCTC GTTCGTCGGC GCAGACCCCT CAGCGGTTGT GAGATTGAAG
AACCGGGAGA TAGAGGTAGA GGTATTCAAC GGCGGGGAGG AGTTCCTGAG CCGGAGGCTC
TCGCGGTGCG CGGAGATCGA GGAGTTCAGC CACGGAATAA AAGGAAGGCT GCTTCCGGAG
TTCGATATGT TTGACGCCCT CAGAGCCGCG ATACCATGCC CGAGATCCGA TGAGAGGAAC
TTCTTCGGCA GACAGATCTT CCTTGGAGGC GGCATCGGGT ACATCGCATA TGATATGGTC
AAGGAACGGC TCGGAAAGGT CTCGACCTCA GAAACTCCTG ATGCACAGTT CGCAATAGTC
GAGAGCACAT TCATCTTCGA TCACCTCATG AGACGGGTCT ACTTCGCTGT AGTCCCGATG
CTCCCCGGGG CGAAGACAGA TCTGATCGAG AGGGTCGAGG GCGTGGACGA TTACCCTGAA
AACTCCGAGC TCCGCGGGAG GGTCGTGCGA TGCGGAGATC CTGAGGAGTA CATGGAGGCT
GTCGTTGCTG CAAAGAGGCA CATAATCGAT GGAGACATAT TCCAGGTGGT GCTCGCCAGA
TCCACAGACG TAGAATGCAG TGATACCATA GCGCTCTACA GAAACCTCCG GAGGATCAAT
CCTAGCCCGT ATACATACCT CTTCGAGTTC GGGGGTCTCT CAATAGTGGG CGCATCTCCT
GAGACGCTCT TCAACACCTA CGCCGGAATA CTCAAAGTCA ATCCGATTGC TGGGACTTGT
CCGCGGGGGA GAACCCCTGA GGAGGATGAG GCCCTGGCCA GGGCGATGCT GAACGACGAG
AAGGAGAGGG CCGAGCATGT GATGCTCGTG GATCTCGGAA GGAACGACGT GAGGAGCGTC
TGCAGGGCGG GAAGTGTGAA GGTCGAGGAC TTCATGTCGG TTCTGAGATA CTCTCACGTC
CAGCACATAG AGACCACGGT CTCGGGCGTG CTGAGGGAGG AGTGCGATCA GTTCGATGCC
GCACGCGCGA TCTTTCCCGC AGGAACTCTG TCAGGCGCGC CGAAGATGAG AGCGATGGAG
ATCATCGATG AGCTCGAGAA AGAGCCCAGG GGGATATACG GAGGAGGCAT AGGATACTTC
TCAGCTGATG GTAGTGCGGA CTTCGCGATA GCGATCAGGA GTATCATTCT GAAGGATAAT
ATCGCAAGGG TGCAGGCTGG AGCTGGAATA GTTGCGGACT CTGACCCTGA GAGGGAGCTG
GCCGAGACCG AGAGGAAGAT GGGCGCGATG AAGCGTGCTC TGGGGGTGAT CGATTGA
 
Protein sequence
MTYPVLIDVT DKISVDAPLS LYLSLRGRRY PYLLESVEKS GQRARFSFVG ADPSAVVRLK 
NREIEVEVFN GGEEFLSRRL SRCAEIEEFS HGIKGRLLPE FDMFDALRAA IPCPRSDERN
FFGRQIFLGG GIGYIAYDMV KERLGKVSTS ETPDAQFAIV ESTFIFDHLM RRVYFAVVPM
LPGAKTDLIE RVEGVDDYPE NSELRGRVVR CGDPEEYMEA VVAAKRHIID GDIFQVVLAR
STDVECSDTI ALYRNLRRIN PSPYTYLFEF GGLSIVGASP ETLFNTYAGI LKVNPIAGTC
PRGRTPEEDE ALARAMLNDE KERAEHVMLV DLGRNDVRSV CRAGSVKVED FMSVLRYSHV
QHIETTVSGV LREECDQFDA ARAIFPAGTL SGAPKMRAME IIDELEKEPR GIYGGGIGYF
SADGSADFAI AIRSIILKDN IARVQAGAGI VADSDPEREL AETERKMGAM KRALGVID