Gene Mthe_1539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1539 
Symbol 
ID4462521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1670502 
End bp1671527 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content59% 
IMG OID639700562 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_843951 
Protein GI116754833 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACTATC TGGCGAGGCT GATTGAGGGG CAGAATCTCA CCATAGAGGA GGCAGAATCT 
CTCCTGGGCG CGTTCTTCGA TGGCGCTACC GATGCGCAGA TCGCCTCAGC TCTTACCGCT
CTGAGGATGA AGGGTGAGAC TGCTGAGGAG CTCGCAGGTA TGGCAAAGAG GATGCGTGAG
TCCGCGATCC GAATACGCCC CAGGGTCTCC GGAACGCTGG TCGATACATG CGGGACTGGT
GGGGACAGCA CAAACACGAT AAATGTGAGC ACAGCAGCCG CGATAGTTGC AGCAGCGTGC
GGCGTGCCAG TCGCGAAGCA CGGGAACTAC GCTGTGAGCT CACGATGCGG AAGCGCCAAC
GTCCTCGAGG CTCTGGGTGT CAACATCTCC TGCCCTCCTG AGAGGGTGGA GAGCATCATA
GAGTCTGTCG GGATCGGGTT CATGCTCGCC CCGCTCTTTC ATCCGGCGAT GAAGCGCGTA
GCGCATATCA GAAAGGAGAT GGGGATCAGG ACCGTGTTCA ACGTTCTTGG GCCGCTCACA
AATCCGGCAG GTGCTGAGGC TCAGGTCGTG GGGGTGTACT CACCAGCACT CTGTGAGAAG
ATCGCAAATG TTCTGAACCT TCTCGGAACT AAACGGGCGA TGGTTGTGCA CGGCAGCGGT
CTTGACGAGA TATCAAACAC AGGCAGCACC TTCGTCTCCG AGCTGTGCGA TGGGGTGGTG
AGAAACTACG TTGTGGATCC CCGGGATCTT GGGTATCCGC TCGCAGATCT GAATGAGATC
GCTGGAGGGA CTCCTGATGA GAACGCGGAG CGTCTCGTGA GGATATTGAA GGGCGAGAAG
AGCAGGGCGA GGGAGCTGGT GGCGATGAAC GCAGGCGCAG CAGTGTACGT CTCGGGAATC
GCATCCAGCC TGAGAGAGGG GTGCGCGATC GCAGAGGGCG CCATAAGCTC CGGTAGCGCT
CTGGAGACCC TGAAGACCCT GGTCGAGGAG AACGGGGATC CTGGAAGGCT CAGGAGATTC
CTGTGA
 
Protein sequence
MNYLARLIEG QNLTIEEAES LLGAFFDGAT DAQIASALTA LRMKGETAEE LAGMAKRMRE 
SAIRIRPRVS GTLVDTCGTG GDSTNTINVS TAAAIVAAAC GVPVAKHGNY AVSSRCGSAN
VLEALGVNIS CPPERVESII ESVGIGFMLA PLFHPAMKRV AHIRKEMGIR TVFNVLGPLT
NPAGAEAQVV GVYSPALCEK IANVLNLLGT KRAMVVHGSG LDEISNTGST FVSELCDGVV
RNYVVDPRDL GYPLADLNEI AGGTPDENAE RLVRILKGEK SRARELVAMN AGAAVYVSGI
ASSLREGCAI AEGAISSGSA LETLKTLVEE NGDPGRLRRF L