Gene Information |
Locus tag | Mthe_0935 |
Symbol | |
ID | 4463328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 1019339 |
End bp | 1020868 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639699954 |
Product | beta-ribofuranosylaminobenzene 5'-phosphate synthase |
Protein accession | YP_843363 |
Protein GI | 116754245 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1907] Predicted archaeal sugar kinases; [COG3161] 4-hydroxybenzoate synthetase (chorismate lyase) |
TIGRFAM ID | [TIGR00144] beta-RFAP synthase |
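
The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the last codon of the CDS is an untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1019339
end_bp = 1020868
gene_length_bp = 1530
protein_length_aa = 509

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the computed span and protein length should match the Gene Length and Protein Length fields. The strand value does not affect the length arithmetic.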
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTTG CCTTTCCTGT GGCCCAGGAG ATAGCGAAGC TCGAGAGGAT CGTGGGCAGG CTCAGTCCCG TACAGAAGAT GCTTCTGGGG ACCGATGGCT CTGTGACTAG CCTGCTGGAG GTTATCACAG GCTCGCCTGT GGGAATAGAG ACGTTGGAGC AGAGGGTGGT GCCTGCGACT GATGACGTCG CGAGGGAGCT TGATATAGAT GTTGGGGAGG ACGTCAACTA CCGAGTCGTC CGGCTGAAAA ATGCCCGCAC CGGCGAGACG CTGATACACG CGGTCTCCTA CACTCCACTC AAGAGACTGG AGCCCGGGTT CAAGAACGAC CTGATGCGCG CGGACATCCC GATAGGCCAG ATACTCCACA AGCACCGCAT AGAGTCGCGG AGGGATATAA CACAGACGGA GTGCGAGCAG GCGGACGATA GGATGAGCCA GCTCTTCAAC ATCTTCCCGA AGGAGCTGAT GCTCTCAAGG AGATACAAGA TCATAAGAAA GGGGGAGCCG CTGATCGCAA TAAGGGAGAC GTTCCCCTAC AACATGTTTC AGGATACAAG AAGGGTGATC ATCGAGACGC CTGCGAGGAT CCACATGACG CTCACAGACC TCTGCGGCGA GGCCGGGCGG GTCGATGGCG GGGTGGGGAT AGCGCTCGAT AAGCCGAACA TAGTCGTGGA GGGAGAGATC GACAGGGATC TATCAGTCGA GGGAGAGCAG TCTGAGAGAG CGCTGGAGGC GGCGAAGAGG GTCGCCGAGA GGTTCGGGCT TGGAGGAGCG CGCATATCTG TTAGAAGCTG CTACAGGACG CATGTGGGTC TTGGCAGCGG AACACAGCTC GCTGTGGCGG TTGGAAAGGC GCTCTGCGAG CTTTACGGCG AGAAGGCGAG CATCAGAGAG ATTGCCTCTG CGGTTAGCCG CGGCGGGACC AGCGGCATAG GTGTCGCGGC ATTTGAGATG GGCGGCTTCA TAGTCGACGG GGGTCACACA TTCGGCCCGG GAAGGGAGAA GTCTGACTTC AGACCATCAT CTGCTAGCTC TGGGGTGAGA CCACCGCCCG TGATAGCGCG CCATGACTTT CCTGAGAGCT GGAGGATAGT GCTTGCCGTT CCAAACATAG AGAAGGGCGC ATACGGCCAG CGCGAGATCG ACATATTCAG GGAGTACTGC CCTGTCCCGC TCTCAGAGGT CCAGGAGCTG TGCTACCAGA TAATGGTCAG GATGATGCCG TCCGTTGTTG AGGAGGACCT AGATGCATTC GGGATGGCGG TGAACAGGAT ACAGCAGCTG GGCTTCAAGC GCGTGGAGGT CGAGCTGCAG CATCCTATGA TCAAAATGCT GATGCAGGAG ATGGTCTCAG CTGGAGCAGC ATGCGCCGGC CTGAGCTCCT TCGGGCCCAC GGTGTATGCG GTAACAGACA CTAACACCAG GGACATAGAG TCAGCAGCCC GCGATGTGAT GGGCGATATC GGCGGAGAGA TTATTATAAC AAGATCGAGG AACGAGGGGG CCAGGATAAG GACCGCGTAG
Protein sequence | MTFAFPVAQE IAKLERIVGR LSPVQKMLLG TDGSVTSLLE VITGSPVGIE TLEQRVVPAT DDVARELDID VGEDVNYRVV RLKNARTGET LIHAVSYTPL KRLEPGFKND LMRADIPIGQ ILHKHRIESR RDITQTECEQ ADDRMSQLFN IFPKELMLSR RYKIIRKGEP LIAIRETFPY NMFQDTRRVI IETPARIHMT LTDLCGEAGR VDGGVGIALD KPNIVVEGEI DRDLSVEGEQ SERALEAAKR VAERFGLGGA RISVRSCYRT HVGLGSGTQL AVAVGKALCE LYGEKASIRE IASAVSRGGT SGIGVAAFEM GGFIVDGGHT FGPGREKSDF RPSSASSGVR PPPVIARHDF PESWRIVLAV PNIEKGAYGQ REIDIFREYC PVPLSEVQEL CYQIMVRMMP SVVEEDLDAF GMAVNRIQQL GFKRVEVELQ HPMIKMLMQE MVSAGAACAG LSSFGPTVYA VTDTNTRDIE SAARDVMGDI GGEIIITRSR NEGARIRTA
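
The sequences above are displayed in space-separated blocks of ten characters. A short sketch of how the grouping might be removed before further analysis, with a simple GC fraction computed on the result, follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
def clean_sequence(displayed: str) -> str:
    """Hypothetical helper: drop the spaces and line breaks used for display grouping."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Hypothetical helper: fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence, copied from the record.
prefix = clean_sequence("ATGACCTTTG CCTTTCCTGT GGCCCAGGAG")
print(len(prefix), prefix.startswith("ATG"), gc_fraction(prefix))
```

Applying the same two helpers to the full gene sequence would give a value to compare with the GC content field. The value for this 30-base prefix is not expected to match it.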