Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1506 |
Symbol | |
ID | 4462897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 1633110 |
End bp | 1633928 |
Gene Length | 819 bp |
Protein Length | 272 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639700529 |
Product | imidazole glycerol phosphate synthase subunit HisF |
Protein accession | YP_843918 |
Protein GI | 116754800 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0107] Imidazoleglycerol-phosphate synthase |
TIGRFAM ID | [TIGR00735] imidazoleglycerol phosphate synthase, cyclase subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAGCCC GCAGAATCAT TCCGTGTTTG GATTGCGATC TAGGAGTACC GAATGGCAGG GTTGTCAAGG GGATCGAATT CAAGCAGATA AGATACGCAG GGGTTCCGTG GGAGCTGGCG ACCCGGTACT ACGAGGATGG CGCTGACGAG ATCGTATTTC TGGATATCAC AGCCTCCCAC GAGCGCAGGG CGACGATGTT CGACGTCATC AAAAAGACAT CAGAGCATGT TTTCGTGCCG CTGACAGTGG GTGGAGGCAT CTCGAGCCTG GAGGACGCGA GGAATGCGTT CAACGCAGGT GCTGACAAGG TGACTGTGAA CACAGCAGCA CTCAGGAGGC CGGAGCTCAT CAGAGAGATC TCGGAGAGCT ACGGCAGCCA GGCGGTTGTG GTCGCGATCG ACGCGAAGAG AAGGTATCAG GATCTGGACG GCAGGATAAC AATAAACACT GAAAGCGGCC GGTGTTGGTT CGAGTGCTCC TATTACGGAG GGAGGCGATT CACAGGGGTC GACGCGCTCG CATGGGCAAG GCGCGTTGAG GAGCTCGGTG CAGGGGAGAT ACTGCTCACA AGCATGGATC GAGATGGCAC GTACGATGGC TTCGATATCG AGCTCACCGA TGCGGTATCG AGAATGGTGA GGATACCTGT GATCGCATCA GGCGGATGCG CCAGCCCTGA GCACATGTAT GAGGTCTTCA GAAGAACGGA TGCATCCGCA GCGCTCGCTG CGAGCATATT CCACTTCAAC CAGTGGAGTA TAAGGGATTG CAAGAGATAT CTCCACGAAC GCGGAATAAA CGTGAGGATC ACCGATTAG
|
Protein sequence | MLARRIIPCL DCDLGVPNGR VVKGIEFKQI RYAGVPWELA TRYYEDGADE IVFLDITASH ERRATMFDVI KKTSEHVFVP LTVGGGISSL EDARNAFNAG ADKVTVNTAA LRRPELIREI SESYGSQAVV VAIDAKRRYQ DLDGRITINT ESGRCWFECS YYGGRRFTGV DALAWARRVE ELGAGEILLT SMDRDGTYDG FDIELTDAVS RMVRIPVIAS GGCASPEHMY EVFRRTDASA ALAASIFHFN QWSIRDCKRY LHERGINVRI TD
|
| |