Gene Information |
Locus tag | Msed_1689 |
Symbol | trpD |
ID | 5105335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1627506 |
End bp | 1628537 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507583 |
Product | anthranilate phosphoribosyltransferase |
Protein accession | YP_001191768 |
Protein GI | 146304452 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR01245] anthranilate phosphoribosyltransferase |
|
|
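The length fields in the table above can be cross-checked against the coordinates. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1627506
end_bp = 1628537

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print("gene length (bp):", gene_length)
print("length divisible by 3:", gene_length % 3 == 0)
print("protein length (aa):", protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1032 bp) and Protein Length (343 aa).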
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.304746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0692576 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCTA AGGAGGTCCT CAAGAGATTA ACCGAAGGCG TATCCTTGTC TCAAGAGGAG GCAAAGGAAT TAGCCGATTT AATCATGGAG GGATCAATAC CAGAACCTTT GGTTGCTGGT ATCTTAGTTG CACTAAAGAT GAAAGGAGAG ACCCCAGACG AAATAATAGG ATTTGTCAAT TCCATGAGGC AACATGCGCT AAAACTGGAC TTGAGGAACA CCCTCGACAC GGCTGGCACG GGAGGAGACG GTATAGGAAC AATTAACGTT AGCACGGCTA CTGCTCTGGC CGTTAGCTCT GTCTTTCCTG TGGCGAAACA TGGTAATAGG GCTGCAAGTA GTAGAAGCGG AAGCGCGGAC TTCCTTGAAT CCCTTGGGTA CAATATCCAA GTTCCTCCAG AGAAGGCCAA GGATCTACTT TCGCGGAACA ATTTCGTGTT TCTCTTTGCA CAGCTATATC ATCCTTCCAT GAAGAACGTT GCACCTGTAA GGAAGGTCCT AGGAGTTAGA ACTATCTTCA ATCTCCTGGG CCCTCTCACC AACCCAGCAG GATCCGAGAG GCAGGTCATG GGAGTATACT CTCTGCCTTT CATGAGAAAG CTAGCTGAAG CAGCACTTAA GCTAGGTTAC GTCAAGCTTG TACTTGTTCA CGGGGAGCCT GGACTGGATG AGGTCAGCCC TCAAGGGAAG ACATATATCA CAGAGGTAAC GGGGAGTAAG GTTGAGGAAT ATACCTATGA TTTCTCTGAA ATTATAGGTC AACCAGTTCC TGTTTCCAGA TTGACGACCA CAGATCCTCT CGACTCCGTG AGAAGAGTTC TGATGGCGTC CATGGGGAGG GATGAGGCCG TAGAGAAGTT CATTAGAATA AACGTGAGCG TCGCACTTTA CACTGCGGGA CTTGTTTCTG ATTTTAAGGA TGGATTTGAA TTATCAGAGG AACTTGTAAG AAAGTTGCCA GATCGAATTG AGACTTTGGT AAGGGACAAT GGAGATCCTA GTAAGATCAA GGCCATAAAG GGATCCCTAT GA
|
Protein sequence | MDPKEVLKRL TEGVSLSQEE AKELADLIME GSIPEPLVAG ILVALKMKGE TPDEIIGFVN SMRQHALKLD LRNTLDTAGT GGDGIGTINV STATALAVSS VFPVAKHGNR AASSRSGSAD FLESLGYNIQ VPPEKAKDLL SRNNFVFLFA QLYHPSMKNV APVRKVLGVR TIFNLLGPLT NPAGSERQVM GVYSLPFMRK LAEAALKLGY VKLVLVHGEP GLDEVSPQGK TYITEVTGSK VEEYTYDFSE IIGQPVPVSR LTTTDPLDSV RRVLMASMGR DEAVEKFIRI NVSVALYTAG LVSDFKDGFE LSEELVRKLP DRIETLVRDN GDPSKIKAIK GSL
|
| |
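The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the database's own method. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It uses the standard codon assignments, which translation table 11 shares for elongation codons, and it assumes the listed gene sequence is already in coding orientation, which fits the fact that it begins with ATG and ends with TGA. Only the first three and the last two 10-base blocks of the gene sequence are used here, to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces that separate the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last two blocks of the gene sequence listed above.
gene_head = "ATGGATCCTA AGGAGGTCCT CAAGAGATTA"
gene_tail = "GGATCCCTAT GA"

print(translate(gene_head))  # compare with the first protein block
print(translate(gene_tail))  # compare with the last protein block
```

The two fragments translate to MDPKEVLKRL and GSL, matching the first and last blocks of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.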