Gene Information |
Locus tag | Msed_1687 |
Symbol | |
ID | 5105333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1625666 |
End bp | 1626928 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507581 |
Product | anthranilate synthase component I |
Protein accession | YP_001191766 |
Protein GI | 146304450 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR01820] anthranilate synthase component I, archaeal clade |
|
|
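The coordinate and length fields above are consistent with one another. The short check below is a sketch of that arithmetic, not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length counts the terminal stop codon (the gene sequence listed in this record ends in TAA). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1625666
end_bp = 1626928

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon,
# which does not encode an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values match the Gene Length (1263 bp) and Protein Length (420 aa) fields, and the zero remainder shows the CDS is a whole number of codons.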
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00216782 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0764877 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACCT ACCCTATCAC GGCATTTGCT CAGCCCTATG AGGTTTATCA GTGCATTGAG AGAGATCAGG AGATTGCTGC TCTCATGGAG AGTGTAGAGG GCTCTCAAAA CACCGCCAGA TATAGCGTAA TTGCCTGGGG GGTTAAGAGG AAGGTACAGG TGAATAGGGG AGATGACCTG GAGGAATCGA TACTGAACGC TCTAAGAGGT GTAGAGGAGG GCGAGCTTAG GTTTTCAGGT GGTCTTCTAG GTTACATATC GTACGACGCG GTGAGAAGAT GGGAAACAGT TAGGGACTTG AAGCCTGCAA TAGAGGATTG GCCAGACGCC GAGTTCTTCC TTCCAGAGAA CGTTTTGGTC TTCGATCATG CACTGGGAAA GGTGTTTGTG GAGGGAGATA TACCATCAAT AGCTGGATGT TTTGAACAGG GGGAATTCAA GGTGACTCCC CATGACGAGT CGATGACTAA ACAAGAGTAT GAGTCAGGGG TTAACTCGAT ACTAGAATAC ATCAAGTCAG GATACGCATT TCAGGTTGTC CTCTCCAGGT TCTATAGATA CGCTGTCCAG GGTGACCCAA TGAGACTTTA CAGAAACTTG CGAAAGATTA ATCCATCTCC CTACATGTTT TACATTAAAT TTGGGGAGAG GAAACTCATT GGATCCAGTC CGGAGCTTCT ATTCTCAGTT CAAAGGGGGA TCGCTGAAAC TTTCCCGATC GCGGGCACTA GACCTAGGGG AAAGACCAGT GAAGAGGATT TTGAACTGGA ACAGGAACTT CTATCCTCTG AGAAGGAGAT GGCCGAGCAC CTAATGCTTG TGGATTTGGC CAGAAACGAC ATAGGAAAGT CCTGTGTACC AGGAACTGTG AAGGTCCCAG AATTTGCCTA CGTTGAGAAG TACAGCCACG TACAACACAT TGTTAGTAGA GTGGTGGGAA CCCTGAGGAA GGATGCAAAT TCCTTGGATG TTCTAAAGTC CATGTTCCCT GCCGGTACAG TCAGCGGTGC TCCAAAGCCC ATGGCAATGA ACATAATAGA GTTGCTAGAG CCTTACAAGA GGGGTCCCTA TGCTGGTGCA GTGGGTTTCA TCTCAAGGAA TTCAGCGGAG TTTGCAATCA CCATCAGAAC CGCAATGATT AACAGGGATA TTCTTCGCAT ACAAGCTGGA GCTGGGATAG TCTACGATTC AGTTCCTGAG CAGGAGTACT ATGAAACTGA GCATAAAATG AGAGCCCTTA AGGTGGCACT TGGGGTGAGC TAA
|
Protein sequence | MKTYPITAFA QPYEVYQCIE RDQEIAALME SVEGSQNTAR YSVIAWGVKR KVQVNRGDDL EESILNALRG VEEGELRFSG GLLGYISYDA VRRWETVRDL KPAIEDWPDA EFFLPENVLV FDHALGKVFV EGDIPSIAGC FEQGEFKVTP HDESMTKQEY ESGVNSILEY IKSGYAFQVV LSRFYRYAVQ GDPMRLYRNL RKINPSPYMF YIKFGERKLI GSSPELLFSV QRGIAETFPI AGTRPRGKTS EEDFELEQEL LSSEKEMAEH LMLVDLARND IGKSCVPGTV KVPEFAYVEK YSHVQHIVSR VVGTLRKDAN SLDVLKSMFP AGTVSGAPKP MAMNIIELLE PYKRGPYAGA VGFISRNSAE FAITIRTAMI NRDILRIQAG AGIVYDSVPE QEYYETEHKM RALKVALGVS