Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1691 |
Symbol | |
ID | 5105337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1629255 |
End bp | 1630529 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507585 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_001191770 |
Protein GI | 146304454 |
COG category | [R] General function prediction only |
COG ID | [COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) |
TIGRFAM ID | [TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.248453 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAATA GTGAGGAAAT AATACCAAGC TATTGGTATA ACATAATACC CGATTTACCC AAACCTTTAC CTCCTCCTAG GGATCCCCCA GACGCGGAGT TCTCTAGGAT TGACCTTTTG AGAAAAATAA TGCCCAGTGA GGTTCTAAGA CAGCAGTTTA CAGTGGAGAG GTTTGTTCCA ATTCCTAAAG AAGTAAGGGA GGCTTACATC AATATAGGCA GACCTACGCC TTTAGTCAGA GCTAGAAGGC TTGAGGAATT CCTGAACACT CCGGCTAAAA TATATTTCAA GTATGAAGGG GCAACTCCCA CTGGTTCCCA TAAGATAAAC ACAGCCCTAC CGCAGGCTTA CTTTTCCATG AAGGAGGGAG TGGACCACCT GGTCACTGAG ACTGGGGCTG GCCAATGGGG AACAGCTGTA GCTCTATCTG CGAGAATGTA TGGACTAAAT TCCACCATTT TCATGGTAAA GGTGAGTTAC GAACAGAAAC CGCAAAGGAG AACCATCATG CAATTGTATG GCGCACGAGT TTTCGCCAGC CCTACCTCGC ATACCGAGTA CGGCAAGAAG GTTCTAAACG AGAACCCGAA TCATCCAGGT TCCCTCGGAA TAGCCATGAG CGAGGCAATA GAGTACGCTC TCTCCAACGG CTATAAGTAT CTCGTCGGGA GCGTGTTAGA TGTTGTGGTG TTACATCAAA GCGTAATAGG GCTTGAGGCT ATGAAGCAGT TGCAGGAACT AGACGAGGAA CCAGACGTGT TAGTGGGCTG TGTGGGAGGA GGGAGTAACT TTGGAGGCTT TACCTTCCCG TTCATCGGGT CCAAGAAGGG ATCGAAGTAC ATTGCAGTTG GATCTTACGA GATACCAAAG TTCAGTAAGG GGGCCTATAA CTACGATTTC CCAGACAGTG CTGGGCTGTT ACCCCTTGTC AAGATGATTA CCCTAGGAAG GGATTACGTA CCTCCACCAA TATATTCGGG TGGGCTCAGA TATCACGGGG CCGCGCCCTC CCTAAGTATG TTGATCAAGG AAGGGATAGT GGATTGGAGG GAGTACAATG AGAAGGAGAT ATTCGAAGCA GCTCAAATAT TCCTTCAAAC CCAAGGAATA GTTCCAGCAC CAGAATCCTC TCATGCAATT AGGGCAGTGA TAGAGGAGGC ACGCGAGGCC AAGATCAAGA ACGAGAAAAA GGTGATCGTA TTCAACCTGA GTGGTCATGG ATTACTGGAT CTACCTAACT ACGAGTCTAT GATGAAGAGG ATTGGTCAAG ATTGA
|
Protein sequence | MANSEEIIPS YWYNIIPDLP KPLPPPRDPP DAEFSRIDLL RKIMPSEVLR QQFTVERFVP IPKEVREAYI NIGRPTPLVR ARRLEEFLNT PAKIYFKYEG ATPTGSHKIN TALPQAYFSM KEGVDHLVTE TGAGQWGTAV ALSARMYGLN STIFMVKVSY EQKPQRRTIM QLYGARVFAS PTSHTEYGKK VLNENPNHPG SLGIAMSEAI EYALSNGYKY LVGSVLDVVV LHQSVIGLEA MKQLQELDEE PDVLVGCVGG GSNFGGFTFP FIGSKKGSKY IAVGSYEIPK FSKGAYNYDF PDSAGLLPLV KMITLGRDYV PPPIYSGGLR YHGAAPSLSM LIKEGIVDWR EYNEKEIFEA AQIFLQTQGI VPAPESSHAI RAVIEEAREA KIKNEKKVIV FNLSGHGLLD LPNYESMMKR IGQD
|
| |