Gene Information |
Locus tag | Msed_1400 |
Symbol | |
ID | 5104610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1372257 |
End bp | 1373261 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507289 |
Product | threonine synthase |
Protein accession | YP_001191482 |
Protein GI | 146304166 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0498] Threonine synthase |
TIGRFAM ID | [TIGR00260] threonine synthase |
|
|
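The length fields in this record are consistent with the listed coordinates. A minimal sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon not counted in the protein length. Both assumptions fit the values shown here but are not stated by the record itself. The function names are illustrative, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length_bp(1372257, 1373261)
print(length, protein_length_aa(length))  # 1005 334
```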
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000170642 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGGGTTA AATGCATAAA ATGTGGAAGG GAGAGGGAAG GCGTAGAGGT TAGGTGCAAA TGCGGTGGAG TCTTCAAGGT AGAGGTGGAC GTCCCATTTT CTAAAAATCT TAGGGAAAAC TTCCCTTACG TTAAGAGATG GATTTCCCTA GGTGAATGGA ACACCCCGTC CATTAGGGTT GAAGGCTTTA CATATAAGCT GGATTTCCTT AATCCCACCG GTTCATACAA GGATAGAGGA TCGGTAACCC TAATTTCTCA CCTTTCTCAG CTTGGGATAA GGGAGATCTC TGAGGACTCA TCTGGTAATG CAGGCGCCTC TATAGCTGCC TACGGAGCTA TGGCTGGGAT GAAGGTGAAG GTCTTTGTTC CCTCAACTGC AAGGGGAGGG AAACTGAAAC AAATTGAATC TTACGGGGCT GAAGTAGTCA GGGTGGAAGG CACAAGAGAT GACGTATCAC GGGCTGCAGA GAACTCTGGA GCCTATTATG CGTCCCACGT CCTTCAACCT GAGTTTAGGG ACGGAATAAG GTCCTTAGCC TACGAGATAG CGAGGGATCA TGGATGGAAA TCCCCAGGAG AAGTATTTTT GCCCACGTCA GCAGGTACTC TACTCCTAGG AGTTTATGAA GGGTTTAGAC ACATGGTCAG CGAGGGAGTC CTAGATAGGA TGCCCAAGTT AGTTGCAGTT CAAACGGAAC AAGTGAGTCC AGTATGCTCC AAGTTCCGTG GAATAGAATA TAGACCGCCA AGCAGGGTCA CCTCCATAGC TGATGCCCTT GTTTCAACGA ATCCCGTACT CATGGATGAA ATGATTAGGG TTCTTCAAGA GACGGGTGAC TGCGTAGTTG TTAGTGAAAA TGAGATCATG GACTCTTGGA AATACCTTTC CAGGAAAGGG ATACTTGCCG AGTATAGCTC TGCGGTGGCA CTAGCCGGAG GAAGGAAATA TGAGGTGTCA GACCCTGTCA TAGTGCTAAC AGGCAATGGA CTTAAGACTT TATAG
|
Protein sequence | MRVKCIKCGR EREGVEVRCK CGGVFKVEVD VPFSKNLREN FPYVKRWISL GEWNTPSIRV EGFTYKLDFL NPTGSYKDRG SVTLISHLSQ LGIREISEDS SGNAGASIAA YGAMAGMKVK VFVPSTARGG KLKQIESYGA EVVRVEGTRD DVSRAAENSG AYYASHVLQP EFRDGIRSLA YEIARDHGWK SPGEVFLPTS AGTLLLGVYE GFRHMVSEGV LDRMPKLVAV QTEQVSPVCS KFRGIEYRPP SRVTSIADAL VSTNPVLMDE MIRVLQETGD CVVVSENEIM DSWKYLSRKG ILAEYSSAVA LAGGRKYEVS DPVIVLTGNG LKTL
|
| |
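The gene and protein sequences above can be cross-checked against each other. The sketch below is a hedged illustration, not the database's own method: it strips the 10-base spacing used in the record, translates codons with the standard amino-acid assignments (which NCBI translation table 11 shares with table 1; the two differ in permitted start codons), and computes the GC fraction. The helper names `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order; amino-acid assignments
# are the standard ones, which translation table 11 also uses.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def _clean(dna: str) -> str:
    """Remove the whitespace used to group bases and normalise case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = _clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = _clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGGGTTA AATGCATAAA ATGTGGAAGG"))  # MRVKCIKCGR
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence row (ignoring spaces) gives a simple consistency check; `gc_content` on the same input can be compared with the GC content row.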