Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2075 |
Symbol | |
ID | 5105055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1993473 |
End bp | 1994717 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640507965 |
Product | hypothetical protein |
Protein accession | YP_001192139 |
Protein GI | 146304823 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.739423 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTGT TGGTGGCTTT AGCTTTCACC TTAACTCTGA CTCCACTGGT TAACGCTCTC CAGTTCTATC CCAATGGGAA TCAGCCCATT CCGTATCAGG TACCAACAAA GCTTACGTAC TCGCTCAAGG TCTATAACAG TACTAAGGTT GGCAATTCCA CGACTGTCTC TCTAGTTGAG AGCGCTGTGA TTAATTACCA AGTGACCTCG CTTAATGGTA CATGGGTTAA GGTTAACGTA AATTCCAACT ATACTCCTGT GAAGAACGTC ACGTTTATAC AGCCAGGAAG TTACGTTGTA AACTATGCCT TAGATCCGCT CAACCTTAGT TATCCTTATA TTTACCCAGG ATTCCTGTCT AACTCTACCT CTTATGCCAT CGAGTCAAAC GTTTCCACAG TGATTTTATC CTTTGTTACC TCGACCAGTA ACAACGTAAC AGGACAAACA GTTTATAGGT ACTCTGAACT GTCCCCTGTT ACCTCTTCGC TATTAGTATT ACCCTCCGGT TTGGTTCAAA CTATTAACAG GACCGTGTCC GGTCTCGATT TTGTTATGAA TCTCACAGGC TATCAACTGT CTAACGCTCT ACAACCAACC AATTTCACCA GTAGGCCAGG ATATGTTTAC GTTAACATGA CATATTCGAA TTTCTCAGCA ACCTACCAAC CCTCCGGGTA CGTGGAATAC GTTTATCCCG CTCTTCTTCC TGGAAACCTC CTTTTGATGG TTCAGTATAA CATCAATGAA TTGAACGCCT TCCCTCTTGG CGGTTATACG TCGGTGAATG GCCAACTGGT GAACTTCATT ATTCAGGTGG GAACACCTAC CACCCTGGTC ACTAACTTCA TCAGTAACGC GAACGGCACA CTGACATGGA ACTCGCTCAA ACTGTCTTAC GTGGGTAACG TAACCAAGAC GGTTCAGGGT ACCACGTTCA ACTTGGAGGA GTATACATCT AAGGTAACTA GGGGAAACAT TACCTTTGCC ACTGCTACGA TCTACGCGTT GAAGAACATG GTAGTTGAGG TAAACTATAA TCAAACATTC CCATCTTTCT CCAGTTATAA GCTAGAGTTC ATCAATGGCT CCTACATAAA TCCCAGCCTT CACTTCCCCT ACCTAACAGG ATATCAGAAC ACAACTCTGC CCTATAAACC GGTTAACCCC TCTGAGTCCT TCACAATAGC CGTTGTGGTT ACCCTAATTG TTATTGCAAT TCTTGTAATT CTACATAGGA GATAG
|
Protein sequence | MALLVALAFT LTLTPLVNAL QFYPNGNQPI PYQVPTKLTY SLKVYNSTKV GNSTTVSLVE SAVINYQVTS LNGTWVKVNV NSNYTPVKNV TFIQPGSYVV NYALDPLNLS YPYIYPGFLS NSTSYAIESN VSTVILSFVT STSNNVTGQT VYRYSELSPV TSSLLVLPSG LVQTINRTVS GLDFVMNLTG YQLSNALQPT NFTSRPGYVY VNMTYSNFSA TYQPSGYVEY VYPALLPGNL LLMVQYNINE LNAFPLGGYT SVNGQLVNFI IQVGTPTTLV TNFISNANGT LTWNSLKLSY VGNVTKTVQG TTFNLEEYTS KVTRGNITFA TATIYALKNM VVEVNYNQTF PSFSSYKLEF INGSYINPSL HFPYLTGYQN TTLPYKPVNP SESFTIAVVV TLIVIAILVI LHRR
|
| |