Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2081 |
Symbol | |
ID | 5105061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1999374 |
End bp | 2000468 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507971 |
Product | hypothetical protein |
Protein accession | YP_001192145 |
Protein GI | 146304829 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.808811 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCGA GAGGCGGGCT ATCCGATCTA CTTAATAGCA TTGTCTCCAA ATACATAATC TTCCATTTGG AGAGCCCGCC CTTTAGCCCC ATTCTAGACC AAGCACAGGA TGTTGACCTT ACCTCTTCCG ATGACATAAT AAGGGTGGTG GACAGCATAC CCAAGGTATC AACAGAGATC GTTGCGGTGG ACGGGAGTAG CAGGAGCTTT GTTCTCTCCC AGGGATTAAT TTCGGTCAGC TCCGTGTCTG CAATCTCAAG TTTAAGGGGA ATTAAGGGTA TGTTCCCCTC ATTTGACCCC AGCATGAGTC TTGATCTACA GGAGCCCTTC ATCGCTCTCG CCACTCCCTT TACTGGGCCA GAGAGAATAG AGGACTTTCT TCTTCATCCG GCAGTATCCA GGGTTCAGAT GGAGGGAAAC CCGTTTCAAC AGGACTTAAC TAGGTTAGAG ACGGAGCTCC GTTTTTCCTT GGAGACCTCT TCACTGGAGA AGGTGAAGGA TAGTCCCCTG GTCCTGGTGG ACGGTCCTCT CTTACCTAAG TTCCTTTACA TTAACAAGAG AGTTGCCAAT AAACTCCTTC AGAGAAGAAA GGAAGTCCTA CAGAAAAATT TTATTGGAAT TGTTAAAAGG GTAAATCACT CTACTGTCTT GATTGAATCA CTTAATGAAA GGAAAATCAG AGAAGTCATG ATCATGAAAT ACAAAGTGAA TCCAGCCTCA TTCTCCAACG ATGAAGCCTT CTTAATTCAC CTTGTGAAAA AGAACTTTAG GTCTCCCTTT AAACCTCTTG TGGTAGGTCC ACTCCACGGA AAAGAGGAGG GAACTGAGAT ATTTTCCAAC TACGTAGTAA TTCCATTCCA CCCCTTCCTG GAGAGGTTCT CCGTGCTGAG GGTTGAGAGC CTCACTGACT CCCTTGATCC TGGAATTATT TTATCTACGC CAATTACCTC TGATGGAATT CCCCTTCCGC TAGCTTTCGC TGATAAGGTG GCCAAGGAGG TATCTAATGC TGTGTTTAAC ATGTTACTCC AGCAGTTGTC AAGGGAAGGT ATTCAGTCGA GTTTCTACAG CAGACTGGAG GGATTGGGAG CTTGA
|
Protein sequence | MASRGGLSDL LNSIVSKYII FHLESPPFSP ILDQAQDVDL TSSDDIIRVV DSIPKVSTEI VAVDGSSRSF VLSQGLISVS SVSAISSLRG IKGMFPSFDP SMSLDLQEPF IALATPFTGP ERIEDFLLHP AVSRVQMEGN PFQQDLTRLE TELRFSLETS SLEKVKDSPL VLVDGPLLPK FLYINKRVAN KLLQRRKEVL QKNFIGIVKR VNHSTVLIES LNERKIREVM IMKYKVNPAS FSNDEAFLIH LVKKNFRSPF KPLVVGPLHG KEEGTEIFSN YVVIPFHPFL ERFSVLRVES LTDSLDPGII LSTPITSDGI PLPLAFADKV AKEVSNAVFN MLLQQLSREG IQSSFYSRLE GLGA
|
| |