Gene Information |
Locus tag | Msed_0779 |
Symbol | |
ID | 5103468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 710971 |
End bp | 711837 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506684 |
Product | periplasmic solute-binding protein-like protein |
Protein accession | YP_001190878 |
Protein GI | 146303562 |
COG category | [R] General function prediction only |
COG ID | [COG2107] Predicted periplasmic solute-binding protein |
TIGRFAM ID | |
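
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the reported 867 bp) and that the final codon of this unspliced CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
# Sketch only: helper names are hypothetical; values are copied from the record.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length(710971, 711837)
print(length)                  # 867
print(protein_length(length))  # 288
```

Both results agree with the Gene Length and Protein Length rows of the record.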
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.124322 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAACAA TAAAGATTGG TGCCCTGGCG GATTCTGGGG ACCTGTACCC CTTCATTCCC ATGATGGAGG GTTGGGTTAA GGAAGACGGG ATAAACCTGG AGTTTCAGGT AATACCCACA GTTCAAGACG TCAATATGAG CGTGCTCACC AAGGTTGTGG ACGTTTCGGT CCCCTCAGCT GCAATGTATC CGTATATTCA GGACGACTAC TTCATTTTAG GCAACGCAGT GGCCACGGCT ATTGACGGGA TCACGGGTAT GCCAGTCCTC TCCACTAGCG AAATGAGACT TGATGACCTA AAGAAGTCCA CACTCATAGT CCACGGGCCT AACACCTCTG CCTTTACCCT CTATAGGCTT CTGGTTGGCA AATACGGTAA ACTGGTAATA ATTCCAAGGG TTCTTGACGA GATCAAGGAG CTGGGGAAAA GCGGTGATGT GTTGGTAGCG GTCCACGAGA TAAAGATGAT GTACGCCATG AGGAAACTGG GCATAAAGCC CTACGTCATT ACAAGCATGT GGGATATGTG GTCTAAGATT TCTGGGGGAG CTCCAATGCC CATGGGGACG GTGGTAGTAT CAAAGGAGCT GGGTAAGGAA ATGGCCCTCA AGTTCAAGGA ATTATATGAG AGGAGCAAGA AGTTCGCGGA GAAGAACCTT GACAAGGTGA TTCCTAGGGA CGTAGAGATA ATGAGTGAGG CTCAGGGAGT AAACATGGAC AGAGAGATCG TGGAGAAGAC CATATGGGCC GACATTCAGG AGTACAACGT GCCTCAGGAA AAGGCTATGG AAGGCTTAAA CAAGTTCTAC AGCCTAACTC ACGAGAGAGG CCTTCTACCC CTGGTCAAGA GCATAGACGT AATCTAA
|
Protein sequence | MVTIKIGALA DSGDLYPFIP MMEGWVKEDG INLEFQVIPT VQDVNMSVLT KVVDVSVPSA AMYPYIQDDY FILGNAVATA IDGITGMPVL STSEMRLDDL KKSTLIVHGP NTSAFTLYRL LVGKYGKLVI IPRVLDEIKE LGKSGDVLVA VHEIKMMYAM RKLGIKPYVI TSMWDMWSKI SGGAPMPMGT VVVSKELGKE MALKFKELYE RSKKFAEKNL DKVIPRDVEI MSEAQGVNMD REIVEKTIWA DIQEYNVPQE KAMEGLNKFY SLTHERGLLP LVKSIDVI
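
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates a DNA string after stripping the 10-base block spacing used above. It assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any existing tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
print(translate("ATGGTAACAA TAAAGATTGG TGCCCTGGCG"))  # MVTIKIGALA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function would be one way to compare the translation against the complete protein sequence row.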