Gene Information |
Locus tag | Msed_2166 |
Symbol | |
ID | 5104905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2082658 |
End bp | 2083572 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640508059 |
Product | hypothetical protein |
Protein accession | YP_001192229 |
Protein GI | 146304913 |
COG category | [S] Function unknown |
COG ID | [COG1992] Uncharacterized conserved protein |
TIGRFAM ID | |
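The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2082658, 2083572)  # Start bp and End bp from the record
    print(length, protein_length_aa(length))
```

With the values in this record, the sketch gives 915 bp and 304 aa, matching the Gene Length and Protein Length fields.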
| |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00217357 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCTAGATA CACCACTGTC GCTAATCACG GACATACTAC TACCTAACGT TAGGGGATTA GTAGCTAAAA GGCTTAGAGC TCAGGGTATG AGCCAGAACA GGATTGCTGT ATTGGTGGGC GTAACACAAC CGGCGATAAA GCAGTACCTG GACGAGGATG AAGATGATCT CCGCGGGAAG CTTGGTGAAG CGGGCCTCAG TAACGAGGAA ATTGACTCCC TGGTATCTAA CCTGGTAAGC CTGGTTTCCA GCGGGAGAAA AGAGGAGGCC TCTCTTTACT TCACCACGTT CGGATTGATG ATGCTAAGTC AATTGAAACT GTGCAACTTT CACAGACGCG TGAACTCCTC GATTTCCTCG GACTGCAGAA TCTGTCAGTC TCTCTACAAG GAGGATGAGG AGAGCCAATT ACAGTTAGCC CTATCCCTTC TAAGGAACGA GAGCGTATCT AAACTGATCC CTGAGATCTT AAGCAATTTA GCTTACTCCA AGAGGGATGC GAGGGAAATA CTTGACGTGC TAGCCATTCC AGGAAGGATA GCGGTGATAG GTGGGGTTCC AACCCCGGCA TCCAGACCCA CGTGGGGCGG TAGCAGGCAC CTTGCCACCA TACTCCTTGA AACTAGAAGG AAATGTGAAA GGTGGAGGTC AGTAATGAAC ATCAAATATG ACGAGAAAGT GGAGGAGGCC ATATTGAAAT CTGGACTCAG GCTAGTTAAG GTTGGTCCCT CGGATAGAAG GGACGATCAA TCCATTGCTA ACATGGTAGC CTCCGTCATA TCCTCCTGCC CTGATGTGGT CGTTCACCTT GGAGGGAACG GGGTTGAGCC GAACTGTTAC ATTTTCGGGG AGAATCCCTT AGAGGTATCC TCCAAGGTAA ACAAGATAGC TAAGCTTTGT TGCGAGTCCT CCTAA
|
Protein sequence | MLDTPLSLIT DILLPNVRGL VAKRLRAQGM SQNRIAVLVG VTQPAIKQYL DEDEDDLRGK LGEAGLSNEE IDSLVSNLVS LVSSGRKEEA SLYFTTFGLM MLSQLKLCNF HRRVNSSISS DCRICQSLYK EDEESQLQLA LSLLRNESVS KLIPEILSNL AYSKRDAREI LDVLAIPGRI AVIGGVPTPA SRPTWGGSRH LATILLETRR KCERWRSVMN IKYDEKVEEA ILKSGLRLVK VGPSDRRDDQ SIANMVASVI SSCPDVVVHL GGNGVEPNCY IFGENPLEVS SKVNKIAKLC CESS
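The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is one way to reproduce that in plain Python. It makes two assumptions: the codon assignments of table 11 are taken to be the same as the standard genetic code, and the first codon (here GTG) is read as the initiator methionine. The name translate_cds is hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (assumed valid for table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str) -> str:
    """Translate a CDS on the + strand, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    The first codon is emitted as M, assuming it is a valid start codon.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 20 bases of the gene sequence, as grouped in the record.
    print(translate_cds("GTGCTAGATA CACCACTGTC"))
```

Run on the first 20 bases of the gene sequence, this prints MLDTPL, which matches the start of the protein sequence. Passing the full gene sequence should yield the complete protein under the same assumptions.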