Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0866 |
Symbol | |
ID | 5105225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 798923 |
End bp | 800299 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506770 |
Product | general substrate transporter |
Protein accession | YP_001190963 |
Protein GI | 146303647 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.580989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.629225 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGATA TATTTAAACC GTTAGACGAG AAAAGGCTTT CCTTTTTCCA TATCAAATCC CTTGTGACCA CGGGAATGGG TGTTTTCACC GATGGATATG ACCTTTCCTC CATTGGTATT GTCCTCCTGT TAGTCCTCTC GGAGTTCGGA ATAACGTCAA AGAGCCCAGA TTACGTTTCC CTTACTGCGG CAATTTCCGG ATCCGCGCTA GCAGGGGCTG CAATAGGCGC CATAATCTTT GGTTTGCTAT CTAACCAGGG AAGGAAGAAG TTTTACGGGA TCGACGTAAC CCTCATGACA GTGGGAGCCC TCTTACAGGC CTTCGTGACA GATCCCACAG AGTTGATAAT AGTTAGGTTC CTTTTAGGAC TAGGAGTCGG TGCCGACTAC GTTCTATCTC CCATGATCAT GGCTGAGCAC GCTAACGCCA AGGATAGGGG AAAGATAATT GCTCTAGGAT TTGGTTTGTT TTGGGGATTT GGCGCGACGC TAGCAGCTAT TCTATATCTT GCCCTACAGG CGGCAGGGGT ACCTCCTTCA CTAGTCTGGA GAATTGTGCT GGCAGCAGGT GCGATACCCT CAGCTTCTGT GATCTATCTG CGAAGAAAGA TTCCTGAAAC TGCGAGATTC CTTGGAAGGA TAAGGGGAGA TACTGAGGGA GTGAAGGGAG TAATTAGGGA AGTCACGGGA ACCGAGGTAA ATCTCACCTC TAATCTCAAG GATAACACTA GCTTCGGCGA GTACTTCAGG AGGAACTGGG CACTTTTCCT ATCGGCATGT ATACTTTGGT TCCTCTTTGA CATAGTGGCG TACTCGGGAA TATTGTTTGG TCCAAGCCTC ATTGCGAGCA GTCTAGGAAT TAATTCAGGA GTATTTCAAC TTTTAATTGA AGGAGCCTTC ACCATACCTG GAGGGATAAT AGCTCTGTCC TTGATTGACA GAGTTGGAAG GAAACCACTG CAGGTTGTAG GATTCGTGGT AATGGCAGTT GCCTTAATGT CCTTTGCCTT TTACAAGAGT TCAGCTGGGG CATCTTTCTC ACCTATTATA GCCTTCTTCC TTTACGGACT GCAGAACTTG GGGTCACAGG CTGGACCTGG CTCAGTATCC GCATCAGGAA TTCTTGGGGT AGAGCTTGCT CCAACCAAGG TAAGAGGACT GGTGCAGTCA CTAACAGTGG CATCAGGAAG AATAGGAGCC ACTCTGACAT CCTTCGTCTT TCCATCACTT TTCCATGAGT ACGGCGAGTC GTTTGCTGTG TATTTCCTTG CAACCATAGC GGCAATTGCT GCGGTCATAA CTTTAGTAGC CATACCTGAA ACGAAGAGGA AACCGTTGGA AGAGTCCTCT AGAGAAGTAA GTGTAGAGAC TGCGTGA
|
Protein sequence | MKDIFKPLDE KRLSFFHIKS LVTTGMGVFT DGYDLSSIGI VLLLVLSEFG ITSKSPDYVS LTAAISGSAL AGAAIGAIIF GLLSNQGRKK FYGIDVTLMT VGALLQAFVT DPTELIIVRF LLGLGVGADY VLSPMIMAEH ANAKDRGKII ALGFGLFWGF GATLAAILYL ALQAAGVPPS LVWRIVLAAG AIPSASVIYL RRKIPETARF LGRIRGDTEG VKGVIREVTG TEVNLTSNLK DNTSFGEYFR RNWALFLSAC ILWFLFDIVA YSGILFGPSL IASSLGINSG VFQLLIEGAF TIPGGIIALS LIDRVGRKPL QVVGFVVMAV ALMSFAFYKS SAGASFSPII AFFLYGLQNL GSQAGPGSVS ASGILGVELA PTKVRGLVQS LTVASGRIGA TLTSFVFPSL FHEYGESFAV YFLATIAAIA AVITLVAIPE TKRKPLEESS REVSVETA
|
| |