Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1447 |
Symbol | |
ID | 5104817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1414270 |
End bp | 1415532 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507335 |
Product | amino acid permease-associated region |
Protein accession | YP_001191528 |
Protein GI | 146304212 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAGTA AACCAAAAAT CTCACCCACT GAGGTGTTCT TCCTGTCCTT TGGCGGACAA TCGCCCTTCA TTTCACTCAT GGCGTTCGGC ACAGTGATGA TTTCCTACGT TGGTATCCAT TCTGGGTTCG CGATGATAGT TACTACCCTC GTGGTAATGG CTAACGCGTC AGTGGTTTAT TCACTTTCAA AGAGATTCAA CAAAGGAGGT GGGTATTACA CCTATGCCCT ACACACGCTG ACCAATAACT TGGGCATAAC TACGGGCTGG ATGTATATTC TTTACTCGTT AAGCTACGGT GGTACCTTGA TGATGGGTGG GGTCTATGTA CTTAACCTGT TAACAGGTAT TAGTCCCCTA TACCTTACCC TTATAGTTTC AATTCTTGCC TCCACGATAG TTATTGCTGG AGTGAAGCTT TCAGCCAAGT ACGCAGTGGC CGTGGGCATA TTGGAGATAA TAGCAATCCT AGGTCTCTCG ATTTTCTTCA TGTATAGATC TGGGTTTGCG TTCTATAATC CTATTCCCAC TTCTCTTCCA ATGAACCTGC CAGAGGCAAT ACTTTTCGGT ATTGGAATAC CATCGGGCTA CTCCAGCATA GTCAGCTACC CCGAGGAGAT TGAGAACGCT TCCAAGACAG TGAGCAGAAT TTCCCTCTTA GTTCCAGTCA TAGGAGGTGG ATTGGCATCC TTCTTCTTCT ACGCTTTAGC GGCCCTAGGT TTCACGGGTA ATCTAGTTGA GTTGCTCACC TCAGAGTTCG GACTTGTAGG GGGTATCCTG ATATCCGCCA TAGCCCTGAG TGATGCTGTG CTGGGAGGAA TAGCGTACCT GTTGGCTGGG TCAAGGACTC TCTACAACAT GTCCAAAAAT GGCCATCTAA TCAGTTATCT CGCGAGGGAG TATAAGGGTC AGCCCAAGGT GGCCGAGGTA CTAATCTCGG TGTTGGTGAT ACTCTCACTC TCTTTTCTCT CAATGAACTT CAGTCCTCTG GTGGCGCTAG GCCTGATTGG AGGGGTATCA GGAATGAGTA ACCTTTACAT CCATATGGCG GCTGGGGTCT CTCTCGCCAG AATGGGAAGG AAAAAGCCCC TGAAGCATCT CCACGAAATA GCCTTCTCCG TTGTTTCCCT AGCTTTCTCG GCCTGGGTCC TGCTCATTTC ACTGGTTCAG CTAGAGAAGT ACGTGGTTTA CTTCTTCTTG GGTTGGATAA TTCTAGGTTT TCTCCTAGCT GAGAGCCTTG AAATGGTTAA GGAGGAAGAG TAA
|
Protein sequence | MGSKPKISPT EVFFLSFGGQ SPFISLMAFG TVMISYVGIH SGFAMIVTTL VVMANASVVY SLSKRFNKGG GYYTYALHTL TNNLGITTGW MYILYSLSYG GTLMMGGVYV LNLLTGISPL YLTLIVSILA STIVIAGVKL SAKYAVAVGI LEIIAILGLS IFFMYRSGFA FYNPIPTSLP MNLPEAILFG IGIPSGYSSI VSYPEEIENA SKTVSRISLL VPVIGGGLAS FFFYALAALG FTGNLVELLT SEFGLVGGIL ISAIALSDAV LGGIAYLLAG SRTLYNMSKN GHLISYLARE YKGQPKVAEV LISVLVILSL SFLSMNFSPL VALGLIGGVS GMSNLYIHMA AGVSLARMGR KKPLKHLHEI AFSVVSLAFS AWVLLISLVQ LEKYVVYFFL GWIILGFLLA ESLEMVKEEE
|
| |