Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0409 |
Symbol | |
ID | 5105526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 361763 |
End bp | 363343 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506315 |
Product | amino acid permease-associated region |
Protein accession | YP_001190510 |
Protein GI | 146303194 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAATC AAAAGGCAGT TCCAAGATTA AGAAGAGGAG TAGTTGGCAC GCTTGAGGCT GTAGCTCAGG AAATAGCTGC AATGGCGCCC GCGTGTGATA CTGTCGCCTT CATTACCTCA GCTGCGGCCT TTGCGTTCGT GCTAACTCCC TTCGCGTTTT TACTAGCGAT GCTTACCATG TTCATAGAAG TAAACACGTT ATATCATCTG TCAAAGAGAC ACGCAAGTGC AGGTGGATAC TATGGTTACG TGGCCACTGC CTTAGGTCCC TTTACAGCAA TAGTAACTGG TCTCATGTAT CCGGTATATC AAATAGCTAG TACTGCAGCT ATTCCTGTTT ACGTTGCAGG CGTGGTACTC CCAGGGGTAT TGAATTACTT CTTTGGGATC AGTTTGCCCG GATGGATCTG GATCCCCTTC ATCTTAGGCT TCATTCTGGT TCCAATTGGG CTCGCCATTG TGGGGATAAG GCCTCAAATG AAGTACATCA GATATGCTGC TATGACGGAA ATAGCGTTTC TGGCTATAAC CTCGTTATAC ATAATCGTAA TGTCCCCTGA CAATACCCTT AACGTATTCA ACCCGTTTGC CTGGAACTCG GTTTACGGAG CTCAATTCGC ACCCCTTGGG GGTCCAATTG CTGGATTAGG TCTTGGAATG ATATTCGGGT TGACAAGCTT CATTGGATAC GGAGGCTCAG CACCCTTAGG TGATGAGGTC AAGAATAGTA AGGCGATTAC AAAGGCACTG ATGCTTGGGG TAAGCATTGT GGGAATAGTG CTCACCGAGG TAAGTTACGC TTTAACGGTG GGATGGGGAG TGAATAACAT GACGAGTTTT GCCAACAGTA GTATACCTGG GATAATTGTG TACGCCCATT TCATGGGAAT TGTGGGTGGA CTCATGCTTG CCCTATTCGC ATTCAATTCA GCTTTCTCGG ATAGCGTTGC GATGCAATCC AATGCAGGAA GGGTCTACTT TGCCATGGGA AGAGACGGAA TACTCCCCAA GTTCTTTGCC TATGTTCACG AGAAATGGAT AACTCCTAGT AAGGCACTTC TCTTTGTGGG GATTGCCTCG AGTATTACAG CAGTGCTATC GGGTTTCGTG ATCGGGTGGT TATCAGGAGT TTCACCTTTG CAAATGTTTA CGTTAAGTGC TACTTCTCAA CAGGTCTCTT TGGCGTTAAG TAACGTTTTT GATTTCCTCA CTACCATTGC ACTTGTGGGA TTTATTGTGG CACATTTTGC GAACAACACT GCAGTAATGG TAATGTTCTA CAGGCTCAAG GAGAAACACA CAGGGGTTAA CAGGATCCTT CATCCCGTGA TGCACTACCT TTTACCAGCA GTAGCAACAG CCATATTTGC CTTCGTTCTC TACGAGTCCA TATGGCCTCC AGTATTTCCA GTAACGCAAG CGGTAATCGT GGGCGTAGCC TTCTTGGTAT TTTCAATATT TTACACTCTC AGGATAAAGA GAAATAACCC GAAGGCCTAC AAGAACGCAG GGATCACGGT GAATATTGTG GAGGAAGAGA AGTTGGAGAA GATGAGCAAA GCTGAAAAAG ATCCAAATTA A
|
Protein sequence | MSNQKAVPRL RRGVVGTLEA VAQEIAAMAP ACDTVAFITS AAAFAFVLTP FAFLLAMLTM FIEVNTLYHL SKRHASAGGY YGYVATALGP FTAIVTGLMY PVYQIASTAA IPVYVAGVVL PGVLNYFFGI SLPGWIWIPF ILGFILVPIG LAIVGIRPQM KYIRYAAMTE IAFLAITSLY IIVMSPDNTL NVFNPFAWNS VYGAQFAPLG GPIAGLGLGM IFGLTSFIGY GGSAPLGDEV KNSKAITKAL MLGVSIVGIV LTEVSYALTV GWGVNNMTSF ANSSIPGIIV YAHFMGIVGG LMLALFAFNS AFSDSVAMQS NAGRVYFAMG RDGILPKFFA YVHEKWITPS KALLFVGIAS SITAVLSGFV IGWLSGVSPL QMFTLSATSQ QVSLALSNVF DFLTTIALVG FIVAHFANNT AVMVMFYRLK EKHTGVNRIL HPVMHYLLPA VATAIFAFVL YESIWPPVFP VTQAVIVGVA FLVFSIFYTL RIKRNNPKAY KNAGITVNIV EEEKLEKMSK AEKDPN
|
| |