Gene Information |
Locus tag | Msed_1819 |
Symbol | |
ID | 5105382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1765899 |
End bp | 1767323 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507718 |
Product | amino acid permease-associated region |
Protein accession | YP_001191897 |
Protein GI | 146304581 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
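The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch, not part of the original record: the function name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAG).

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    protein_length_aa = codons - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Values copied from the record for Msed_1819.
gene_bp, protein_aa = cds_lengths(1765899, 1767323)
print(gene_bp, protein_aa)
```

With the values from this record, the result agrees with the listed Gene Length (1425 bp) and Protein Length (474 aa).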
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.757233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATGAGA AAAAGAAACT GGAAAATGAG GCTCAACCAA GCCTGAAGAA GGACGTGTTG GGGACGTGGT TAGTGGCAAG TTACGGTATA GCTGCAAACG CTCCCATTGC AGTTGCCACG CTGTATTTCG TGGGGATTGC TGGTATAGCT GGAGGTGCCA TGCCCCTAGT GGTTCTGCTT TCCTACCTGA TTTACGCTAC CACACTGATC GTGATATACG AGTGGAGCAA GGACGTGGCT TCATCTTACG GCTACGTGGC CATCATGAAG AAGGGTCTTA ACAGCAGTTT AGCTGCTTTC ACCGTGGGAT ACGGTTACAT TTATCAGTAC CTGGTCGCAG GGACTGCCGG TTTCGGCATA CTCGGCCTAG CTTCCTTCCT TTACTTAATA TCGCCAAGTA TAGCCTCAAC GATGCCGTGG CTGTGGGCCC TGATCACGGT GATCCTGACG CTCGAGGTTA CCCTTGTAAT GTGGTTGGGC GTGAAGCCCG GTGGTCTGCT CAACCTCGTG ATAGGTCTAT TTTCCATTGG ATTTCTCGTG GTAACGTCTA TCTCTCTCAT AGCCGTTGCG GGAAGTCACA ACACCGTAAG CGTGTTCACC GCATCCCCGG TGAACAACAA CTGGGTTCTA ATACTGGTTT CAATGATCTT TGCGATCACG ACGTTTGGAG GAGCCACGAC TCCCATAGGC GTTGCCGAAG AGGCCAAGGT ACCAAAGAGA ACCATGCCCA GGGCACTCCT CCTCGGGTTT GGACTACTTG GAGTGGGGCT AATTCTCAAC TCCTATGCGC AGACCGTGAT CTACGGAGTG TCCAACATGT TCAACTACGC CTCTCTCCCT GATCCCATGG TGATCATCTA CAGCAAGTAT TTCAGTCCCG TGATTGTGGA TCTACTAATA GTACTGGTTG CATTCATGTT CAACTCCTCT ATCATTTCCT TTGCGACCAG CGGTAGCAGA ATGATATACG GGATGGCAAG GGACGGAATA CTCTATCCAA GCAACTTCTC GAAGGTGAAC AGGCACGGGG CTCCCGGTAA CGCAATAATA TTGACTGGAG TTATCGCTGG GGCACTTTGC CTGCTAACCG GTTACCTTCT AGGTCCCCTG GAGGCCAGCA TCTTCCTAAT AACCTTCGGC TCATTTTACG TTTCTCTCGG GCACCTGTTT GCTGCCTTGG CTCTCATTAG AAGAAAGGTG AAGCTGGGGA GGCCAGACAT CGCCAAGCAC GTGTTAATCC CCATAATCTC CATGGGCGCT TACGTGGCTA CGATATACTT CGGAACCTAT CCGGCACCAG CGTTTCCCTT GAACATAGCA GTGTATTCGG CCTGGGCCGT GTTGGCGGTT CACGTCGTAG TGTATTATTT GATGAAGAGA AGATATCCGG AAAGGCTAAG CAAGTTCGGG GATCACAGTC TATAG
Protein sequence | MDEKKKLENE AQPSLKKDVL GTWLVASYGI AANAPIAVAT LYFVGIAGIA GGAMPLVVLL SYLIYATTLI VIYEWSKDVA SSYGYVAIMK KGLNSSLAAF TVGYGYIYQY LVAGTAGFGI LGLASFLYLI SPSIASTMPW LWALITVILT LEVTLVMWLG VKPGGLLNLV IGLFSIGFLV VTSISLIAVA GSHNTVSVFT ASPVNNNWVL ILVSMIFAIT TFGGATTPIG VAEEAKVPKR TMPRALLLGF GLLGVGLILN SYAQTVIYGV SNMFNYASLP DPMVIIYSKY FSPVIVDLLI VLVAFMFNSS IISFATSGSR MIYGMARDGI LYPSNFSKVN RHGAPGNAII LTGVIAGALC LLTGYLLGPL EASIFLITFG SFYVSLGHLF AALALIRRKV KLGRPDIAKH VLIPIISMGA YVATIYFGTY PAPAFPLNIA VYSAWAVLAV HVVVYYLMKR RYPERLSKFG DHSL