Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2206 |
Symbol | |
ID | 5105426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2116973 |
End bp | 2118334 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640508099 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_001192268 |
Protein GI | 146304952 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component |
TIGRFAM ID | [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000504054 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTCAAG TAAAAACTAG GGAAAAGGTA CTAGAGGTCC AGAACCTCAG GTTGAGCTTT TACACGAGAA GAGGAGTCTA CAAGGCACTG AGGGGGCCTA CCCTTAATCT CTACTCAGGA GAAGTACTTG GGATTGCAGG TGAAAGCGGA TCTGGTAAAT CCACACTGGG ACTGGCAATA ATGGGGCTTC TACCGAGAAA CGCTAGGGTA GAGGATGGAA CAGTTCTTCT AGATGGAATG GACATGATTA AGCCCCTAAG GGACTACGGT TCTCAAGGCT CTAGGTTTAG CGTAAAGAAA AATGAGAAAA TTATCAAGAG GCTCAACAAG ACACTCCAAA CCGTAAGGGG AAAGAAGATA TCCATGGTAT TTCAGGAGCC GTTAACAGCA CTTAACCCAG TACTGCCCGT TGGATATCAG ATAGCTGAAG CCGTTTACTT CCACGATCCA GAAAGGCTAA TCAGAAGGGC ATTGAGCAGG CAGAAGGTTA CTCATGAGGA ACTTCGTGAG CTCTTGAACG TTCTTAAGGC TCAAGGAGAG GAGAGATTGC TAGAGGAAAT TGAGAGGAAA GGCTTACAGG GATTAGATGA GCAAATTCTC TCCATCTGGA GACGCAGGGA TATACATGAA GCCAGAAAAG AGAAAATGAT CCTAAACCTA GCTAACGTAA AGTTATCCAG GACGGACTTG ATGGGTATCT CCCTTTATCA AAGAAACATG GGGAAGTTTC CACTAGCGTC TAGGTTCGCA AAAAATGCAC TAATTAGGGA AGGTTACAGA CTAGCTGTCG AATTGTTAAC ATTTCTAGGT ATACCTCACC CCGAGAAAGT AGTGAGACTA TATCCTCACG AGTTATCAGG CGGAATGAGA CAGAGGATAG TTATTGCTAT AGCTCTTGCC AACAACCCGA AAGTCGTGAT AATGGACGAA CCCACAAGCG CCCTAGACGT CACAATACAG GCTCAGATCT TGGATCTGGT GAAGGATCTG AAATCCGATT TCAATACCTC ATTCATATTT ATATCCCATG ACCTATCCGT CCTCGCTGAA GTTTCAGACA GAATTGGGAT CATGTATGCT GGACAAATAG TCGAGATCGG TTCAGCAAAG GAAATCTTCC AGGAGCCTCT TCATCCTTAC ACTAAGATGC TGATGGAGGC CATACCAACC ATGGATAAGA CTGTCCTGAA AACAGTTCCA GGCTCAGTCC CAGACATGAG AAATCCACCT CCAGGTTGTG CCTTCTCACC AAGGTGTCCC TTTGCCATGG AGGAATGCAG GACAAACGAG CCAAGAATGG TGGAGGTAAA TGATGAGCAC TCCGTGGCGT GTTTCCTTGT GAGGAAGGGT GATAAGAAGT GA
|
Protein sequence | MSQVKTREKV LEVQNLRLSF YTRRGVYKAL RGPTLNLYSG EVLGIAGESG SGKSTLGLAI MGLLPRNARV EDGTVLLDGM DMIKPLRDYG SQGSRFSVKK NEKIIKRLNK TLQTVRGKKI SMVFQEPLTA LNPVLPVGYQ IAEAVYFHDP ERLIRRALSR QKVTHEELRE LLNVLKAQGE ERLLEEIERK GLQGLDEQIL SIWRRRDIHE ARKEKMILNL ANVKLSRTDL MGISLYQRNM GKFPLASRFA KNALIREGYR LAVELLTFLG IPHPEKVVRL YPHELSGGMR QRIVIAIALA NNPKVVIMDE PTSALDVTIQ AQILDLVKDL KSDFNTSFIF ISHDLSVLAE VSDRIGIMYA GQIVEIGSAK EIFQEPLHPY TKMLMEAIPT MDKTVLKTVP GSVPDMRNPP PGCAFSPRCP FAMEECRTNE PRMVEVNDEH SVACFLVRKG DKK
|
| |