Gene Information |
Locus tag | Msed_1181 |
Symbol | |
ID | 5104477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1150971 |
End bp | 1152443 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507073 |
Product | extracellular solute-binding protein |
Protein accession | YP_001191266 |
Protein GI | 146303950 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
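The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 1150971
end_bp = 1152443
reported_gene_length = 1473     # bp
reported_protein_length = 490   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, ASSUMING the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # prints: 1473 490
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length fields.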
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.595766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCCG TGATTGCGGT GGTCGTGATT GTGATAGCAG CAATTGCTGG ATACATGATG CTTAGCAGGC CGAGCTCCAC ACCTCAGACC CCGACGAACT CCGGTTCAGG CGGGTCATCG AGCAACGGGA CTACGACTTC AACCTCCAGC ACTTCCAGTG GTGGAAACAC GGGTAACGCA ACCACGCTAA ACATTATTGC CTGGTCAGGT ATATGCGGAG ATTACATTAC CGAGGCCGGT AAGCTGTTTG AGCAACAGCA TCCGGGCGTC AAGGTTAACG TGTATACATA TCCCTTCAGC CAATACATCA GCGTAGAGTT AACTTACCTC AAGGCCCACT CCTCCCAATT TGATATAATG TCATACACTC CAACGTCGTC GGCCCTGATT GCCCCCTATC TAGTTCCCTT AAACCTTACG CAGGACTTTA ACACCTCCGA CCTAATAATG CCCCAGGAGA CGTATGCGGG GATAGTCTAC AACTACACAA CCCATCAGAA CATAACCGCA GGTATAGCTT ACGCGACCTG TGTTGACACC CTGGTTTACA ACACCACAAT CTTTGACAAC AAGACCCTAC AGAACGAGTT CTACAACGAG TATCACATGA ACTTCTCACC CCAAACCTGG CAGAACTGGA CCGTGGTCAT AGACGTGGAC CAGTTCCTCA CCTCGCATCA CGTGACCAAG TACGGAATCC TGTTACCTGA CGACACCTCC CACAACATAT GGAACTCCTA CCTGGTGATC TTTGGATACT ACTACGCGAG GAACTCGAGC CTAAATGACG GACTCATCTC AGGCCTTCCT GAGTTTCAGG TATACTTCCA GGGTAAGATC CTTCCCGGCT ACAACTTCCC ATTACCCTCC ATTAACTCCA CCGCCGGTGT TCAGGCCCTT GAGGTGTATA AGCAACTAGT CTCCTACATG ATTCCGCCAT CTCAGGTTCA GATAACCTAC GACAACATCA TCAACTACCT TCCTCAATCC CCAGGTATGA TATCGTGGCC AGGACCCCTC GGAGGCCTGA ATCAGAGCCA GTTGGAGCAA CTTGCATACG CCCCCTTACC AGGTGGTTAC GCCGTTGCCG GTTCTGACTT TGTGGGAATA AGCAAGTACT CTCAGCATCA GCAGTTGGCC CTAGAATTCC TGCAGTTCCT GGTCTCGCCA CAGGTTCAGG CGAAGCTCTT CTACATGTAC GAGATATTCC CAATCTCTAA GGAGGCCTAC AATATCCTGT TGTCTAACAC GTCTCTCCCC TCATATGAGA GGCAGTGGTT AACAGTGATG CCCACCATAG CCGAGGAGGG ATGGGACGTC GGACCCATCA TTCCTCCGAC ATACGAGCAA CTCCTTCCCA CCTTCAATAA CGAGGTCCTA CTCTACCTAG AGGGTCAGGT GAACAACCCA CAGCAGGTCC TGAACACCAT AGCCTCGCAG TGGATGCAGT ATCTCAAGTC CTACTATGGT TAG
|
Protein sequence | MIAVIAVVVI VIAAIAGYMM LSRPSSTPQT PTNSGSGGSS SNGTTTSTSS TSSGGNTGNA TTLNIIAWSG ICGDYITEAG KLFEQQHPGV KVNVYTYPFS QYISVELTYL KAHSSQFDIM SYTPTSSALI APYLVPLNLT QDFNTSDLIM PQETYAGIVY NYTTHQNITA GIAYATCVDT LVYNTTIFDN KTLQNEFYNE YHMNFSPQTW QNWTVVIDVD QFLTSHHVTK YGILLPDDTS HNIWNSYLVI FGYYYARNSS LNDGLISGLP EFQVYFQGKI LPGYNFPLPS INSTAGVQAL EVYKQLVSYM IPPSQVQITY DNIINYLPQS PGMISWPGPL GGLNQSQLEQ LAYAPLPGGY AVAGSDFVGI SKYSQHQQLA LEFLQFLVSP QVQAKLFYMY EIFPISKEAY NILLSNTSLP SYERQWLTVM PTIAEEGWDV GPIIPPTYEQ LLPTFNNEVL LYLEGQVNNP QQVLNTIASQ WMQYLKSYYG
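The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one possible way to do that and to compute a GC percentage and check for a terminal stop codon. The helper names are hypothetical, and the inputs are only short excerpts (the first two and last two blocks of the gene sequence), not the full record.

```python
# Stop codons under translation table 11 (same stops as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# Excerpts only: first two and last two blocks of the gene sequence above.
head = clean_sequence("ATGATCGCCG TGATTGCGGT")
tail = clean_sequence("CTACTATGGT TAG")

print(len(head), gc_percent(head))  # prints: 20 55.0
print(ends_with_stop(tail))         # prints: True
```

Passing the full Gene sequence field through `clean_sequence` and `gc_percent` would give a value to compare with the reported GC content field; that comparison is not performed here.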