Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1528 |
Symbol | |
ID | 5104056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1489135 |
End bp | 1490364 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640507415 |
Product | hypothetical protein |
Protein accession | YP_001191608 |
Protein GI | 146304292 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.275544 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTT CAGATGTTGT GAACTTTTCC TTTAAAGCCT TAACCTCGAA GAAGCTAAGA ACGAGCTTAA CAATCTTAGG AATACTTATT GGGCCAGCTA TAGTTGTGGG ATTAACGGGT CTAACCCTAG GATTTTCCTC GGTTCTAACT CATCAGCTCT TCTCCAGTCT ATCTCCTACC GACATCTTTG TGACTCCGGG TACAAGCACA ATTACCCCCT ATACCATACA GGAGATATCA CACATACCAG GAGTTAAGGC AGTAGTTCCG TTTTATCTTA TCTCTGGTAC AATTGAAACG CCTAGTGGTC CTATGGCTAC AGAAATTCTC TCTATTAGCA CTAGCCAAGC TTCTGAGGTG TTTCCTGGAC TAACCTTGCA GCAAGGTCAA TATCCCTCAC CCTACTCCTC ATATGGAGCA GTAGTTGGTT ATTACATTGC ACATCCACAG TATCCAGGTC AGCCCACTTA TCAGGTAGGT CAAACTATAA CCGTGGTGAT GAATACCCCA AACGGGAAAA TTACCAAGAC GTTCCTGGTC ACTGGTAGTT TTAACGAGTT TAGTAGTGCC TTCGCGGATA TAGATAAGGC CTTAGTTGTG CAGAATATAG TGGGCTCACA GTACTATGGG GATCAATACA GTGGTTTAAT AGTGGAGGCA AGTAGTGTGT CACAGGTAAA CAATGTTGTA AATTCCATAC ATAACGAGCT CGGAAGATCA GTAAGTGTTA CATCCGTGGA GCAGTTCATC ACTCTCATCA ATAACTCCCT ATCTGCGGTT AATGGTCTTC TTTTCGTAGC AGGCGCATCG TCCTTCATTG TGGCCTTCGT GGGAATATTG AGTACTATGT TTACCACAGT GGTGGAGCGA ACAAGGGAAA TAGGAGTTCT GAGGGCAATA GGCTTCACCA GGAGAGGTAT AATGGTGATA TTCGTGACTG AGGCCATATT GATGGGCCTA CTTGGAGGCA CAGCTGGAGT GGGTGCAGGG GTTGGGATGG GATACCTTCT TACCACACTT ACCAATTCTG GAGGACCTGG GGCTGGAAGA GGAAGCGCTG CAGCGTCCGG TGGCCTCGGA ATCAGCGCAC ATATTACTCC AGTATTTGAA CCAACATTCA TTGCGGAAGT GATACTTATA ACAGTCATTT TCAGTCTCTT CGCAGGAATC ATCCCTGCAT ATAGGGCGTC CAGAATCGAA CCAGCTGTGG CATTAAGATA TGAAGTGTAG
|
Protein sequence | MKLSDVVNFS FKALTSKKLR TSLTILGILI GPAIVVGLTG LTLGFSSVLT HQLFSSLSPT DIFVTPGTST ITPYTIQEIS HIPGVKAVVP FYLISGTIET PSGPMATEIL SISTSQASEV FPGLTLQQGQ YPSPYSSYGA VVGYYIAHPQ YPGQPTYQVG QTITVVMNTP NGKITKTFLV TGSFNEFSSA FADIDKALVV QNIVGSQYYG DQYSGLIVEA SSVSQVNNVV NSIHNELGRS VSVTSVEQFI TLINNSLSAV NGLLFVAGAS SFIVAFVGIL STMFTTVVER TREIGVLRAI GFTRRGIMVI FVTEAILMGL LGGTAGVGAG VGMGYLLTTL TNSGGPGAGR GSAAASGGLG ISAHITPVFE PTFIAEVILI TVIFSLFAGI IPAYRASRIE PAVALRYEV
|
| |