Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1105 |
Symbol | |
ID | 5103579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1033096 |
End bp | 1034280 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640507000 |
Product | hypothetical protein |
Protein accession | YP_001191193 |
Protein GI | 146303877 |
COG category | [R] General function prediction only |
COG ID | [COG1672] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.42011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACTAG TACTACAGAG AAAGGAGTGC GAAGAGCTCA AGCGGGTAGA TTACTGGATA TTGCTTTTCG GGAGAAGGAA AACTGGGAAG ACGACGCTAA TCAAGAACTG TGCCAAATAC GATTATTTCG TCACAATAGC CAATGAATCT GAAGGTCTGC TCGAGGACGG TGAGAGGATA GGAATCCCAG AGCTGTTGAG GGAAATTCGC TCAGAGGTGA GAAGGGGTGG TAGGGTGGTC ATTGACGAGT TTCAGAGATT ACCAGAGAGG TTCTACGCGG ACATCTCCAC TCTCGACAGG ACGGGAGGGC TAGTCCTAGC CGGCTCGAGT TACGGGGTTC TCAACAAGGT CTTTGACAGT AATTCCCCAC TTCTCGGCCT TGTGACCCCA AGAGAGATTC CCATCCTGAG GTATGAGGAG GTCTTGTCCC AGGTTGGGGA CCCAGTTCTC TCAACTCTCT TCAGGGATCC GTGGGTTATC CCCTTCGTGA ACTCCTACGG GGAATTCCTG GATAGGATAA GGGAGTTCTC CCTCATCTCC AAGGGGTTAG TTGGCGAAAT ATTCAAGGAG GAGGAGAGAA GCCTGACCGA GCTGTACTAT CAGTCCCTCC TGAAGGTGGC TGAGGGCGTG TGGAAGACCT CGGATCTAGC GGGTATCCTC CAGGTTAAGG GAGGAGAGGC CACCGTTTCC TCCCTGATGA ACCGGTTAAG TAAGATGGGC TTAGTGAGGA AGATCAGAAC CCTGGGGAAA GAACTGTATT ACAGGCACGT CTCCCCAGTG ATCTCGTTGG CGTTTTACGC TGAGTCAAAA TATCTAGTCA GTGACAGAGA CGTCAAAATC CCAGAGTTGC CCATTGGCCT CGAGGTTCAG TTCTCAGTGG GCGAGATGAT CAGCGAGTAC TATGGCGGAG ACTTCGTTTA CTCCCCTAGG GAAGACATCG ACGTGATTGT CATGAAGGGG AGGAGAAGGT TGATTGCCTT TGAGGTCAAG ATGGGCGAGA TAAGTGAGTC AGAGGCAAGG GAGGCGGTGA GGAGGATGGG TAGGGTCGCG GAGAGGGTGG GGCTGATAAG CTTGAGGGAG AGACCCCCCG AGATCGGTGA CGTCTCCTTG GGGCCTACGG AGCTTCTTGA GATGTCCAGG GAGTTGGTGG CGGGGAAAAG GGTCCCAGGT CCTGAGGTGG ATTGA
|
Protein sequence | MRLVLQRKEC EELKRVDYWI LLFGRRKTGK TTLIKNCAKY DYFVTIANES EGLLEDGERI GIPELLREIR SEVRRGGRVV IDEFQRLPER FYADISTLDR TGGLVLAGSS YGVLNKVFDS NSPLLGLVTP REIPILRYEE VLSQVGDPVL STLFRDPWVI PFVNSYGEFL DRIREFSLIS KGLVGEIFKE EERSLTELYY QSLLKVAEGV WKTSDLAGIL QVKGGEATVS SLMNRLSKMG LVRKIRTLGK ELYYRHVSPV ISLAFYAESK YLVSDRDVKI PELPIGLEVQ FSVGEMISEY YGGDFVYSPR EDIDVIVMKG RRRLIAFEVK MGEISESEAR EAVRRMGRVA ERVGLISLRE RPPEIGDVSL GPTELLEMSR ELVAGKRVPG PEVD
|
| |