Gene Information |
Locus tag | Msed_2296 |
Symbol | |
ID | 5104247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2189967 |
End bp | 2191196 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640508195 |
Product | hypothetical protein |
Protein accession | YP_001192357 |
Protein GI | 146305041 |
COG category | [R] General function prediction only |
COG ID | [COG1571] Predicted DNA-binding protein containing a Zn-ribbon domain |
TIGRFAM ID | |
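The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. Neither convention is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2189967
END_BP = 2191196
GENE_LENGTH_BP = 1230
PROTEIN_LENGTH_AA = 409


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 1230
print(coding_length_aa(GENE_LENGTH_BP))  # 409
```

Under these assumptions, the reported gene length and protein length both follow from the coordinates.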
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.414005 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATGT ACGTAATTGG CGTTGACGAT CACGATTCCC CAGAGGGCGG ATGCACCACA CATTTTTCCT CGCTTTTATT GAAAGAGTTT AACAAGGCTA ACGTAAGGGT TGTGGGATAT CCTAGGTTAA CTAGGCTGAA CCCCAACATA CCCTGGAAAA CCAGGGGGAA CGCCTCAGTC TCATTCGTGG TGGAGACTGA GAGGGACCAG GCGGAACTCC TTGAGATGGT CTGGAGCGAG TCCATGAACT ACGTGGAGAG GGTGTCCAGG GGATTACTGT ACAAGAGGTC TCCCGGCGTT TCTGTGGGAA AGGTTGAGGT GATGGGGGAG CTGGAACACC TCTACTGGAA GGCGGTCAGC GACGTGGTCA CGCTGGATTA CGTCAAGAAC GTTTCGGAAA GGCTTGGGAT CCTTACCACG GGAGGTAGGG GGGTGATAGG GTCCATGGCC TCCATGGGAT TTTCAGGTAA TGGAACCTAT GAACTAGTAA CGTACAGGGC CCAGGAAAAC TGGGGGAGGA GAAGGGAGTT GGACCTATCC TCCCTGATAG AGTATGACGA GAGATATTTT CCAAGGGTGT ATGCGAATGT GGACTACGTG GACATGGAAC CACTGGTCCT GTCTCACGGA AGGGATCCCG TGCTGTTTGG TCTTAGGGGA ACCGATCCAG TCGCGTTGGT GGAAGGAATG AAGAGATTGA AGGTGAACGA GGAAGCGGAA TCGTATGTGG TGTTCGTAAC TAACCAGGGA ACTGATCACC ATTTTCGGAA CCCTAAACTT AGGCCGTACT CCAGTTTCGT GGGAGAAGTT ACCGTGAATT CTGTGAGGGT GGAAAGGGGA GGAGACTGCG TCGTGATAGG GGATGACCTG GTGATGCTGG TGTACAAGGA AACCGGGGAG TTAAACAGGG CCGTAAGGGA GTTATTGCCT GGAGATAGGA TCAGGGTCTA TGGAGCTGTC AAGCCGTCGG TTAGGTACGG GGTTGTGATA GAACCGGAAA AGGTGGAGAT CCTGAACTTG GTGCCAAAGG TGGAGGTAAA CAATCCTAGA TGTCCCATCT GCGGAGGCTC CTCCGAGTCT GCCGGTAAGG GGAAGGGATT TAGATGCAGG AGGTGTGGGC ACAGGTTTGC GGGGGAGAAG GTGGTGAGGG AAGTGGAAAG AGGAATCAGT CTAGGAGTGT TTCAGACGAG GAAGTACAGA CACCTAACGA AGCCGATTTT TTATGAGTAG
Protein sequence | MNMYVIGVDD HDSPEGGCTT HFSSLLLKEF NKANVRVVGY PRLTRLNPNI PWKTRGNASV SFVVETERDQ AELLEMVWSE SMNYVERVSR GLLYKRSPGV SVGKVEVMGE LEHLYWKAVS DVVTLDYVKN VSERLGILTT GGRGVIGSMA SMGFSGNGTY ELVTYRAQEN WGRRRELDLS SLIEYDERYF PRVYANVDYV DMEPLVLSHG RDPVLFGLRG TDPVALVEGM KRLKVNEEAE SYVVFVTNQG TDHHFRNPKL RPYSSFVGEV TVNSVRVERG GDCVVIGDDL VMLVYKETGE LNRAVRELLP GDRIRVYGAV KPSVRYGVVI EPEKVEILNL VPKVEVNNPR CPICGGSSES AGKGKGFRCR RCGHRFAGEK VVREVERGIS LGVFQTRKYR HLTKPIFYE
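The sequences above are shown in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes GC fraction, and translates a nucleotide sequence. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which table 11 also uses). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon or the end.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
gene_prefix = "ATGAACATGT ACGTAATTGG CGTTGACGAT"
print(translate(gene_prefix))  # MNMYVIGVDD
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow a comparison against the listed protein sequence and GC content.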