Gene Information |
Locus tag | Msed_2224 |
Symbol | |
ID | 5104285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2129630 |
End bp | 2130625 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640508117 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_001192286 |
Protein GI | 146304970 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
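The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; neither is stated in the record, but both match the listed values. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon does not encode an amino acid


length = gene_length(2129630, 2130625)
print(length)                  # 996
print(protein_length(length))  # 331
```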
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000449816 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATAGTTT TAGGTATTGA GTCAACAGCT CACACTTTTG GAGTAGGCGT TGCGCAGGAT CAAGTTCCCT TTATTCTAGC TAACGAGAGG CACACGTTTG TACCTCAAAC TGGAGGGATG AAGCCAAGCG AAGCAGCGAG GCATCATACC TTAGTGGCTC ATGAAATTCT AAGAGGAGCG TTAGATCGAG CAAGAATATC AATTAGGGAC GTGGATGGAA TAGCAGTAGC TCTTGGACCT GGTATGGGTC CAACTCTTCG TGTTGGAGCC GTGGTAGCAC GCGCCCTCTC CCTTAGATTT AACAAGAAAC TCGTTCCCGT GAACCATGGA ATAGGTCACA TCGAGATAGG ATATCTCACG ACTGAGGCTA AGGACCCACT TATCCTCTAC CTATCTGGAG GGAACACTAT AATCACCACA TATTATAGGA GAAGGTTCAG AATATTTGGT GAGACCCTCG ACATAGCGCT TGGCAATATG ATGGACACTT TCGTTAGAGA GGTAGGTCTC GCTCCGCCTT ATATAGTGGA TGGTAAACAT AAGATAGATA TATGCGCTGA GCAAGGCTCC AGTATAATCG ATCTACCATA TACTGTGAAA GGAGAAGATA TGTCATTCTC TGGGTTACTT ACTGCCGCCC TTAGGGCAGT TAAGAAGCAT AACCTTCACG ATGTGTGCTT GAGCCTTAGG GAGATCGCAT ATGGCATGCT TTTAGAGGCT ACAGAGAGAG CACTAGCCCT AACAGAGAAG GGAGAAATCA TGATTGTTGG AGGAGTTGCA GCTAGTGGAA GCCTGAGGTC AAAGCTCGAA AAGTTGAGCA ATGATTGGGG TGTTGGACTT AAGGTTGTGC CTACGTCTTT TGCAGGAGAT AACGGTGCCA TGATAGCCTA TGCCGGCTTG CTGGCCTTGA AGCATGGGGT TCACATAGAC GTGAAAGATT CTACTATTCG ACCACGTTGG CGCATAGATG AGGTTGATAT TCCATGGAGG GATTAA
|
Protein sequence | MIVLGIESTA HTFGVGVAQD QVPFILANER HTFVPQTGGM KPSEAARHHT LVAHEILRGA LDRARISIRD VDGIAVALGP GMGPTLRVGA VVARALSLRF NKKLVPVNHG IGHIEIGYLT TEAKDPLILY LSGGNTIITT YYRRRFRIFG ETLDIALGNM MDTFVREVGL APPYIVDGKH KIDICAEQGS SIIDLPYTVK GEDMSFSGLL TAALRAVKKH NLHDVCLSLR EIAYGMLLEA TERALALTEK GEIMIVGGVA ASGSLRSKLE KLSNDWGVGL KVVPTSFAGD NGAMIAYAGL LALKHGVHID VKDSTIRPRW RIDEVDIPWR D
|
| |
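The sequence fields are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate the coding sequence. It is an illustration under stated assumptions, not the database's own pipeline: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard NCBI amino-acid string, whose codon assignments translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG).

```python
# Standard NCBI codon ordering: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks; return upper-case letters."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence as printed in the record.
first_codons = clean_sequence("ATGATAGTTT TAGGTATTGA G")
print(translate(first_codons))  # MIVLGIE
```

Passing the full Gene sequence field through `clean_sequence` and then `translate` should, under the same assumptions, give a string comparable to the Protein sequence field once that field's spacing is removed as well.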