Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1466 |
Symbol | |
ID | 5104713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1437827 |
End bp | 1438954 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640507354 |
Product | hypothetical protein |
Protein accession | YP_001191547 |
Protein GI | 146304231 |
COG category | [C] Energy production and conversion |
COG ID | [COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain |
TIGRFAM ID | [TIGR00273] iron-sulfur cluster-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTGGG AAATCGCCAT TGAAAGGACG ATAAGGAACA ACGTTCCCAG AGTTTACAAT GTCTTAGAGA AGTATCCCTA CATACTAGAT CTTGCAAAGG AGCTGAGGAA GGCCAAGCTT GAGGTCCTGA ACAATCTCGA GATGTACGTG GAACAGACGG TCGAGTCTAT AAAGAGAATT GGTGGGGTTC CACACGTCGT TGGGGATTCT ACGGAGGCGA GAGAGGTCAT CTCCAAGATA ATTGGGGATA GGAAGAGGGT TGTCATGGGC AAGTCCATGG TGGCCTTCGA AGTTGGATTA AGGGAACATC TCAAGAGCCT AGGAAAGGAG GTGTGGGAAA CTGACCTAGG CGAGTTCCTG ATACAGCTCG CCAACGAGCC ACCCTCTCAC ATCATAGCCC CTGCGGTTCA TATGTCAAAG GAGAGGGCTG AGGAACTGGT TAGAGAGGCG CTCGGTGGTC TTCCTCCCAA TTCAACTCAC GAACAGATCG TGGCAAGGGT GAGGGAGTTC CTGAGGGACA AGTTCGTCAA CGCTGAGGTG GGAATAACGG GAGCAAACGC GATAGCTGCC GATACTGGGT CAATCATCCT CGTGGAAAAC GAGGGAAACA TAAGGTTTAC CACAGTGTCT CCTCCTCTTC ATATTGCAGT GGCCGGTTTC GAGAAAATCG TACCTACCCT TCCACACGCT ATGATGGAGG CCATGGTCCA AGCTGCATAT GCGGGATTAT ATCCGCCCAC CTATGTTAAC CTGACCTCTG GACCCAGTTC CACAGGTGAT ATTGAGATGA AGAGGGTTAG CCCAGCACAT GGGCCCAAGG AGTTCCACCT TGTCCTGGTG GATAACGGGA GAGTGAAGGC GTCCAAAGAT CCTGACCTGA GGGAGGCCTT ACTTTGTATT AGGTGTGGTA GATGCCATCT ACACTGCCCC GTGTATAGGG CGATGGATGG AAAATGGGGC GTTCCTCCCT ACTCGGGTCC CATGGGCTCC ATGTGGTCAT ATGTCGTGTT CGGCGATCCT AAACCCTCGC TACTCTGCAC ACACTCTGGG GGATGCAAGG AGGTTTGTCC CATGAAGATA AACATACCGA GGGTTCTAGA GAAGATAAAG GCTCGGGCGT GGAGCTAA
|
Protein sequence | MTWEIAIERT IRNNVPRVYN VLEKYPYILD LAKELRKAKL EVLNNLEMYV EQTVESIKRI GGVPHVVGDS TEAREVISKI IGDRKRVVMG KSMVAFEVGL REHLKSLGKE VWETDLGEFL IQLANEPPSH IIAPAVHMSK ERAEELVREA LGGLPPNSTH EQIVARVREF LRDKFVNAEV GITGANAIAA DTGSIILVEN EGNIRFTTVS PPLHIAVAGF EKIVPTLPHA MMEAMVQAAY AGLYPPTYVN LTSGPSSTGD IEMKRVSPAH GPKEFHLVLV DNGRVKASKD PDLREALLCI RCGRCHLHCP VYRAMDGKWG VPPYSGPMGS MWSYVVFGDP KPSLLCTHSG GCKEVCPMKI NIPRVLEKIK ARAWS
|
| |