Gene Information |
Locus tag | Msed_0945 |
Symbol | |
ID | 5104375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 871735 |
End bp | 873972 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506848 |
Product | Ni,Fe-hydrogenase I large subunit-like protein |
Protein accession | YP_001191041 |
Protein GI | 146303725 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
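The length fields in the record above can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 871735
end_bp = 873972

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 2238, matches "Gene Length | 2238 bp"
print(protein_length)   # 745, matches "Protein Length | 745 aa"
```

Under these assumptions both derived values agree with the table, and the gene length is an exact multiple of three, as expected for an unspliced CDS.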
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGTA GTTCACCATA TTATTTCACG TTTGATTGGA CTAAACCAGT TCTCATAGAA CCCATAGTGA GGATAAAGCC TGAGTTGGGA ATACAAGTCA CCTACGACTC CTCTAGTAAC AGGGTTACGG ACGCCTATGC AGCAGCAGGT ATGTTCAGAG GTTTCGAGAT ATTCATGAGA AATAAGCCAG GACCAGACGT CATAATGCTT TCCAGTAGGG AGTGTGGAAT ATGTGGAGAA CATCACCAAT TCATTGAAAG AATTGCACAG GAAATGGCTC AGGGCGGTAA CGCTCCTCCA CCATTAGGTC TTGAAACCAT TATCCTGGCT AATGACGCAG CTATGACCTA CGATGCCACA GCCCATCTTA CAGCTTTAGG AGGACCAGAC TGGAGCTCCC TATTCTTCAA GGTTGCTGGG TATTTCCCAA GCCAAATTTA CGAAATAGCT CAGCAGACAC CTCTGAGTAA AGTGGCGCAA GAGTCTCAGG CATTCCCAAT GGGCGCACCA ACGCAACTGG AAATGGCCGG CGTCAAATTG AAGACTGTTT CGGATCTCAT GGATGCGTTA GTTCCAATAA TTGGCGCCTA CTTCCTTGAG GGAGCCTATT CGTGGAGAAT ACTTCACGAG GCTGAATTGC TGTACTATCT CAGAGTACCT CATCCGATAA CAATGGTACC TGGAGGAATT GGTGTACCTG CCACTGTGGA GAACCTACAA AGATATTTCC AACGGATGGT TGACGGAACG GCTTGGTTAA TGAAACAGAT GGCCGTCTAT GAATATTTGA CATTATTCCT TGCAGATTAT AATAATACGT TCGGAATCCA GGCTCTATAT GACAAATTGA ATGGGACTCT TGGATTTGCT GAGGAGGGAA ACAGACCTGG AAACTTCGTC TCCTATGGCC AGAGCGACGT TTCAGAAGCT CCCTCAGTTA CGCCATCCAG TACATCGGTG ATGCCTTATG ATGGTACCTA TGAGAATATG GGCGCTTGGG GTAGGGCTAG GGTCGTGAAA CCAGGACTGG TTATTCAACC CAATTCCAAT TCTCCTCCAC AGCTTGTCAC CAATAGCCTT ATTGACATAA ATCTCGGAAT AAGGGAGTTC GTGGAGTCCT CTTTCTACAC GCCGTGGTAC GAAAATGGAT CCTTCAATAG TGAAGTTCCG TCCTCAGTGA GCGGAATAGA GGAAGATCCA ATCGGAAATA AGGTTAGTTA CTACCATCCA TGGAGAAAGT GGACTATGCC TAACCCGCAG CCTAGACCGG TTCCAATGGC CTATCCTGCA CCCTATTCAT GGGCGGTTTC ACCTAGAATA GTTCCGTACC AGAATCACAA ACCAATTACC AATCTCTTCT ACAACGTTGA AGCGGATCCC ATGGCCGTGA TGTATGCTCA AGTTCTTCAG CCACAGGCCC CTACCCCCAT ATATACTTCC AGCTATCCCT CAGGCTTTGA AGTCTCATAC AACAGCAACG AGATGAGTGC TACCTTCTAC TTGCCCACTG TGAACAGTCC AAGGATATCA CTGCCTCCTG AATTTCAGCA GGGGAGCGAA ATAGAGTTTA AGTATGTGGC ACCATACGCG CTTTCCGGTG GAAAGATAGT TACTAATGCC ATAGAAAGGA TGAGGGCCAG GGCATTTGAG GCGGGTCTAG ATGGAATGGG AATGTGGATA GCCTGGATGT CCGCTATCAC GTACTTGAAG AAGGGTATGA CGCAAGTGAG TAGTATGCCT CCATCGAGCT GGACCTACAA GTCCAATACT AATACAGGGG TAGCATTAGG TGTGGGAATG 
AAGGAGGCCC CACGAGGTGC ACAGTATCAT TCAGAGGTGT TCGCCACTGG AACTGGCGTA AATGTCCCAG CAATTAACCC GCAACAAATG CAACCAACTA CAGTGAACTC TGGTCCAAGG GTGAAGAATT CCTATGACGA CGGCATTCAA ACAATTAAAC CTGCATCTCC AACACACGGC GCAGGAACTT TTGAGGAAGT GCTGATGGGG AACCCCGATT ATAATATGCC TGGTACTCCT AGGTTAACAG TTATTCCTAC GGAGCAGTGG GATGGCTTCG AGTTTGCCCA AACCCTACGA TCATTTGATC CATGTTTTGT ATGCGGAGTG CATATGGTAT TGCCAAATGG AAGGGCTAGA TACATTACCC TTGGAGCTCC CGTTGATTTA AGTAATGCGA TTAAAGCATT CTACAGGTTT GCCATGAAGG TGAAGTAG
|
Protein sequence | MASSSPYYFT FDWTKPVLIE PIVRIKPELG IQVTYDSSSN RVTDAYAAAG MFRGFEIFMR NKPGPDVIML SSRECGICGE HHQFIERIAQ EMAQGGNAPP PLGLETIILA NDAAMTYDAT AHLTALGGPD WSSLFFKVAG YFPSQIYEIA QQTPLSKVAQ ESQAFPMGAP TQLEMAGVKL KTVSDLMDAL VPIIGAYFLE GAYSWRILHE AELLYYLRVP HPITMVPGGI GVPATVENLQ RYFQRMVDGT AWLMKQMAVY EYLTLFLADY NNTFGIQALY DKLNGTLGFA EEGNRPGNFV SYGQSDVSEA PSVTPSSTSV MPYDGTYENM GAWGRARVVK PGLVIQPNSN SPPQLVTNSL IDINLGIREF VESSFYTPWY ENGSFNSEVP SSVSGIEEDP IGNKVSYYHP WRKWTMPNPQ PRPVPMAYPA PYSWAVSPRI VPYQNHKPIT NLFYNVEADP MAVMYAQVLQ PQAPTPIYTS SYPSGFEVSY NSNEMSATFY LPTVNSPRIS LPPEFQQGSE IEFKYVAPYA LSGGKIVTNA IERMRARAFE AGLDGMGMWI AWMSAITYLK KGMTQVSSMP PSSWTYKSNT NTGVALGVGM KEAPRGAQYH SEVFATGTGV NVPAINPQQM QPTTVNSGPR VKNSYDDGIQ TIKPASPTHG AGTFEEVLMG NPDYNMPGTP RLTVIPTEQW DGFEFAQTLR SFDPCFVCGV HMVLPNGRAR YITLGAPVDL SNAIKAFYRF AMKVK
|
| |
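The relationship between the gene sequence and the protein sequence can be spot-checked with a short translation sketch. It assumes the sequence is read in frame from its first base and uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in the permitted start codons; this gene starts with ATG, so that difference does not matter here). The names `translate`, `first_block`, and `last_codons` are illustrative and not part of the record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); internal codon assignments
# are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the last nine bases of the gene sequence above.
first_block = "ATGGCAAGTA GTTCACCATA TTATTTCACG"
last_codons = "GTGAAGTAG"

print(translate(first_block))  # MASSSPYYFT
print(translate(last_codons))  # VK
```

The first ten residues match the start of the listed protein sequence (MASSSPYYFT), and the final codons give the closing residues VK followed by the TAG stop. Passing the full gene sequence to `translate` would allow the whole 745-residue protein to be compared in the same way.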