Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2149 |
Symbol | |
ID | 5104888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 2064015 |
End bp | 2065292 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640508040 |
Product | AIR synthase-like protein |
Protein accession | YP_001192212 |
Protein GI | 146304896 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1973] Hydrogenase maturation factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000105335 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000422525 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATCTAG AGGGATATGC AAGGAGATTG TGGAATCACC TCGACGAGTC TCAAATGAGG GAAGAACTAC TTCGCTGGTT AGAATTCTAT AAGGGAAAAA GGGAGCTTAA TCAGGATTTT GTGGACGCCG TAATAAGGGA GGTCAAGAAT TCAGAGAATT TTAAGGAATT CTCCTTCACG AGGGTAGGCC TCACGGCAGG AGACAGTGGC CTAGGTTCCA GGGGTCTTGG GGATAATCTA ATTCATCTGA AATTATTTGA GCTCAGTAAG AGGAATCTCG AGACTTTTGA TGATGCGGGA ATAGTTCAGG ACATAGTTGT CTCTGTGGAC GGAATACACT CCAGACTATC CTACTTCCCT TTTCTAGCTG GATTTCATGC AACAAAGGCC ACCCTTAGGG ATATCATGGT GAAAGGTGCA ATCCCGCTGG GCATCCTTGT GGATATTCAC CTTTCCGACG ATAGCGATGT TTCCATGCTC TTCGACTTTG AAGCCGGTGT ATCAACTGTG GCTGATGCCT TGAACGTACC AATTCTGGCT GGTAGCACGC TGAGGATTGG TGGTGATATG GTCCTGGGAG AGAGGATAAG TGGGGGCGTT GCCTCTGTGG GGAGACTCCA GGGAGAACCA TTCACTAGAA AGAGAATTAG TGAGGGACAA CATATAGTCA TGACAGAAGG CCATGGAGGT GGAACAATCT CGTCCATGGC CATATTTCAT GGCATTGAGG GTGTAGTGGA GGAGACCCTA AGGGTAAAGG ATCTTGAGGC ATGTCTTGCC GTGAGACGTG TTAGAAATCT CGTAAGCTCC ATGACAGACG TTACTAACGG TGGTATAAGG GGTGATGCGT TAGAGATTTC GGAGGTAACT AACGTAAGCC TTGTGATAGA CGAGGATGAA TTCCTCTCTC TCATAAACCC AAGGATCAGG AAGGCCATGA ATGAATTGGG CATAGACCCC TTTGGTCTCT CGCTTGATTC CATCCTTATT TTCACCAATA ACCCGGACGA GGTTATAAGG ACCTTGAGGG ATAATCACGT ACAAGCTAAG ACCATAGGGG AGGTCACGCG AAGGAGAGGA TATCCAATAG TTACCCGCGA TGGAAGGGAG ATGAGACCCG CCTTTAGGGA AAGCCCCTAC ACTCCCATTA AAGCCGTCAT AGGAAACTAC TCCCCCATGG ATCTAGATGA GATTAAAAAG AGACTGGAAA GGGCCTACCT GAACTCTTTG TCGAAGAAGG AAAAGGTATT GAAAAACTTA AAAACAGGGA GTTTATAG
|
Protein sequence | MDLEGYARRL WNHLDESQMR EELLRWLEFY KGKRELNQDF VDAVIREVKN SENFKEFSFT RVGLTAGDSG LGSRGLGDNL IHLKLFELSK RNLETFDDAG IVQDIVVSVD GIHSRLSYFP FLAGFHATKA TLRDIMVKGA IPLGILVDIH LSDDSDVSML FDFEAGVSTV ADALNVPILA GSTLRIGGDM VLGERISGGV ASVGRLQGEP FTRKRISEGQ HIVMTEGHGG GTISSMAIFH GIEGVVEETL RVKDLEACLA VRRVRNLVSS MTDVTNGGIR GDALEISEVT NVSLVIDEDE FLSLINPRIR KAMNELGIDP FGLSLDSILI FTNNPDEVIR TLRDNHVQAK TIGEVTRRRG YPIVTRDGRE MRPAFRESPY TPIKAVIGNY SPMDLDEIKK RLERAYLNSL SKKEKVLKNL KTGSL
|
| |