Gene Information |
Locus tag | Msed_0053 |
Symbol | |
ID | 5104631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 47130 |
End bp | 48773 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640505948 |
Product | hypothetical protein |
Protein accession | YP_001190154 |
Protein GI | 146302838 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
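
The coordinate and length fields above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(47130, 48773)
print(length_bp, protein_length(length_bp))  # 1644 547
```

The strand field does not affect this arithmetic: the same span length applies whether the gene lies on the forward or reverse strand.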
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.594463 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAG TAACGCGGTA TCTCATAGCA ATAGGCTTAG TGCTTGCGGA GATGATCTTC GTTGAGCTGG AAAGCCTCCA ACCTATCATT GGCGGGTTTA ACGGTGCCCT AATATTATCC GTTCTATTAT TTCTGGATGG GGTCTTTTCC CTCCTAGTTT TGGATTCCCT GATCCCGAGC CTTCTCTTCT CCCTCAGCAT ATTTGCCTTA TTTAATCTAG TTTATGGGTC CGTCGATCCC CTCATTCTTT TGGAATATCT CTCCGGGGTT GGGATATCAT CGGCATTCCT TTACTTAACG AGGTCGCTGT GGGATACTAC CTTTAGGCTA TGGAGAAGGA TCAAGAGGAC ACCTGACCTG AAGATCCTCC TAGTCTCGGC AATACTAGCA CCGGCTGTCT ATGCACTCAC GAGATCGCCA TTTATGGCCA CTGGAGTGAT AATAGACGGG GCTATTCTCT CCTTTATCGG AAGGATCGAG TTGAGCCCTT TGTCCTCCCT ATCCTGGATC TCTCTTCCCT ACCTTCTCAT GGTAGAACCC AAGGAAACTA GTACTGGGAT ATGTATAGGC AATGAAGAGA AGGTGTTGGT TAGAAGTCTT ACACCTGGGT TGTTTCAAGC AGGATACAGA TATAAATGGA TTAATGTGAA GAAGCCCTTT TGTGTGGATT TTTCTAAGAT GAAGAATTAC AACATGGTAA TAGTGGGGTC GAGCGGATAT GGGAAGTCCA CCCTGGCTAA ATTAATCCTT TCAAAAACTA ACGTAGATTA CATCGTCTTC GACCTACACG GGGAATATGA AGATGTCCCA GGGAGGAGGA TGGATATGTC CCTTAACGGG GTAAATCCTC TCTCTCTTTT CGGGAGGTCC CCTAAACAGA GATCCATTGA AATTGCCCTG ATGCTTAAAT CTATCTTCAA TCTGGGAAGT ATTCAGACCA TGGACCTATC GAACCTGTTC GTTGAGGCGT ACCAGGAAAG GGGAATATAT GACGAGGATG AGAGGACTTG GTCTCTAGAC CCTCCTACGA TAAGGGATGT GCTTCTACTC CTAGAAAGGA AAAAGAGGGC ATCATTCAAT TCACAGGATT TGAATAGGTG GGGAAGCATA GATCCTTACC TGAGGTTCCT AGACTCAAAC ATCTTTTATG GGAGCGAGGA CCTGGGAAAA CTACTTGAAG GAAAAGTTAT ACTAGACTTC TCTAGGATAA CCACCACGAA CATAAAGTAC ATCCTCATGG AGACCGTTCT AACCTCTATT CTGGGGAGTA TGTACCTTGA GAAATCTGCC TCACTAAGAA AGCTCGTAGT GATAGACGAG GCACCCTTTC TACTGGGAAG GGAGAGTGGT GAGGCCCTAG CCGAAAGACT TTTCGCTGAG GGAAGAAAAT TTGGATATGG CTTTATACTA ATTTCACAAT ATTCCGATAA ACTTGAAAAG ATGATAAACA ATGCCTCAAT GACCATGATA ATGGGAATGA ATGATCCCGA CGAATTAAAT TACATCGCTA GGTTGATTGG AGGAGAGAGT CAAGAGGCGA GGAGAGTTAT TTACGAAACT CTTTCGGTAC TTGAAAGGGG AAAGGTGATA ACAAGAGATA TAACGGCGAA TGATATAGTA GTTGTTCGCC TAAATCAGGG GTGA
|
Protein sequence | MQKVTRYLIA IGLVLAEMIF VELESLQPII GGFNGALILS VLLFLDGVFS LLVLDSLIPS LLFSLSIFAL FNLVYGSVDP LILLEYLSGV GISSAFLYLT RSLWDTTFRL WRRIKRTPDL KILLVSAILA PAVYALTRSP FMATGVIIDG AILSFIGRIE LSPLSSLSWI SLPYLLMVEP KETSTGICIG NEEKVLVRSL TPGLFQAGYR YKWINVKKPF CVDFSKMKNY NMVIVGSSGY GKSTLAKLIL SKTNVDYIVF DLHGEYEDVP GRRMDMSLNG VNPLSLFGRS PKQRSIEIAL MLKSIFNLGS IQTMDLSNLF VEAYQERGIY DEDERTWSLD PPTIRDVLLL LERKKRASFN SQDLNRWGSI DPYLRFLDSN IFYGSEDLGK LLEGKVILDF SRITTTNIKY ILMETVLTSI LGSMYLEKSA SLRKLVVIDE APFLLGRESG EALAERLFAE GRKFGYGFIL ISQYSDKLEK MINNASMTMI MGMNDPDELN YIARLIGGES QEARRVIYET LSVLERGKVI TRDITANDIV VVRLNQG
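
Both sequences are shown in blocks of ten characters. As a minimal sketch of how the two fields relate, the snippet below joins the blocks and translates the opening codons of the gene sequence. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two tables differ in their permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses
# the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO_ACIDS[index]
    for index, codon in enumerate(product(BASES, repeat=3))
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases; returns 0.0 for an empty sequence."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence field.
prefix = "ATGCAAAAAG TAACGCGGTA TCTCATAGCA"
print(translate(prefix))  # MQKVTRYLIA
```

The output matches the first block of the protein sequence field. Applied to the full gene sequence, the same function should reproduce the 547-residue protein, and `gc_fraction` can be compared against the reported GC content.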