Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2105 |
Symbol | |
ID | 5104399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2027393 |
End bp | 2028976 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640507995 |
Product | hypothetical protein |
Protein accession | YP_001192169 |
Protein GI | 146304853 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.282872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAAG TACGATTTGA ATTTAAGAAT ATTACCCTAA GGATGGGCAG GTTCCTTGAG TTCTTAGACA GGACAAGATA TTTCCCGCTA TTATCAACGA TTCATCCCTT TAACTTGTTC CTCAGGCTCA TAAGGCAGAA CATTGAAGTG TATGGAGGAA GTATCTCAAG AGTGGAAAAT CTTTCAAATA GATCATTCTT AATCTTTTTA ATTTCATTGT CTCTCATTGG AATACTAGTA TGGCACATGG GGATATACTT TGTTCCCCTA TTAATTATAC CAGTTCTAGT TTATTTACTT CCTGTTATCT ATATACTAGC CTCCAAGATG GAATATTTAT CCAGGTTAAA TCTAGAGCTA CTTCCATTTT CAATTCTTCT ATATCTCAAT GCGTCACTGG GAAAGGGGCT TTACGAAACA TTCAACGATG TAAACCAGAG TTCCTTGTTC ATTGCCTTCA GGAAAGAGTT TGAAATTATA CAGAGGTATG GCATATTTCA TGGTAAATCA TTTCTCGATG GAATTCAGAG GAGAATAAAG AATCTCAGAA CTGGCTTAAT AGTAAAGTTA TACTCGTCTT CGCTTTCAGG CCAATTTTTA GGCGTAACCA TGGGTCAAAG GTCGCTGGAG TTCATTAATG ACTTGCTAGG AAACATAAGG GAGGCCTTCA ACAACTATGT TTCAAAGGCC TCCGAGATAG TTGAAGTCAT TTTCTCCATC TTCCTCTTAG TCCCCCTAGT AGCCATAGGG TTTCAGGGAT TATCTAGTAA TAATAATGGT GAGATTCTTT TAATACCGTT ACTATTTGCG CCTCTCATTT ACCTATGGAT ATCCGTGTCC CAACCCAACA TGGGTATCCA TGTTAAAATT GGGAAATTAC AATTGGTTGG TTTACTTCTG TCCACTGCAC TATTGGCTCT ACCATTCAAT CTATTGCTCA GAGTAGGCAT AACGTTCCTT GCAACTCAGC TGATTCTCTT TCCTTCCTAC CTAGTAATTA AGAGAGATGA GAGTATTCTC GCCGATTTTC CAACCATATT AAGAGAGATA GGCGATTTCA CTAAGCTCGG ATATGGAATT AGGGCATCTA TTCAAAGAAT AAATTTCGAT GAACTAGGCC TACACAAACC AACTGTAAAG TTCTTTGATA ACGTAAAGAA ACAGATTGGG ATGGGAAACA ATATCTATTT TGGATCTATT CAGAACGAAC AGGTAAAGTT TATCGTGGAG CTATTGAACA TCCTAGACAG GAAGGGTGGA GAAGGAGTCA GGGTGCTCCA GGAACTGAGC GACATGATAT ATTCGATCAC ATTATCCAGG ACTAAGCTAC AAAGAGAACT GAGTACGTTC AATATTCTTG CGCTTATTAC TCCCGTACTT TTCTGGTTTT CCACTACTTC AATTGAAACA ATTTCCAGTT TCTCTTCTTC CACTCTAGGC CTATTGAATT TAGGTTATAG CTTGACATTA AGCCTGTTAT ACACTAAGCT AAGTAAATTT ACCTTCCTAA ATCCTGTGGT ATACATCTCT GTGACGTTAA TATCGATATT ACTTTCAATC CTCCCTCCAG GATTATTATC TTGA
|
Protein sequence | MSKVRFEFKN ITLRMGRFLE FLDRTRYFPL LSTIHPFNLF LRLIRQNIEV YGGSISRVEN LSNRSFLIFL ISLSLIGILV WHMGIYFVPL LIIPVLVYLL PVIYILASKM EYLSRLNLEL LPFSILLYLN ASLGKGLYET FNDVNQSSLF IAFRKEFEII QRYGIFHGKS FLDGIQRRIK NLRTGLIVKL YSSSLSGQFL GVTMGQRSLE FINDLLGNIR EAFNNYVSKA SEIVEVIFSI FLLVPLVAIG FQGLSSNNNG EILLIPLLFA PLIYLWISVS QPNMGIHVKI GKLQLVGLLL STALLALPFN LLLRVGITFL ATQLILFPSY LVIKRDESIL ADFPTILREI GDFTKLGYGI RASIQRINFD ELGLHKPTVK FFDNVKKQIG MGNNIYFGSI QNEQVKFIVE LLNILDRKGG EGVRVLQELS DMIYSITLSR TKLQRELSTF NILALITPVL FWFSTTSIET ISSFSSSTLG LLNLGYSLTL SLLYTKLSKF TFLNPVVYIS VTLISILLSI LPPGLLS
|
| |