Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2189 |
Symbol | |
ID | 5105410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 2100279 |
End bp | 2101697 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640508083 |
Product | hypothetical protein |
Protein accession | YP_001192252 |
Protein GI | 146304936 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000970823 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000755051 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGGAAAG CTCTAACTAT CCTATTAGTT ATATTGATAC TTCCGAGCCT ACTTTCAGGA CTAAGCAATA TCGTGGCATT TGCTCAAAGT GGAAGCGGTA GTAATAATTT AGGGACCTTT ATCTCGACAC TACAAACATT GATACCTGCA GCACTGCTAT TGCTTGCAAT TTTGGCAAAC AGATACGATC AACGTTATGC TTTAGTGTTG TTCGCAGCTG CTTTAGCTTC AGCGTTATTT CTAGCAGCTG CTACGGGAGG AAAACTAGGA GGAAGCGTAA ACATAACATT AGTAGAACTG AAAGTAACAG TTTCTGGTCC CACTTCGGCT TATACTGGAA ACGCTGAAAC TTATACAGTG TCCTGGAGCC CATCTATGTC TGGAACAGTG ATATGGACAG TATTATATAA TGGGAGCATA GTATACAATT CCACAGGCGG AACTTCGTTC ACCTATACAT TCAATGAACC TGGAAAATAC ATAATTGCTG CGGCTGTAAT TAATGATCAA AACTATGCAG GAGGGTCTGG AGCAGTCTTA ATAACTGTAA CGAACCCACC TAGCCCCTTG GGCTGGCTTA CTGGGGCTAT AACGAGCGCA ATCTCTGGAT TTATTAACAC TGCAGCTAAC ACATTTGAAG GATTTTTGAC TACACTTTTA CAGATTTTCG GAGCACCTCT AGAATGGATG ACTTATTCCC CTACGCCCAA TATTTCTCCA ATTACTCAAA CGATATATAA TGAAATCCAA GATTATGCTA TCGGGCTAGC TATGTTGTTC GTATCTTTCT CGATAGCATA CAACGCATTC AGAGGGTTCT ATTCGGACCT CGTTGATCTT GCTGGTGATG TCCTTTACAA GCTCGGCGTA TGGGGACTAT TCTATGCAGG TGGGATGATA GTATACACTT ATGCTGCGAA TTTCATTAAT TCTATCATTT ACTCAGTCGC AGGTCCATAT CTAGGAATAG CAACGATAGA ATATACAGGA GGAGCAACTC TCTACACGGC CCTATTTGCA TTAATGAACG GGGTTCCCTT TGGGTTTGGG GATTCTTTAG ATATGTTTTT ATCTCTAGTT ATGTTTCTCT TAGCTTTCAC ATTAGCGGTA GCAACAATAA AATATGCTGT AATGTTATCT GTTGTATCTA CTATTCCACT ATGGGCTTCA CTCTGGATAT TTGAATGGAC TAGAAAAATT GCGATGATGG TAGTAGATTT GCTAATAGGC TTAATGGTAG CGGGTTTGGT AGCCGCAATA ACTTTTGCAA TCTTAGCAAC ATTACCTATC GGAGCCTTGC TGTTTATCAT AGACCCTATA GCGATTGATG GAGAATTCCT GTTCTCGTTA GTGCTTTTTG TGTTTAGTTT GAGACCAGGA CAACATATGG TAGGAGCAAT TAGAGAATTG TCTTCATAA
|
Protein sequence | MRKALTILLV ILILPSLLSG LSNIVAFAQS GSGSNNLGTF ISTLQTLIPA ALLLLAILAN RYDQRYALVL FAAALASALF LAAATGGKLG GSVNITLVEL KVTVSGPTSA YTGNAETYTV SWSPSMSGTV IWTVLYNGSI VYNSTGGTSF TYTFNEPGKY IIAAAVINDQ NYAGGSGAVL ITVTNPPSPL GWLTGAITSA ISGFINTAAN TFEGFLTTLL QIFGAPLEWM TYSPTPNISP ITQTIYNEIQ DYAIGLAMLF VSFSIAYNAF RGFYSDLVDL AGDVLYKLGV WGLFYAGGMI VYTYAANFIN SIIYSVAGPY LGIATIEYTG GATLYTALFA LMNGVPFGFG DSLDMFLSLV MFLLAFTLAV ATIKYAVMLS VVSTIPLWAS LWIFEWTRKI AMMVVDLLIG LMVAGLVAAI TFAILATLPI GALLFIIDPI AIDGEFLFSL VLFVFSLRPG QHMVGAIREL SS
|
| |