Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1155 |
Symbol | |
ID | 5103503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1119052 |
End bp | 1120803 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640507047 |
Product | hypothetical protein |
Protein accession | YP_001191240 |
Protein GI | 146303924 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.104357 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTTGA TGCCACATTC ACTAGAATAT ACTGTCTTCA CCAATATTCA ATTGAGGATT TCCTCCGATG CAGAGAATGC ATCAACTAAA AGAGTTGAAA AGAATATTAA AATCAAAAAG ATTGAAATAA CGGTTAAGGA AGAGAAAGCG AAGGTTAGGG TATGGACAGA TCAACAAGAG CTCGATTTAT TTGCAGAAAA ATTGACTATA GACGATATTA AGATTCCCAC ACAAATTACA ATCTCCATGA AATCCGTCTC TGAGGTGTTG TCTGAGCGCC GTTTGACCTT TGAAGTGACC AAGAAATCTC TCGATGTAGC TATCATGGAG GTAAAATTGG ATTCTTCAAA GAAATTGGGT CCCTTACCGT GCTATGTGAA ATTTTACGGA AAAGTTTACC ACAACGGTAC GGGGATTATT GCACACGAGT TAAGAATCTA CACCAGGATA AAAATAAGTA GGGTCAGGTA CTTCGTGACT AATAGGGATA ACAAGAACAT TAATGTTCGA CTCGAGTATC AGATTGAAAA TGACCTGGAC TTCGTTGTTA AACACGTCTT TGTCCCACTT AAAAATTTCG TGATTGGGTT AATGATAAGG GACGAGAGCG GAACACAACT AAGTTTCATC AGTAGGAGAG AGATCATGAC AAGGCTCTCG CTTGACCGCT CACTAGATGA ATTGAATTAC TTTGTGATAG TTCCTCTTCA AGGTGGGCTG AAGCCAAGGC AAAGGAAAAT CCTCAACTTT GAGGGAATTG AGGTCTTACC GCCTAGTAAA CCCAAGGAGA AGAGTGGGTC AAAGGAGAAG AATGAGCCAG GGACGGAAAG CGTATTTGAG ACAGAGTTCA ATGGTGATGC TGCCTTGGGA ATAATCATCG AATCTCCTTC ATCTAACAAG GACATTGTTG TTAAGAGTAT TACTAGTAAA GTTGTTAAGA AGATTACTAG TGGACAAGAG GATAATCAAA GTGGTCAAAA GGAAATACCA CTCAACCCTT GCAATGATAT GAAGTCTGAA GAGAGTTCAA CCCCTACAAC AGTTGCCTCA AGCAATTCAC AGCCCGAGTT TTACTGCGAG CCCAGGAACT GTGATGATTA TGAAGGTCAA CATAATATAA TCAGATCGTC CCACAGAATA GACCTAGAAT TCAGATCGAG GGGCGGTAAT ATATCTACAT CATCGCCAGT GACGGCGATT ATTTCAGTCA CTTATAACAT AGTTCCAGAG AAGAAGGCTC AGGATTACCT GTGGTTCCTT ACCGCCTTCT ACTGGTTAGC CACAACAACC ATATTTGGGC TATTCGTCGA GGAGTTATCC ATGGATTTAT TTGGATTTAA CTTCTTGACC TTCGTTGTGG GATTATTAGC AGGTGGTGTT GCGGGAGCCT TTTTATTCTT CCCCTACTAT ATAAGCCATG TAGGAAATTT ATTTGAGTCA GATAGTATAA GCTTCCTGAG GGAGGCCTTC AGGAGAGTTA TACGATACAA GTTCAACCTA GCCCTTTTCA TGGTGAGCAT ATTCATCCTG GTGATATCTA TGAGCCTGAG GATATATCCC GCCTTGCTCG AGTCAACTCC CGCCATCACC GTAATCAGAG AAGTTTATTT GGACATAAAT TTTGCCCTGG CAGTTATATT GGAGTTCTCG ACAATAACCT CTGAGGAGCT AGAGTACAAG TCTCCGTTCG AGGTGCTAAC AATATCCCTT CTAAGCTTGA TACTGATCTC CATGTTCCTC CTGCTTATTT AA
|
Protein sequence | MILMPHSLEY TVFTNIQLRI SSDAENASTK RVEKNIKIKK IEITVKEEKA KVRVWTDQQE LDLFAEKLTI DDIKIPTQIT ISMKSVSEVL SERRLTFEVT KKSLDVAIME VKLDSSKKLG PLPCYVKFYG KVYHNGTGII AHELRIYTRI KISRVRYFVT NRDNKNINVR LEYQIENDLD FVVKHVFVPL KNFVIGLMIR DESGTQLSFI SRREIMTRLS LDRSLDELNY FVIVPLQGGL KPRQRKILNF EGIEVLPPSK PKEKSGSKEK NEPGTESVFE TEFNGDAALG IIIESPSSNK DIVVKSITSK VVKKITSGQE DNQSGQKEIP LNPCNDMKSE ESSTPTTVAS SNSQPEFYCE PRNCDDYEGQ HNIIRSSHRI DLEFRSRGGN ISTSSPVTAI ISVTYNIVPE KKAQDYLWFL TAFYWLATTT IFGLFVEELS MDLFGFNFLT FVVGLLAGGV AGAFLFFPYY ISHVGNLFES DSISFLREAF RRVIRYKFNL ALFMVSIFIL VISMSLRIYP ALLESTPAIT VIREVYLDIN FALAVILEFS TITSEELEYK SPFEVLTISL LSLILISMFL LLI
|
| |