Gene Information |
Locus tag | Msed_0221 |
Symbol | |
ID | 5104087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 182314 |
End bp | 183693 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506126 |
Product | hypothetical protein |
Protein accession | YP_001190322 |
Protein GI | 146303006 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
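The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the function names `gene_length` and `protein_length` are hypothetical and not part of any database API. It assumes 1-based inclusive coordinates and an unspliced CDS whose coordinates include the terminal stop codon, which is consistent with this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon, which encodes no residue


if __name__ == "__main__":
    length = gene_length(182314, 183693)   # Start bp / End bp from the record
    print(length)                          # 1380, matching "Gene Length"
    print(protein_length(length))          # 459, matching "Protein Length"
```

Both results agree with the values listed for Msed_0221.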
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000130576 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0122393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGTCA GGAGTAAGAG ACACAACAGG GTCAGGGCAC CGTTTGATTG CGAGAGAGGG TTGCCCTATA CAGAGGTAAA CGTGAACGGG AAAATTGTGG AGAACTGTGA GCCTCCTACA AGACTTCATG GGCTTAGCTT TCCCTTAAGC GGGATGTATC TACATCGAGT CAAACTTCTG AGAACCTTTC CCTGGCTCCT GGAAAAGGTA GCAGAACGGA TGAATGTTCC AGAAGGATAC TCCACCGTGG AGGATTACAG GGTTGAGAGG GTTAGCGTGG ATACGCTAAT AGTAGGATCA GGACTTTCAG GTTTAACAGC TCTATCCAAG AGTAAGGCCA TGCTTGTCAC GAATGATCTT TACACAGACC TCTTTGACGA TCCCTTGAAT CAGGGAGAGC TTTTGGGGAA GGTCAAGGAC ATTATCAAGC AGAACGAGGG TAGGATAATT CAGGGCGACT TCCTAGGAAA GTTTACTGAG GGCTTTGTGG TCAGAACGGG TAAGAAACTC GTTCTAGTTT CCCCCTCCAG GGTGATCTTC GCCGTTGGTG GAAGATATTT GCCTCCTATC TTCGAGGGTA ATGACTATCC CAACGTGATT TCCAGGAGAC TTTATCTCAA GAGGAGATCT GCCTATAGAA GGATAGTGGT TCTTGGATCT TCGGATGATG CAATTAAAAC TAGTCTAATT TCAGGAGGTA AAATTCTGAC TCCGCGTGGG GTAAGGCTCT TCTCAAGAAG GTACTTGGAG TTGGCCGAAA CTAGGGGGGT GGAGATAGAG GAGGTCGACT CTCTTAAGGT GAAACCAAGG GATGGCAAAC TTTTCGTGGA GTGGAATAGT AGTAATCTTC TGGTTGACGC TGTGGTTTTC GCTCCAGTTA AACAGCCCAG ACTGGAACCC ATAGCTAATG CAGGGTGTGA ATACAGGTTT TACCCAAACA TGGGACTATA CGTTCCGGAA CATGAGATGG ACGGTTACAT GAGGAGTTGT GGGCACTTTG TAGTTGGAGG GGCCAGAGGC ATCATGGACG AAGAAACGTC GATGTTAAGT GCTGAGGCTC CTTTTAGCGC TGAGGCCCTC TCTACCCTAG CCAGTCATCT GAAAGAAACT CCCCTTCACG AGTACTACAC AAGGAATTTC GTATCTGTGA AGAGTCCATA TTACTATTCT CCAGGAGGTT ACGCTTGTTT CTGCGAAGAT GTGCTCTGGA GTGATGTGGA ACAGGTCATG AAAATGGGTT ACGACAATGT GGAGTTAATC AAAAGGGTTG GTGGGATTGG TCTTGGCGAG TGTCAGGGCA AGGTTTGCAC ATACGTTACT GGTAGTATCC TGTCAAGTCA GAGGCTGATA ACCTTCAGAT CACCGCTTTA CCCGATGTGA
|
Protein sequence | MEVRSKRHNR VRAPFDCERG LPYTEVNVNG KIVENCEPPT RLHGLSFPLS GMYLHRVKLL RTFPWLLEKV AERMNVPEGY STVEDYRVER VSVDTLIVGS GLSGLTALSK SKAMLVTNDL YTDLFDDPLN QGELLGKVKD IIKQNEGRII QGDFLGKFTE GFVVRTGKKL VLVSPSRVIF AVGGRYLPPI FEGNDYPNVI SRRLYLKRRS AYRRIVVLGS SDDAIKTSLI SGGKILTPRG VRLFSRRYLE LAETRGVEIE EVDSLKVKPR DGKLFVEWNS SNLLVDAVVF APVKQPRLEP IANAGCEYRF YPNMGLYVPE HEMDGYMRSC GHFVVGGARG IMDEETSMLS AEAPFSAEAL STLASHLKET PLHEYYTRNF VSVKSPYYYS PGGYACFCED VLWSDVEQVM KMGYDNVELI KRVGGIGLGE CQGKVCTYVT GSILSSQRLI TFRSPLYPM
|
| |
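The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned, its GC fraction computed, and the nucleotides translated is given below. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is built by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons; the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the whitespace that separates the ten-character blocks."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_blocks = "ATGGAGGTCA GGAGTAAGAG"   # first two blocks of the gene sequence
    print(translate(first_blocks))            # MEVRSK, the start of the protein sequence
    print(gc_content(first_blocks))           # 0.5 for this 20 bp fragment only
```

Passing the full "Gene sequence" field to `translate` and comparing the result with the cleaned "Protein sequence" field would be one way to confirm that the two are consistent.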