Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0873 |
Symbol | |
ID | 5104267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 805468 |
End bp | 807054 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640506776 |
Product | hypothetical protein |
Protein accession | YP_001190969 |
Protein GI | 146303653 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.237291 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAATTTTG AACCAATGGC AAGCGGTAAA GAAGATCAAA AAGGAATTAA GGGTTCCCTT TTTGCAAGGG AGTCCTCAGG TTTAGTCAGG GAAATAAACT GGCTAACTGC CATGTTTATT AGCATGGGCT TCATTGCATT CTACGTCCTT CCAATATCCT ACCTATCGGG TCTCTCAATC TCTCCGAACG GGCTTGTGGT TATTGGCGCC CTACTCAGCT GGATAGTTCT TCTCCCCCAC GGTTATTTAT GGACAAAAAT AAGCGAGAAA TTTCAGAGAA CTGCTGCAGA TTACGTTTTT GCGAGTAGAG TGTTACATCC TGCTATCGGA GTGGGAACAG GGCTTGTCTT CGGCATCTCC CAGATGATAT TTGACGCTGC CATAGTTTAC GATGGCGTGG GCCAGCTACA GACAGGTTTT TCGGCACTGG GGACAGATTT TGGAGGCTCC TACACGTCAT TGGCAACTAC CTTAGGGAAC CCCTTCACAA TTCTCGCGAT AGGAGCAATT ACCTTCACAG GAGTAATTGC AATAAACATC TTTCTGCCAA AGTATACCAA TCAAATAATG GGTGGAATAA CTATTGTAGC CTTAGTAACC TTTGTATTAA CAGGGATACT GATGCATTAC GTTACGCCCG CATCGATAAC TGCGTCTGGA TACGATTACA GCTCCATAGT GAAGGCGAGT TCTTCTGCGA CACCACTCTT CAGCAATCCG ATCTTAGCTA CAATTGGTCT TATGGTGTTC ACTGCGAGTT TCCTTCCATT TGTTAATGGA GCCACTTCAG TCGCTGGGGA GGTCCGAGGG GGAAGTAAGG CGTTCAGGAT CGGAGTACTA GCAGCTCTTG TTGTTGCGGG TCTGCTAATA ACCTTCTTCA TCTCTTCGTC AGTTAGTACG CTTACTTCAG AGTTTTTCAT AGGAGCTGGA GTCTTGGGAC CGAACTATTC TAATCTCCTA AACCCAATAT TTGATGTAGT AGTGAGTTTC AGGAACTTGC CTTTAGATCT CTTTCTTGTG ATAGGGTCTT ACTTCTGGTA CCTTGCCATA ATGTTCGCTG TCGCCCTCTT CGTATCTAGG TATTTTATGG CTGTGGCCTT TGACAAGGCC CTACCAACAA TTGTCTCATA TGTTAGTGAG AGATATCATT CCCCTGTGGT GGCACATCTG ATCGACGAAG CAATAACAAT TGGTAGTCTC ACCGTTATCA CTGTCACACC TCTCAGCTCT GCCTTCTTTT ATGGCATGGA TACTGCTGAT GCCATGGCTC TGCTCTTCGG CTTCATAGTG GTCATTCTGG CGTCCATAGT CGCATCAATC AGAAGAAATG GATTAGAACT AAAGGGAAAC AGGGCCGTAA TACTAGGAAT ATCCATTGCT GACCTAATAG TCATGAGTGT ATACGGTTTC TATTGGTTTG GAAATTCCAC AATTTACCTA GGAATAAACA TGAATCCGGT CACCTGGCTC ATAGTCTCTT CTCCTTTCAT TGCTGGAATA ATCATTTACT TCGTAATGAG ATGGTACAGG CTAAGAAAAG AGGATATCGA TATCAAATAC TCATTCCAGG AAATTCCGCC AGAGTAG
|
Protein sequence | MNFEPMASGK EDQKGIKGSL FARESSGLVR EINWLTAMFI SMGFIAFYVL PISYLSGLSI SPNGLVVIGA LLSWIVLLPH GYLWTKISEK FQRTAADYVF ASRVLHPAIG VGTGLVFGIS QMIFDAAIVY DGVGQLQTGF SALGTDFGGS YTSLATTLGN PFTILAIGAI TFTGVIAINI FLPKYTNQIM GGITIVALVT FVLTGILMHY VTPASITASG YDYSSIVKAS SSATPLFSNP ILATIGLMVF TASFLPFVNG ATSVAGEVRG GSKAFRIGVL AALVVAGLLI TFFISSSVST LTSEFFIGAG VLGPNYSNLL NPIFDVVVSF RNLPLDLFLV IGSYFWYLAI MFAVALFVSR YFMAVAFDKA LPTIVSYVSE RYHSPVVAHL IDEAITIGSL TVITVTPLSS AFFYGMDTAD AMALLFGFIV VILASIVASI RRNGLELKGN RAVILGISIA DLIVMSVYGF YWFGNSTIYL GINMNPVTWL IVSSPFIAGI IIYFVMRWYR LRKEDIDIKY SFQEIPPE
|
| |