Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1771 |
Symbol | |
ID | 5104771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1712553 |
End bp | 1713797 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640507669 |
Product | hypothetical protein |
Protein accession | YP_001191850 |
Protein GI | 146304534 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGGCT CATCGCCCCT AATGGCGTCT CTCTTCCCGG GATGGCAGTT CAAGACTGCC CTGGATGCCG TGAAGTTGGG GTGGTACTTC GCCCTGCCCA AAGTGGGAGA ATTTGGGGCC TCCAAGCTTG TTCTCGAGAA GGAGAAGCCC TTCATTGACG TGAATGAGCT TAAGGGGTTT AGGTACGACG GGTTAGTTAT CAATCAGGTT GTTACCTCGG GCAAGAACTT GAGGACCGTG GAGAGGACCG TGAAGTCCAT TCATCATTGG TATGGCGAGG TGGAGAGGAG GTACTCTGTG AGGATACCCC ACGAGGTCTG GATTGTGGTT GACGAGGGAA GGGAGGCAGG TTTGAGGGGG TTAGACGCTA GGGTGGAGGT AGTTCCAGCA GAGTATAGGA CCAGGAATGG GTCCATGTTC AAGGCCAGGG CACTCCAGTA CGCCGTGGAA CAGAGGGGAG GCACTGGATC AGGCACGTGG GTTTACTACC ACGATGAGGA GACCGTGTTT GGGGAGGACA GCGTCCTAGG AATTGCCGAA TTTGTTCAAG GGGACAGGGA CGTTGGGGTT CATCCCATAG TTTACCCGGT TAACTGGAGA GGCGACGTGT TATCCACGAT TGAGACGTTG AGGACGTCCA ATGACGTGGT GAGCCTTTCC CTGTCCCCCA GGGGAATGTG GCACGGCTCT GGTTTCATGG TTAGGGGAGA GGTGGAGAGG GAGATTGGAT GGGACTTTGG CCCAGTGAGG GCTGAGGACC TCCTCTTCCA CCTGAGGGCA TCACGGAGGT TCAGGTACGG AGTCATGAAG GGTTTCGTGT ACGAGATACC TCCGCAGAAC TTAATGGACT TCATGAGGCA GAGGAGGAGA TGGATACTGG GGATACTTGA CGGGTTCAAG GACGGAAGGA TGGATGTAAG GAATAGGGTG AAGTACCTTC TGGGTTTAAC TAGCTGGTAC TCCTCCGCGT TGGGTTTCCT GGTGCCCCTA TTCGTGTACA TGAGGGATGC AAGCGCACCT CTTCCCATTG GACCATATCT AACCGGGCCC ATCTGGTTCA CCCTGCTCCT CATGTTAAAG GACGGCTTTG TGCTCACTAG GAGGTATGCT GGCCTCAGGG GACGGGACCT TCCAAGTTTC ATGGTGAAGG GATTAGTAGG GCTCATGCTT GAGGCCATAG CCCCTTGGTA TACCCTGTTT ACAGGATGGA GGGATCACGG GTTCCACGTC ATAGATAAGG GATAG
|
Protein sequence | MLGSSPLMAS LFPGWQFKTA LDAVKLGWYF ALPKVGEFGA SKLVLEKEKP FIDVNELKGF RYDGLVINQV VTSGKNLRTV ERTVKSIHHW YGEVERRYSV RIPHEVWIVV DEGREAGLRG LDARVEVVPA EYRTRNGSMF KARALQYAVE QRGGTGSGTW VYYHDEETVF GEDSVLGIAE FVQGDRDVGV HPIVYPVNWR GDVLSTIETL RTSNDVVSLS LSPRGMWHGS GFMVRGEVER EIGWDFGPVR AEDLLFHLRA SRRFRYGVMK GFVYEIPPQN LMDFMRQRRR WILGILDGFK DGRMDVRNRV KYLLGLTSWY SSALGFLVPL FVYMRDASAP LPIGPYLTGP IWFTLLLMLK DGFVLTRRYA GLRGRDLPSF MVKGLVGLML EAIAPWYTLF TGWRDHGFHV IDKG
|
| |