Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0680 |
Symbol | |
ID | 5105286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 622175 |
End bp | 623776 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640506584 |
Product | hypothetical protein |
Protein accession | YP_001190779 |
Protein GI | 146303463 |
COG category | [S] Function unknown |
COG ID | [COG3356] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000787674 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000670884 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGATTCGG AAAAACTAAC TAGGAATTAC TATTCCAAGC TCATAAGGTT ACCATCAACA GAGGCCCTTT CAATCTGGGT GGGAGCGGAA GTAGGTCTAT CCTTTCTGAG GTCGCTATCT GACGGATTCA CGTATTTGAC AGGCTTCCTA GTGTACTTGT CCTTGTCCCT AATATCCCTT CACAGAAGAG TTAGAACTTT CCTTGCAATG GCGGGAATGT TTGGTATTAT CTACTTGATA ATCTCGTTCT TTCCTTTGGT AATACCCCTC TCCTTTGGCC TCTTTATTCC CCTTATGACT TACGTTATGT TAATAGATTA CGGGGATTTC ACTTCGCCGG GCCTAACCAC GCTAATAGGC CTGATCTCGG CACTTTCCAC GTTTCCAAGG AATATTGAAC TAGTTATTGC CTTCTACTTG ATCGTGGGGC TATTTTCCTA CTTATACTTG ATCCTTGTTA ACAGAAAAGG GAAAAGCGTC ACAGGCATTC CATCTCTGAA CATTGTTAGA CCTTTCCTGA AGGCTATGAG CTACAGGAGG GACGAGGAAG TTGAAAACTT CCTGGAAAAG ATATCTACCG AATTTCACTC AAGCACACTT GTACTGAAAC TTGGAGACGT TCTTCTAGTT TTACCCAGAA TACACTTTGG TATGTACGGA AAGGTGGGGA GTTCCCTATT TCCCTATCAG CTGGAGGAGT TAGTGAACAA CAAGGTAATG GTATTTCACG GTCCTGGAAG CCACGAGATA GATCTGGCCT CAAGCAAGGA GTCACGTAAG CTTGCACAGA TAATCTCCTC GAAGATCAGG GAGGGGAATT GGAAGGAGTT AAGGTTTGAG GGGATAAAGT TCCTATCTGA GGACAGGTTT AGGATGACCT CCCTCGTGTT CGACCATATA ACGCTGAACT TCAGCGAGAG ACCGGGCTAC GGAATAGATG ATCTTCCTGG AGGGTTATGG GACGAATCAC TGAAGACAGG TAATTTCCTA GTTGATTGCC ACAACGAGTC TCTAAAGGAG GAAATAGGGC ACAGGGATGA AAGAGCCTTA AGGGAATTCG TATCCAAGAA GATTCCAGCT ACCGAGGAGA GACCTCTCCT AGTGGGATAC GGTGAGTCTG AAATTAACTC AACATGTGAG GGGATCTGTA GTCGTAAAGT TAAGGCACTT ATAGTTGGAG ATGGGGACAA AAAGATAGTA ATTGCCTACG TGTTTGCCAA TAACGCCAAT GAGGAGACTG GGAAGTTATT ACGCGAGAAG TTTGGTAACC TTTACGAGAA GGTCATCCTC GTCACACCTG ACGATCATTC ATGTACAGGA ACATCCTTTG GAAACCTCTA CACTCCAGCA GAGCCTTGCC CACAGATACT GGAAGCCCTG GAGAAAGCGA TCAAAAATGC TGAAGCAAAC CTCAAGAAAG TAGAGGCAAG CTACATGATA GTGGATGCGA AAGTGAAGGT AATTGGAAAA TTTATTTCGT TGATGGTGGA GGGGCTGGAG CAGGTAGGGA GCTTTGCTAT GAGAACGTTC TGGATCCCTG TGATCTTTCC ATACGTAGCC CTTATAATTC TTCTACTTGG AAATTACCTT GTCAAATTCT AA
|
Protein sequence | MDSEKLTRNY YSKLIRLPST EALSIWVGAE VGLSFLRSLS DGFTYLTGFL VYLSLSLISL HRRVRTFLAM AGMFGIIYLI ISFFPLVIPL SFGLFIPLMT YVMLIDYGDF TSPGLTTLIG LISALSTFPR NIELVIAFYL IVGLFSYLYL ILVNRKGKSV TGIPSLNIVR PFLKAMSYRR DEEVENFLEK ISTEFHSSTL VLKLGDVLLV LPRIHFGMYG KVGSSLFPYQ LEELVNNKVM VFHGPGSHEI DLASSKESRK LAQIISSKIR EGNWKELRFE GIKFLSEDRF RMTSLVFDHI TLNFSERPGY GIDDLPGGLW DESLKTGNFL VDCHNESLKE EIGHRDERAL REFVSKKIPA TEERPLLVGY GESEINSTCE GICSRKVKAL IVGDGDKKIV IAYVFANNAN EETGKLLREK FGNLYEKVIL VTPDDHSCTG TSFGNLYTPA EPCPQILEAL EKAIKNAEAN LKKVEASYMI VDAKVKVIGK FISLMVEGLE QVGSFAMRTF WIPVIFPYVA LIILLLGNYL VKF
|
| |