Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1752 |
Symbol | |
ID | 5104752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1687634 |
End bp | 1689016 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640507647 |
Product | hypothetical protein |
Protein accession | YP_001191831 |
Protein GI | 146304515 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000573149 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0520768 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTAC CAGCAACTGT GTTCGCAGAT GACGTACTCA TCGAGAAAAT GAAACAGGAC ATGACGCTGA GACAGGCAAC TAACGTAGCC TGTCTTCCAG GAGTCCAAGA ATCTGTTTAC GTGCTTCCCG ATGGCCACCA GGGATACGGA TTCCCCATAG GGGGAATTGC TGCCACGTCC ATAGATGAGG GAGGAGTAGT CAGCCCAGGC GGAATAGGAT ATGATATAAA CTGTGGAGTT AGGTTGTTAA GGACTAACCT AGATTATTCT GACATTAAAC CGAAACTAGT TGATATTGTG GAAGAGCTTC ACAGGAGCGT CCCCAGCGGG GTAGGAAGCG AGGGCAGGAT AAAGCTCACC CCTCAGGAGC TGGACAACCT CCTACAGGAG GGCGTTAAGT GGGCAGTGGA TAAGGGTTAT GGATGGAGTG AGGATATGAA TAACATTGAA CAAAGGGGTA GTTGGGAACT AGCTGATCCT AGTAAGGTAA GTCAAGTGGC AAAACAGAGG GGAGCCGCAC AGCTGGGAAC TCTAGGGGCT GGAAATCACT TCCTTGAAGT ACAGGTCGTG GATAAGATAT ATGACGAGAG GATAGCTAGG GCCCTCGGAA TAACAAGGGA AGGGCAAGTT ACTGTCATGG TTCACACGGG ATCTAGGGGT TTAGGTCATC AGATAGCCAG TGATTACCTG CAAATAATGG AAAGAGCCAT GAAGAAATAT GGAATAGAGG TACCAGATAG GGAACTGGCA GCGATTCCCT TTGAGAGCAG GGAGGGACAG GACTATTTCA GGGCGATGGT GTCTGGAGCT AACTTTGCCT GGAGTAACAG GCAACTAATT ACCAACTGGG TGAGGGAGAG CTTCGGCAAG GTGTTCAAGG TAGATCCTGA GAAGTTGGAC CTTCACATTG TGTATGATGT GGCCCATAAC ATAGCCAAGA TAGAGGAGTA CGACATAGGC GGGAAAAGGA AGAAGGTCCT GGTTCACAGG AAGGGTGCGA CCAGGGCATT CCCACCAGGA AGCCCTGAAA TCCCACAGGA ACACAGGGAA ATAGGGCAAG TAGTTCTTAT TCCAGGAAGC ATGGGAACAG CAAGTTACGT AATGGCTGGA ATACCTGAGG GTAGAAGGAC CTGGTTCACT GCCCCGCATG GTGCGGGAAG ATGGATGTCG AGGGAGGCTG CAGTGAGAAA CTACCCTGCC AACTCAGTTG TGGGATCCCT TGAGGAGAGG GGTATAATAG TGAGGGCTGC CACACGTAGG GTTATAGCTG AGGAGGCACC CGGTGCATAC AAGGACGTAG ATAGGGTTGC GAGGGTAGCT CACGAGGTTA AGATAGCTAA GCTAGTTATG AGGCTTAGAC CCATAGGGGT GACCAAGGGT TGA
|
Protein sequence | MKVPATVFAD DVLIEKMKQD MTLRQATNVA CLPGVQESVY VLPDGHQGYG FPIGGIAATS IDEGGVVSPG GIGYDINCGV RLLRTNLDYS DIKPKLVDIV EELHRSVPSG VGSEGRIKLT PQELDNLLQE GVKWAVDKGY GWSEDMNNIE QRGSWELADP SKVSQVAKQR GAAQLGTLGA GNHFLEVQVV DKIYDERIAR ALGITREGQV TVMVHTGSRG LGHQIASDYL QIMERAMKKY GIEVPDRELA AIPFESREGQ DYFRAMVSGA NFAWSNRQLI TNWVRESFGK VFKVDPEKLD LHIVYDVAHN IAKIEEYDIG GKRKKVLVHR KGATRAFPPG SPEIPQEHRE IGQVVLIPGS MGTASYVMAG IPEGRRTWFT APHGAGRWMS REAAVRNYPA NSVVGSLEER GIIVRAATRR VIAEEAPGAY KDVDRVARVA HEVKIAKLVM RLRPIGVTKG
|
| |