Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0804 |
Symbol | |
ID | 5105127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 733639 |
End bp | 735249 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506709 |
Product | Pyrrolo-quinoline quinone |
Protein accession | YP_001190903 |
Protein GI | 146303587 |
COG category | [S] Function unknown |
COG ID | [COG1520] FOG: WD40-like repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGAA TACTCGTAGC CATAGCGTTA GTTTCCGTGT TTTTAGTTGG TTCCTTCCTA GGAGCTCCCA TGATACAGTT CTTTTCCTCG GTGATAACGC CAATACCTCA ACCCATTGTA AGGACTTACG ACATGTATAA CACAACCTAT TTCCCGTATG AGGTGAAAGT CGTGTATTAT CCAGCTAATG CAACGAGTCA GAATCTGGAT CTACCAAGTT ATTGGGGAGT TACGAACGGT GGCCAGTCCC ACAATGCAGC CCTTACGACC ACATGCACCG AGTTAATTCA GGGTGTTGTC TGGCAACAGG ATTTTGCCCA TATGGCTGGT GCAGCCCTCA TTCCCATGAC TGCTCCACAG AGTATGTTAC CCGGTGCTAG CGTCATGGGA ACAAGGTCCG CACTGGTAAT GTTAACTCAA ATGGTGGGAG AACCGTTGGG CGTTACATTG GCCGATAATT TACTATTTGT GGAAGAGGAT AGCGGACCAG GAAGTATCTT CGCAGTGAAT CCTCTGAACG GACAGGTAGT GTGGTATGCC ACAGGACTAG CCAGTTACGC GATGAATAAC CCCATCGTTT ACAACGGGAT AGTGTATGTG ACTGTGGGAG ATGTTGGTTT CAACTTCGCG AATTTCGTTC ACTACGAGAA GGGGCAATTC TCCTCGATTC ACAGGGGGAT GGCATATGGA GCCATCTACG CGTTTAATGC CACTGACGGT GAGCTGTTAT GGATGAGGTT CACGATGGGA GAGGCAATGC CAGCACCCGC GGTTTATAAC GGAATCCTTG CCTATTCAGA CGGTGGTGGG GAGTTCATAG GAGTTAATGC GACCACAGGA CAGGTCCTAT GGCAGGATAT GATGCCAGGC CTGTTTGATA GCATGAGTAG TGTAAACTAC TACGTTCTGC CTAACGGCAC TCCCTTATTC ATTGCTGGAT TCACAAGTCT GACTGAGCCA TATGGACTTC TGGTCGCTGT TAATGGAATG ACAGGGAAAG AGGTATGGAA TGCATCTCTT CCTGCCCCCA ATAAGCCATT CAATACAGGG ATGGGAGATG TGCCCCCAGC TGTGGATCAG CAGGCAGGTA TCGTTGTGCA GTCAACTGTA GCAAACGCCG AGCCCAATGG AACAGTTGAC ACCATGGTTC TTGCGGTGAA TGCAACTAAC GGCCACGTTC TGTGGGTGAC AAATCTGGGC AGAGGTTATA CTCCACCAGC ATTCAAAGGG GCAATCCCAA TGATTTACAA TAACACCGTT TACGTAGGTT CTCCTTCACT GGGTAAGGAG TTTGCCCTAA ATCTCACTAA TGGCCAAATA CTGTGGCAGA CCAGGCTTAA CGGGATAGGA TTACCACCAA AGGCTCCTGG TGGACCCAGG GGTGGAGCAA CCTATTACGA CAACTTGCTG TGGGTAGCTG GAGGTCCTTA CGTTTACGTG TTGAACCCCC ACAACGGTGA ACTATTGCAA CAGTACTATG TTGGCGGAAG GTTCGGCATA GTTAACCCCG TGATAGTCGG AAGCACAATG TATCTAACTA ACAGTTACGG CTGGGTGGTG GCGATCCCAC TCTATCAGAT CTACCCCGAC TACGTACTTT ACGCTAGCTA A
|
Protein sequence | MNRILVAIAL VSVFLVGSFL GAPMIQFFSS VITPIPQPIV RTYDMYNTTY FPYEVKVVYY PANATSQNLD LPSYWGVTNG GQSHNAALTT TCTELIQGVV WQQDFAHMAG AALIPMTAPQ SMLPGASVMG TRSALVMLTQ MVGEPLGVTL ADNLLFVEED SGPGSIFAVN PLNGQVVWYA TGLASYAMNN PIVYNGIVYV TVGDVGFNFA NFVHYEKGQF SSIHRGMAYG AIYAFNATDG ELLWMRFTMG EAMPAPAVYN GILAYSDGGG EFIGVNATTG QVLWQDMMPG LFDSMSSVNY YVLPNGTPLF IAGFTSLTEP YGLLVAVNGM TGKEVWNASL PAPNKPFNTG MGDVPPAVDQ QAGIVVQSTV ANAEPNGTVD TMVLAVNATN GHVLWVTNLG RGYTPPAFKG AIPMIYNNTV YVGSPSLGKE FALNLTNGQI LWQTRLNGIG LPPKAPGGPR GGATYYDNLL WVAGGPYVYV LNPHNGELLQ QYYVGGRFGI VNPVIVGSTM YLTNSYGWVV AIPLYQIYPD YVLYAS
|
| |