Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1821 |
Symbol | |
ID | 5105384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1767667 |
End bp | 1768896 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507720 |
Product | FAD linked oxidase domain-containing protein |
Protein accession | YP_001191899 |
Protein GI | 146304583 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.778953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCCT ATCTGAAGGA CCTGGAAAGG GAGTTCGGCT CGAGATTCAT CTCAAGAGGA GAGATCATTG ATCAGTACTC AAGTTCCCCC TACCTAGTCT CACCAGTTCT CTCGAAAATG GGTAAAAGGA TCCTAGGCGT TGTCGTGGCC GAGGACATCG ATGACATCAA GAACCTCCTA CGCTTCTGTG ACGCCAACAG GATTCCGCTC CTGGCCAGGG GAGCTGGGAC TTCAACCATA GGCCAGGTAT TACCCATAAC TCCGTGTATT GTCCTGGATA TACAGAGATT AAATAAAACT CTGGAATACG ACAAATACCT GAGAGTTTCG CCCGGGGTTA AGGTCCTGAC AGCACTCAAC TACCTCAGGA AGAGGGGCAA GGAGCTCCAG GTCTACCCCA GTAGCTTCTA CATCTCCACC CTCGGTGGCT ACATAGCTGG AGGAGACGTT GGGATAGGCT CGTATCAGTA CGGCTACCAT TTCGACCATG ATGGGGTCAG AAGGTTAACT GTGTTGGGGA CCACTGGGAC CTACGAGCTC AAGGGAAAGG AGACGCTGGC AGTCTCGCAG GCAGCCGGGA CGACAGGCGT GATCGCGGAG GCCGAGCTCT CGGTAGTGGA TTACGAGGAC TGGAGGGATC AGCTAATCAG GGTTGACGAG GTGGAAGGAG TAGTGAAGTT GCTCAAGAGA CTCGAGGAGG ACAGGCCTAG GATCAGGAGA ATAACCTTGG AGGATTACGA GACTCTCTCC TTGATCGCCA AGGGTAGGAT CAACCCAGGA AAATGGAACG TAATAGTCTC GAGCACCAAG AGCTTTGGGG AAGAGGTTGA CATGAGATTT CTGGATGAGC TCGCGTTCGC AGCGATTTAC GTTACCATGA GCAAGTTAAC CGGGTTCTCG AGGTACTTCT ACGAGGTGAG GCTCCTCTCA CTGGAAAGCT TCCTGAAGGT AGTGACGCAG GTAAAGATGG CCCTTGGTTC TAAGGTTCTA GTTCACGGTG ACGTCATGAC GTTGAGGGGG GAGACCGTGG TGTACACAGT CTTCATATCG GAAAGGGAAA ACTTTGAGGT AATAGACTCC ATAATGCTCA AGGAGGGAAT ACCCTTCGAG ATACACTCCC TCGTTGTGAA TGACAGGGTA GATGAGGAAT TCAGGCTTGA GTTAATGAGG AAATACAAGG AAATCGTGGA CCCTCATAAC ATCCTGAATC CGGGGAAGTT AAGAGTCTAG
|
Protein sequence | MESYLKDLER EFGSRFISRG EIIDQYSSSP YLVSPVLSKM GKRILGVVVA EDIDDIKNLL RFCDANRIPL LARGAGTSTI GQVLPITPCI VLDIQRLNKT LEYDKYLRVS PGVKVLTALN YLRKRGKELQ VYPSSFYIST LGGYIAGGDV GIGSYQYGYH FDHDGVRRLT VLGTTGTYEL KGKETLAVSQ AAGTTGVIAE AELSVVDYED WRDQLIRVDE VEGVVKLLKR LEEDRPRIRR ITLEDYETLS LIAKGRINPG KWNVIVSSTK SFGEEVDMRF LDELAFAAIY VTMSKLTGFS RYFYEVRLLS LESFLKVVTQ VKMALGSKVL VHGDVMTLRG ETVVYTVFIS ERENFEVIDS IMLKEGIPFE IHSLVVNDRV DEEFRLELMR KYKEIVDPHN ILNPGKLRV
|
| |