Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0687 |
Symbol | |
ID | 5105293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 627777 |
End bp | 628766 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506591 |
Product | FAD dependent oxidoreductase |
Protein accession | YP_001190786 |
Protein GI | 146303470 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000330964 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000300742 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAAGGTGG CCGTGGTAGG AGGGGGCCCG GCTGGAATAT CACTGGGATG GTTCTTAAGG GGAACCAAGA TTGACGTCAC GGTTTATGAA GGACTGGATG ATGTGGGGAA GAAACCATGT GCGTGGGGAG TTCTCAAGGG GATAGAGAAC TACTTGGACA TCCCAAAGGA GGCAATCTAC AGTGAGATAA AGGGGTTCAG GATCTACCTA GACAACAAGC TCATCTCAGA GGTTAGGGAG AGGGAGAGGC TTGGGTATAT CGTGGATAAG CCACTCCTAC TAAGGAAACT GGGGGAGAAG ATTGATCTGA GACTTAACTC CAAGGTAGTC CTGAACAAGG GCAAGCTCGT GGTGAACGGG AAGGAGGAGG AGGCTGACAA GGTGATAATT GCCACTGGCC ACTATTCCCT CTCCAAGGAT GTCACAATTC CCGCACTCCA ATACATTACT GACCTAAATT ACGATCCAGA AATGGTGGAC ATGTACTTCT ACTCTGACCT CCTTGGATAT GGATGGATAT TCCCTGATCC AAAGGGGGCT AAGATTGGGG TAGGGGGTTA TGCTTCCGTG GACTTCATAA GGGAGAAACT AAAGACCATC ACGTCAGGTA GGATCATAAC CCAACATGGA GCGAGAGTCG CTGACTACGG AGTTTTCGAG GATAGATTGA ACGGCTCATA CATTGGCGAG GCCTTGGGAA CAGTGTACGC GGTCACGGGG GAGGGGATAA GGCCATCAAT CATCTCCTCA AGGATTATGG CAGACTCCCT CTTGGAGGGG AAGGACTTCT CTAGGGAGTT CAAGAGGAGC AAGCTTCACT GGACCCTTCA GGTGCACGCG GAAGTGATAA AGAGAGCTAA GGCATCCAAC TCGGTAAAAG GATTGGAGAG GGTATTACTA AGGGCAGATC CAAAACTGGT TGTGAAGTTC GCCATGGGCG ATTTCGGAAA GTTAGACCTT ATTAAACTGT TCGGGAGTGC TATATTATGA
|
Protein sequence | MKVAVVGGGP AGISLGWFLR GTKIDVTVYE GLDDVGKKPC AWGVLKGIEN YLDIPKEAIY SEIKGFRIYL DNKLISEVRE RERLGYIVDK PLLLRKLGEK IDLRLNSKVV LNKGKLVVNG KEEEADKVII ATGHYSLSKD VTIPALQYIT DLNYDPEMVD MYFYSDLLGY GWIFPDPKGA KIGVGGYASV DFIREKLKTI TSGRIITQHG ARVADYGVFE DRLNGSYIGE ALGTVYAVTG EGIRPSIISS RIMADSLLEG KDFSREFKRS KLHWTLQVHA EVIKRAKASN SVKGLERVLL RADPKLVVKF AMGDFGKLDL IKLFGSAIL
|
| |