Gene Information |
Locus tag | RoseRS_2998 |
Symbol | |
ID | 5209966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 3762199 |
End bp | 3763884 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640596590 |
Product | proton-translocating NADH-quinone oxidoreductase, chain M |
Protein accession | YP_001277312 |
Protein GI | 148657107 |
COG category | [C] Energy production and conversion |
COG ID | [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) |
TIGRFAM ID | [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M |
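
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon (the record itself does not state either convention). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for RoseRS_2998.
length = gene_length_bp(3762199, 3763884)
print(length)                     # 1686
print(protein_length_aa(length))  # 561
```

Both results agree with the Gene Length (1686 bp) and Protein Length (561 aa) fields listed above.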
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00848252 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.769914 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATACC TGGCATACGC ATCGACGCCA TGGCTGACGC TGCTGATCCT GTCACCCCTG GTCGGGCTGG CGCTGACGGC GCTGGCAGGC GCGCTGCGCC TCGATGATCG CACAGCCATG ATCGGCGCAA CGGCGTGGTC TACCGTCCCC CTTGCTCTGG CAATTATCGT CTGGGTCGGG TTCAACCCGA ACGCCACCGC CGATGGGCAG GGGGTCGTGC AATTCGTCGA GAAGATTCCG TGGGTGCAGG CGATCCGGGT CGATTATTTC GTTGGGGTGG ACGGCATCAG TATGCCGCTG GTGATCCTGA CGGCGGCGAT GACGCCGGTG GCGATGCTGG CGTCGTTCAG CGTCACACAG CGGGTGAAAC TGTACCTGGC GCTGATGTTT CTGCTGGAAA CGGCAATGCT GGGGTACTTC CTGGCGCTCA ATTTCTTCTT CTTCTTCATC TTCTGGGAAT TCAGCCTGGT TCCGGCATAT TTTCTCATTC AAGGATGGGG ACGCCGCCAC ACCACTGATG CCGACCCGGA ACAGCGCCGC CGCGATGCCG CACTGAAGTT CTTCGTCTAT ACCATGGCCG GTTCGATCGG CATGCTCCTC CTGTTTCAGT TCTTCTATGT CGCCACCGCC GCCGCCGGTA TTCCAACGTT CGATCTGATC ACGCTGGCGC GATTGGGGCA GGGGTTGACG GTCGAACGCG CGGCGCTCGA TCCGGTGAAC CTGACGTTGC AAGAGATTAT CTTCAATTAT GTCGAGCAAC TCGGAATTGC CGATGTGCTT GGGCGTTACC CGTTGCTCTA CACATCGATT GCGTTCTGGG CGATTTTTAT CGCCTTTGCG ATCAAACTCG GCATCTGGCC CTTCCACACC TGGCTGCCGG ACGCCTATAG CGAAGCGCCG CCAGCGGCGT CCATCCTGTT GGCGGCGGTG ATGTCGAAAA TGGGCGCCTA CGGCATGCTG CGCCTGATGC TTCCCCTGGT TCCCGACGCG GCGCAGTATT TTGGTCCGGC AATCGGCGCG CTGGCGCTGA TCGGCGTTGT GGCTGGCGCC TTCGGCGCAC TCGGTCAGGT CGGCGGCGAC CTCAAGCGCC TGATCGCGTA CACCTCGGTC AACCACATGG GGTACGTCGG TCTGGCAATT GCCGCAGCTG CGACCGTCGG CGCGGCGGAT GTCGCCACCC GTGCAACGGC GATCAATGGC GCGTTGTTCC AGATGGTTGC GCACGGTCTC TCAACCGGCG CGCTCTTCCT GATGGCGGGC ATGCTTGCCG AACGCACCGG CTCCGACGAC ATGCGCTCGC TCGCCGGGTT GCGCACAACG ATGCCGGTCT TTGCCGGTGC AATGGGCGTG GCGACCTTCG CCAACCTGGG GTTGCCCGGT CTTGCCGGGT TCGTCGGCGA GTTCTTCATC TTCCGCGGCG TCTGGGCATC GCTGCCGCTC TTCGCGCTGC TGGCGACCAT CGGGCTGGTT GTGACCGCGC TGGCGCTGCT GCGCATGTAT GGGCAGATGT TCCATGGGCA GACGAACGAA CGCAGCGCTA TGCCCGACAT GCGTCTCGCC GGACGCGAGT TCCTGGCAGT TGCACCGCTG CTGATCGCGC TGCTGATCCT TGGCATCTAC CCGGCGCCGA TCATGGACCT GTCGAACCAG ACGGCAACCG CGCTGGCGAG AGTATTCCTG CCGTGA
|
Protein sequence | MTYLAYASTP WLTLLILSPL VGLALTALAG ALRLDDRTAM IGATAWSTVP LALAIIVWVG FNPNATADGQ GVVQFVEKIP WVQAIRVDYF VGVDGISMPL VILTAAMTPV AMLASFSVTQ RVKLYLALMF LLETAMLGYF LALNFFFFFI FWEFSLVPAY FLIQGWGRRH TTDADPEQRR RDAALKFFVY TMAGSIGMLL LFQFFYVATA AAGIPTFDLI TLARLGQGLT VERAALDPVN LTLQEIIFNY VEQLGIADVL GRYPLLYTSI AFWAIFIAFA IKLGIWPFHT WLPDAYSEAP PAASILLAAV MSKMGAYGML RLMLPLVPDA AQYFGPAIGA LALIGVVAGA FGALGQVGGD LKRLIAYTSV NHMGYVGLAI AAAATVGAAD VATRATAING ALFQMVAHGL STGALFLMAG MLAERTGSDD MRSLAGLRTT MPVFAGAMGV ATFANLGLPG LAGFVGEFFI FRGVWASLPL FALLATIGLV VTALALLRMY GQMFHGQTNE RSAMPDMRLA GREFLAVAPL LIALLILGIY PAPIMDLSNQ TATALARVFL P
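
The GC content field and the protein sequence can in principle be re-derived from the gene sequence. Below is a minimal sketch of how that might be done, assuming the sequence is pasted in with its 10-base groups separated by spaces. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table used is the standard one, whose amino acid assignments are shared by translation table 11; table 11's alternative start codons are not handled here.

```python
# Standard codon table laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGACATACC TGGCATACGC ATCGACGCCA"
print(translate(prefix))  # MTYLAYASTP
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the 63% GC content and the 561-residue protein to be checked the same way.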