Gene Information |
Locus tag | Rmet_1522 |
Symbol | |
ID | 4038325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | + |
Start bp | 1644017 |
End bp | 1645825 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637976906 |
Product | respiratory-chain NADH dehydrogenase domain-containing protein |
Protein accession | YP_583674 |
Protein GI | 94310464 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit; [COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit |
TIGRFAM ID | |
|
|
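The Start bp, End bp, Gene Length and Protein Length fields above are linked by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon. Both are assumptions, although they are consistent with the values listed. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Rmet_1522.
length = gene_length(1644017, 1645825)
print(length, protein_length(length))  # 1809 602
```

With the listed coordinates this gives 1809 bp and 602 aa, matching the Gene Length and Protein Length fields.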
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000148322 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000569435 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAAAG ACATCCGGAC AATACTTGAG CGTTACCGCT CAGACAGAGC CCGTCTGATG GACATACTTT GGGATGTTCA GCATCTGTAC GGGCACATTC CCGATGAGGT GCTGCCGCAA CTAGCGGCCG AGTTGAACCT GTCCCCACTC GACATTCGGG AAACAGCGTC GTTCTACCAT CTTTTCCATG ACAAACCGTC GGGAAAGCAT CGGATTTACC TGTGCAATTC CGTGATTGCC AAGATGAACG GCTATCAGGC GGTGCACGAT GCCCTCGAGC GCGAGACTGG AGTCCGTTTC GGCGAAACCG ACCCGAATGG AATGTTCGGC CTGTTCGAAA CGCCCTGTAT CGGACTCAGC GATCAGGAGC CGGCGATGCT GATCGACAAG GTGGTATTCA CCCGCCTGAG GCCCGGAAAG ATCGCAGACA TCATCGCCCA GCTGAAACAG GGGCGGTCCC CGGCCGAGAT CGCGAACCCG GCCGGGTTGC CCAGCGACGA TATCGCCTAT GTCGATGCCC TGGTGGAATC CAATGTGCGC ACGAAGGGGC CGGTATTCTT CCGTGGCAGG ACGGATTTCA AGGCCGTGCT CGACCACTGC CTGACGCTCA GGCCCGAACA GGTGATCGAC GAGATCATCG AATCCAAGCT GCGCGGACGC GGGGGCGCCG GGTTCACGAC CGGGCTGAAG TGGCAGCTTT GCCGGCGTGC TTTGAGCGAC ACGAAGTACG TCATCTGCAA TGCCGACGAA GGCGAGCCCG GGACCTTCAA GGACAGGGCC CTGCTGACAC GCTCGCCCAA GGAGGTATTC ATCGGAATGG CGATCGCCGC CCATGCCATC GGCTGCCGCC ATGGCATCGT TTACCTCCGC GCGGAGTACT TTTATCTCAA GGATTATCTG GAACGACAGC TTCAGCAGCT CAGGGATGAC GGGTTGCTGG GACGCGCTAT TGGCGTCCGG CGGGACTTTG ATTTCGATAT CCGTATTCAG ATGGGGGCCG GCGCTTATAT CTGCGGTGAC GAGTCGGCCC TCATCGAGTC CTGCGAGGGA AAACGCGGCA CGCCACGGGT GAAACCTCCC TTTCCGGTGC AGCAGGGGTA TCTGGGAAAA CCCACCTGCG TCAACAACGT CGAGACCTTT GCCGCCGTCT CCCGAATCAT GGAGGAAGGC GCCGACTGGT TCAGTGCGAT GGGAACGCCG GACTCCGCCG GCACCCGACT GCTGAGTGTT GCCGGCGATT GCAGCAAGCC GGGGATCTAC GAGGTGGAAT GGGGCGTGAC GCTCAACGAG GTACTGGCGA TGGTCGGAGC GAAGGAGGCG CGGGCCGTTC AGATCAGCGG GCCCTCGGGT GAGTGCGTGT CGGTGGAGAA GGATGGCGAA CGCAGGCTCG CGTACGAGGA TCTTTCCTGC AATGGTGCGT TCACGATCTT CAACCGGAAC CGTGACCTGC TGGACATCGT CAAGGACTAC ATGCAGTTCT TCGTCGATGA GTCCTGCGGC ATTTGCGTGC CGTGCCGGGC CGGCAACGTC GACCTGCACA GGAAGGTCGA ATGGGTGATC GCCGGCAAGG CGTGCCAGAA GGATCTGGAC GACATGGTCA GTTGGGGAGC GCTGGTGCGC AAGACCAGTC GATGTGGTCT TGGCGCCACC TCTCCCAAAC CCATCCTGAC GACGCTGGAG AAGTTTCCCG AGATCTATCA GGACAAGCTG GTGAGGCACG AGGGCCCGCT GCTGCCGTCG TTCGACCTCG ATACCGCCTT GGGCGGGCAT GAGAAGGCGC TGAAGGAACT GGAGGAGGCA 
AAAAAATGA
|
Protein sequence | MSKDIRTILE RYRSDRARLM DILWDVQHLY GHIPDEVLPQ LAAELNLSPL DIRETASFYH LFHDKPSGKH RIYLCNSVIA KMNGYQAVHD ALERETGVRF GETDPNGMFG LFETPCIGLS DQEPAMLIDK VVFTRLRPGK IADIIAQLKQ GRSPAEIANP AGLPSDDIAY VDALVESNVR TKGPVFFRGR TDFKAVLDHC LTLRPEQVID EIIESKLRGR GGAGFTTGLK WQLCRRALSD TKYVICNADE GEPGTFKDRA LLTRSPKEVF IGMAIAAHAI GCRHGIVYLR AEYFYLKDYL ERQLQQLRDD GLLGRAIGVR RDFDFDIRIQ MGAGAYICGD ESALIESCEG KRGTPRVKPP FPVQQGYLGK PTCVNNVETF AAVSRIMEEG ADWFSAMGTP DSAGTRLLSV AGDCSKPGIY EVEWGVTLNE VLAMVGAKEA RAVQISGPSG ECVSVEKDGE RRLAYEDLSC NGAFTIFNRN RDLLDIVKDY MQFFVDESCG ICVPCRAGNV DLHRKVEWVI AGKACQKDLD DMVSWGALVR KTSRCGLGAT SPKPILTTLE KFPEIYQDKL VRHEGPLLPS FDLDTALGGH EKALKELEEA KK
|
| |
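The sequences above are printed in space-separated blocks of ten characters. The sketch below strips that grouping and computes the length and GC percentage. The helper names are hypothetical, and the example string is only the first 30 bases of the gene sequence, so its GC percentage is not expected to match the 60% reported for the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, copied from the record above.
excerpt = clean_sequence("ATGAGCAAAG ACATCCGGAC AATACTTGAG")
print(len(excerpt), round(gc_percent(excerpt), 1))
```

Pasting the full Gene sequence field into `clean_sequence` should give a string whose length can be compared with the Gene Length field.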