Gene Rmet_4300 details


Gene Information

Locus tag: Rmet_4300
Symbol: nuoF
ID: 4041158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 893833
End bp: 895077
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 62%
IMG OID: 637979722
Product: NADH:ubiquinone oxidoreductase complex I, chain F
Protein accession: YP_586435
Protein GI: 94313226
COG category: [C] Energy production and conversion
COG ID: [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
TIGRFAM ID:
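
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
# Cross-check of the coordinate and length fields in the record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the last codon is a stop codon excluded from the protein.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1

def protein_length_aa(length_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1

length = gene_length_bp(893833, 895077)
print(length)                      # 1245
print(protein_length_aa(length))   # 414
```

Both values agree with the Gene Length and Protein Length fields listed in the record.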


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0070015
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGAAAAGC TGAATCAGGT TCTCCTGAGG GACGATGTCC GCGTTGGTGC GGACCTGGAG 
GCATGGCTCG CAATTGGCGG AGGAGAGGGG CTGGCCAAGG CCTTGTGCGA TCCAGGCGCA
GTCATAGGCG AGATCGAGCA GGCCGATCTG CGCGGAATGG GCGGCGCCGG ATTTGCGACC
CATCGCAAAT GGGCGCCGGT TGCCGCCGCG GCTGACGGCG ACAAAACCAT CATCTGCAAC
GGCAACGAAG ATGAGCCGGG GACCTTCAAG GACCGCTTCC TGCTGGAGCA CACGCCACAT
CAGGTGATCG AGGGTGCCCT CATCGCCGCG GCCGCCACGC GTGCCAACCA TATTGTCCTT
TACGTCAACC CTCACCAGCA GCAGGCCATT GCGGTCATAC GACAGGCCGT CGGGCAATGG
CAGGCGCACC CCCGGTACGC CGAACTCGAG CGACTGCTGG GCGCCCCCCT GTCGCTTGGC
GTTGTACCGA GTTCAGGGCT ATACATCGGG GGCGAGGAGA CGGCGGTGAT CGCAAGCGTC
GAGGGCGGAT TCCCGTTCCC GCGGCGCAAG CCGCCCTTTC CCAGTCAACA AGGCGTGCAT
GGCGCGCCAA GCATCGTCAA CAACGTCGAA ACGCTAGCGC ATATACCAGG AATTCTTCGC
CACGGTGCTC AGTGGTATCG CGATCTCGGC ATCGGTAACG CAACCGGAAC CAAACTCTAT
TCACTCTCTG GCGACGTATT GCGCCCCGGT CTGTATGAAC TACCAATGGG AACGAGCCTG
GAGTCCCTGG TGTTCGAGCA CGGTGGCGGC ATGTTGCAAG GCAAGGAGTT CAAGGCCGTC
TTTACAGGGG GGCCCTCGAA TACTCTGCTG ACGAAGCGTG ACCTCGATGT CGCCCTGGAC
TTTGATTCGG TGCGACTAAG ACGCTCACGT CTGGGAACGG GGGCGATGAT CGTTGTATCG
GAAGGCACCA GCATTGTCCG CAAGGTCGCT GAATTTGTGA GCTTCTTCGC GCAAGGATCG
TGCGGCCAAT GCCCACCGTG CAAAGGTGGC AGCTTCCAGT TGATGCGATT GCTGAACCGC
ATCGATACGG GCCGCGGTGT GCATGCCGAT CTGGCAGCGC TGGAGAATCT GTGCCGCATC
CTACCCGGCA GCGGCCGCTG CGGCCTCATC GACGGCGCCG TGACGGTGGT GGAGAGTTCC
CTGCACCAGT TCCGTGAGGA GTACGAGGCG CTGCTTATGG CATAG
 
Protein sequence
MEKLNQVLLR DDVRVGADLE AWLAIGGGEG LAKALCDPGA VIGEIEQADL RGMGGAGFAT 
HRKWAPVAAA ADGDKTIICN GNEDEPGTFK DRFLLEHTPH QVIEGALIAA AATRANHIVL
YVNPHQQQAI AVIRQAVGQW QAHPRYAELE RLLGAPLSLG VVPSSGLYIG GEETAVIASV
EGGFPFPRRK PPFPSQQGVH GAPSIVNNVE TLAHIPGILR HGAQWYRDLG IGNATGTKLY
SLSGDVLRPG LYELPMGTSL ESLVFEHGGG MLQGKEFKAV FTGGPSNTLL TKRDLDVALD
FDSVRLRRSR LGTGAMIVVS EGTSIVRKVA EFVSFFAQGS CGQCPPCKGG SFQLMRLLNR
IDTGRGVHAD LAALENLCRI LPGSGRCGLI DGAVTVVESS LHQFREEYEA LLMA
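
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it builds the standard codon-to-amino-acid mapping (translation table 11 uses the same codon assignments as the standard code and differs only in permitted start codons) and translates the first and last lines of the gene sequence. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 shares
# these assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate in frame 1, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First line of the gene sequence as listed in the record.
first_line = "ATGGAAAAGC TGAATCAGGT TCTCCTGAGG GACGATGTCC GCGTTGGTGC GGACCTGGAG"
# Last line; it starts in frame because the 20 preceding lines hold
# 60 bases each (1200 bases, a multiple of 3).
last_line = "CTGCACCAGT TCCGTGAGGA GTACGAGGCG CTGCTTATGG CATAG"

print(translate(first_line))  # MEKLNQVLLRDDVRVGADLE
print(translate(last_line))   # LHQFREEYEALLMA*
```

The translated fragments match the start and end of the listed protein sequence, with the trailing `*` corresponding to the TAG stop codon.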