Gene RoseRS_2998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2998 
Symbol 
ID5209966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3762199 
End bp3763884 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content63% 
IMG OID640596590 
Productproton-translocating NADH-quinone oxidoreductase, chain M 
Protein accessionYP_001277312 
Protein GI148657107 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00848252 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.769914 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATACC TGGCATACGC ATCGACGCCA TGGCTGACGC TGCTGATCCT GTCACCCCTG 
GTCGGGCTGG CGCTGACGGC GCTGGCAGGC GCGCTGCGCC TCGATGATCG CACAGCCATG
ATCGGCGCAA CGGCGTGGTC TACCGTCCCC CTTGCTCTGG CAATTATCGT CTGGGTCGGG
TTCAACCCGA ACGCCACCGC CGATGGGCAG GGGGTCGTGC AATTCGTCGA GAAGATTCCG
TGGGTGCAGG CGATCCGGGT CGATTATTTC GTTGGGGTGG ACGGCATCAG TATGCCGCTG
GTGATCCTGA CGGCGGCGAT GACGCCGGTG GCGATGCTGG CGTCGTTCAG CGTCACACAG
CGGGTGAAAC TGTACCTGGC GCTGATGTTT CTGCTGGAAA CGGCAATGCT GGGGTACTTC
CTGGCGCTCA ATTTCTTCTT CTTCTTCATC TTCTGGGAAT TCAGCCTGGT TCCGGCATAT
TTTCTCATTC AAGGATGGGG ACGCCGCCAC ACCACTGATG CCGACCCGGA ACAGCGCCGC
CGCGATGCCG CACTGAAGTT CTTCGTCTAT ACCATGGCCG GTTCGATCGG CATGCTCCTC
CTGTTTCAGT TCTTCTATGT CGCCACCGCC GCCGCCGGTA TTCCAACGTT CGATCTGATC
ACGCTGGCGC GATTGGGGCA GGGGTTGACG GTCGAACGCG CGGCGCTCGA TCCGGTGAAC
CTGACGTTGC AAGAGATTAT CTTCAATTAT GTCGAGCAAC TCGGAATTGC CGATGTGCTT
GGGCGTTACC CGTTGCTCTA CACATCGATT GCGTTCTGGG CGATTTTTAT CGCCTTTGCG
ATCAAACTCG GCATCTGGCC CTTCCACACC TGGCTGCCGG ACGCCTATAG CGAAGCGCCG
CCAGCGGCGT CCATCCTGTT GGCGGCGGTG ATGTCGAAAA TGGGCGCCTA CGGCATGCTG
CGCCTGATGC TTCCCCTGGT TCCCGACGCG GCGCAGTATT TTGGTCCGGC AATCGGCGCG
CTGGCGCTGA TCGGCGTTGT GGCTGGCGCC TTCGGCGCAC TCGGTCAGGT CGGCGGCGAC
CTCAAGCGCC TGATCGCGTA CACCTCGGTC AACCACATGG GGTACGTCGG TCTGGCAATT
GCCGCAGCTG CGACCGTCGG CGCGGCGGAT GTCGCCACCC GTGCAACGGC GATCAATGGC
GCGTTGTTCC AGATGGTTGC GCACGGTCTC TCAACCGGCG CGCTCTTCCT GATGGCGGGC
ATGCTTGCCG AACGCACCGG CTCCGACGAC ATGCGCTCGC TCGCCGGGTT GCGCACAACG
ATGCCGGTCT TTGCCGGTGC AATGGGCGTG GCGACCTTCG CCAACCTGGG GTTGCCCGGT
CTTGCCGGGT TCGTCGGCGA GTTCTTCATC TTCCGCGGCG TCTGGGCATC GCTGCCGCTC
TTCGCGCTGC TGGCGACCAT CGGGCTGGTT GTGACCGCGC TGGCGCTGCT GCGCATGTAT
GGGCAGATGT TCCATGGGCA GACGAACGAA CGCAGCGCTA TGCCCGACAT GCGTCTCGCC
GGACGCGAGT TCCTGGCAGT TGCACCGCTG CTGATCGCGC TGCTGATCCT TGGCATCTAC
CCGGCGCCGA TCATGGACCT GTCGAACCAG ACGGCAACCG CGCTGGCGAG AGTATTCCTG
CCGTGA
 
Protein sequence
MTYLAYASTP WLTLLILSPL VGLALTALAG ALRLDDRTAM IGATAWSTVP LALAIIVWVG 
FNPNATADGQ GVVQFVEKIP WVQAIRVDYF VGVDGISMPL VILTAAMTPV AMLASFSVTQ
RVKLYLALMF LLETAMLGYF LALNFFFFFI FWEFSLVPAY FLIQGWGRRH TTDADPEQRR
RDAALKFFVY TMAGSIGMLL LFQFFYVATA AAGIPTFDLI TLARLGQGLT VERAALDPVN
LTLQEIIFNY VEQLGIADVL GRYPLLYTSI AFWAIFIAFA IKLGIWPFHT WLPDAYSEAP
PAASILLAAV MSKMGAYGML RLMLPLVPDA AQYFGPAIGA LALIGVVAGA FGALGQVGGD
LKRLIAYTSV NHMGYVGLAI AAAATVGAAD VATRATAING ALFQMVAHGL STGALFLMAG
MLAERTGSDD MRSLAGLRTT MPVFAGAMGV ATFANLGLPG LAGFVGEFFI FRGVWASLPL
FALLATIGLV VTALALLRMY GQMFHGQTNE RSAMPDMRLA GREFLAVAPL LIALLILGIY
PAPIMDLSNQ TATALARVFL P