Gene Rcas_2097 details


Gene Information

Locus tag: Rcas_2097
Symbol:
ID: 5539577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2691768
End bp: 2693468
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 62%
IMG OID: 640894232
Product: proton-translocating NADH-quinone oxidoreductase, chain M
Protein accession: YP_001432201
Protein GI: 156742072
COG category: [C] Energy production and conversion
COG ID: [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M)
TIGRFAM ID: [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M
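
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length = gene_length(2691768, 2693468)
print(length)                  # 1701
print(protein_length(length))  # 566
```

Both results agree with the Gene Length (1701 bp) and Protein Length (566 aa) fields listed in this record.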


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.564619
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTACC TGGCATATGC CTCAACACCC TGGCTCACAT TGCTGATCCT GTCGCCGCTG 
ATCGGGCTGG CGCTGACCGG GCTGGCAGGT GCGCTGCGGC TCGACGACCG GACGGTGAAG
ATCGGCGCTA CGGCGTGGTC GACCATTCCC CTGGCGCTGG CGATTGTCGT CTGGGTGGGG
TTCGACCCGA ATGCGACTGC CGATGGACAG GGGGTTGTGC AGTTCGTCGA GAAGATTCCG
TGGGTGCAGG CGATCCGGGT TGATTATTTC GTCGGAGTGG ACGGAATCAG TATGCCGCTG
GTCATTCTCA CGGCGGCGAT GGCGCCGGTG GCAATGCTGG CATCGTTTGG CATCACTGGG
CGGGTGAAAC TGTACATTGC GCTCATGTTT CTGCTGGAAG CGGCGATGCT GGGATATTTC
CTGGCGCTCA ATTTCTTCTT TTTCTTTATC TTCTGGGAGT TCAGCCTCGT GCCAGCGTAT
TTCCTCATCC AGGGATGGGG TCGTCGTCAT GGCGCCGATG CCGATCCTGA GCAACGCCGC
CGTGATGCTG CGCTCAAGTT CTTCGTCTAC ACGATGGCCG GGTCGATCGG TATGCTGCTG
CTCTTCCAGT TTTTCTATGT TGCCACGGCT GCGGCCGGCA TCCCGACGTT CGACCTGATC
ACACTGGCGC GTCTGGGACA GGGGCTGACC GTCGAACGCG CGGCGCTCGA TCCGGTGAAT
TTGACGTTGC GCGAGATCAT CTTCAACTAT GTCGAGCAAC TGGGGATTGC CGCTGTACTT
GGGCGCTACC CGCTGCTCTA TACATCGATT GCGTTCTGGG CGATCTTCAT CGCATTTGCG
ATTAAGCTTG GCATCTGGCC CTTCCACACC TGGCTGCCCG ACGCCTATAG CGAAGCGCCG
ACGGCAGCAT CCATCCTGCT GGCGGCGGTG ATGTCGAAAA TGGGCGCTTA CGGCATGCTG
CGCCTGATGC TGCCCTTCGT TCCCGACGCA GCGCAGTACT TTGGTCCGGC AATCGGTGCG
CTGGCGCTGA TCGGTGTGGT GGCAGGCGCC TTCGGCGCGC TCGGTCAGGT CGGTGGCGAT
CTGAAACGCC TGATCGCCTA TACCTCGATC AACCACATGG GGTATGTCGG TCTGGCGATT
GCAGCGGCGG CGACCGTCGG GTCGGCGGAT AGCGCCACCC GAGCGACGGC GATCAATGGC
GCGCTGTTTC AGATGGTGGC GCACGGTCTT TCGACCGGTG CGCTCTTCTT GCTGGCGGGC
ATGCTCGCTG AACGCTGCGG TTCCGACGAG ATGGGGACGC TTGCCGGTTT GCGCACCACG
ATGCCGGTTT TCGCCGGAGC GATGGGGGTG GCGACGTTCG CCAATCTGGG GTTGCCCGGT
CTTGCCGGGT TCGTCGGCGA ATTCTTCATC TTCCGTGGTG TATGGGCGTC GTTGCCCCTT
TTTGCGCTCC TGGCGACGAT CGGGCTGGTC GTGACAGCGC TGGCGCTGTT GCGCATGTAT
GGGCAGCTAT TCCACGGCAA AACCAATGAG CGTAGCGCCA TGCCTGATAT GCGCCTCGCC
GGGCGCGAGT TTCTGGCGGT TGCGCCGCTG CTGATTGCAC TCCTGATTCT GGGCGTGTAC
CCAGCGCCCA TTATGGACCT GTCGAACCGG ACGGCGACCG CACTGGTGGA AGTGTTCACC
CGGGTTGTGG GAGGAGCATA G
 
Protein sequence
MTYLAYASTP WLTLLILSPL IGLALTGLAG ALRLDDRTVK IGATAWSTIP LALAIVVWVG 
FDPNATADGQ GVVQFVEKIP WVQAIRVDYF VGVDGISMPL VILTAAMAPV AMLASFGITG
RVKLYIALMF LLEAAMLGYF LALNFFFFFI FWEFSLVPAY FLIQGWGRRH GADADPEQRR
RDAALKFFVY TMAGSIGMLL LFQFFYVATA AAGIPTFDLI TLARLGQGLT VERAALDPVN
LTLREIIFNY VEQLGIAAVL GRYPLLYTSI AFWAIFIAFA IKLGIWPFHT WLPDAYSEAP
TAASILLAAV MSKMGAYGML RLMLPFVPDA AQYFGPAIGA LALIGVVAGA FGALGQVGGD
LKRLIAYTSI NHMGYVGLAI AAAATVGSAD SATRATAING ALFQMVAHGL STGALFLLAG
MLAERCGSDE MGTLAGLRTT MPVFAGAMGV ATFANLGLPG LAGFVGEFFI FRGVWASLPL
FALLATIGLV VTALALLRMY GQLFHGKTNE RSAMPDMRLA GREFLAVAPL LIALLILGVY
PAPIMDLSNR TATALVEVFT RVVGGA
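
The sequences in this record are printed in groups of ten residues, so the grouping whitespace has to be removed before any downstream use. The following is a minimal sketch of that clean-up plus a GC-fraction calculation. The helper names `normalize` and `gc_fraction` are hypothetical, and the two fragments are copied from the first two groups and the last line of the gene sequence in this record, not the full sequence.

```python
def normalize(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Fragments taken from the gene sequence in this record.
first_groups = normalize("ATGACCTACC TGGCATATGC")
last_groups = normalize("CGGGTTGTGG GAGGAGCATA G")

print(len(first_groups))          # 20
print(gc_fraction(first_groups))  # 0.5
print(first_groups[:3])           # ATG (start codon)
print(last_groups[-3:])           # TAG (a stop codon in translation table 11)
```

Applying `normalize` to the complete gene sequence block and passing the result to `gc_fraction` would give a value to compare against the GC content field; the figure reported by the database may be rounded.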