Gene Rcas_3391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3391 
Symbol 
ID5540890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4421168 
End bp4422523 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content59% 
IMG OID640895509 
ProductNADH-quinone oxidoreductase, F subunit 
Protein accessionYP_001433459 
Protein GI156743330 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCATA TCAGGCGCAT GCCGGTGTCA GCGCTGGAAA GCGAGTTTCT GCCTATGTCT 
CCGCTTGCCG AGCATATCGT TCTCCGCGAT CTGGACATCG AAAACATCGC TGATTTCGAC
GTCTATCTTC AGCATGGCGG TTATGAGGCA TTGCGCATCG CAGTGACCGA ACGCACTCCG
GCTGATATTG TGCAGACGGT GAAGGACTCT GGGTTGCGCG GGCGTGGCGG CGCCGGGTTT
CCTACCGGCG TGAAGTGGGG GTTTCTGCCC AAAGGGGTCT ATCCGCGCTA CCTGCTCTGC
AACTGTGATG AGAGCGAACC TGGCACCTTC AACAATCATC AGATTATCGA CCGCAACCCG
CATCAGTTAA TCGAGGGCAT TGCGATTTCC GCCTACGCCA TCGAAGCCAA TCTTGCGTAT
ATCTATATTC GCGGTGAGTT CGCCGCGGCT GCGCGTCGTC TCGAACGCGC TATTGCGCAG
GCGTATGCAC GTGGTTTCCT GGGCAGGAAT ATCTTTGGCA CGGGGTACGA CCTTGACATT
TATGTGCATC GTGGCGCAGG GGCGTACATC TGCGGCGAAG AAACAGCACT CATGGAGTCG
CTCGAAGGGA AGATCGGTCA ACCCCGCTTG CGCCCTCCCT TTCCGGCGGT CGCCGGTCTG
TACGGTAAGC CAACGATTAT CAATAACGTC GAGACGCTGA CGAATGTGCC GATGATCGTG
CGTCACGGCG CCGTCTGGTA TCGTCAGTTC GGCACGGAGA AAAGTCCGGG CACGAAGGTG
TTCTCCGTCT CCGGTCACGT GAAGCGCCCC GGCAACTATG AAGCGCCGTT CGGCACACCC
TTGCGCGAAT TGATCTTTTC TCCCGAGTAC TGCCAGGGCA TGCGCGGCAA CCATAATGTC
AAGATTGTCG TGCCTGGCGG CGCCTCAGCC GGCTGGCTCA CCGCCGATGA TCTCGATGTG
ACGATGGACT ACGAGGCGCT GGCGGCGAAG GGGAGCATGC TCGGTTCCGG CGGCGTGATT
GTGCTCGATG AACGCGTTAA CGCTGTCGAG GTGGCGTATA AGATGGACGA GTTCTTCAAG
CACGAATCGT GCGGAAAGTG TACGCCGTGC CGTGAAGGGA CGTATTTTCT GGTCAAGGTG
CTGCACCGCA TCACGCATGG TCACGGTCGC CAGGATGATA TTCCGCTCCT GCACGATGTG
TACAATCAAA TGGCGGGCAA TTGCTTCTGC CTTCTGGGGG AGAGCGCCGT CGTGCCGATC
CGCAGTGCGC TGCGTCTTTT CCCGCACGAG TTCGAGCGGG CGATCGCGCA GGCAGGCAAT
GGACGCCACG ACATCATCAC GCTGTCGGTT CACTGA
 
Protein sequence
MRHIRRMPVS ALESEFLPMS PLAEHIVLRD LDIENIADFD VYLQHGGYEA LRIAVTERTP 
ADIVQTVKDS GLRGRGGAGF PTGVKWGFLP KGVYPRYLLC NCDESEPGTF NNHQIIDRNP
HQLIEGIAIS AYAIEANLAY IYIRGEFAAA ARRLERAIAQ AYARGFLGRN IFGTGYDLDI
YVHRGAGAYI CGEETALMES LEGKIGQPRL RPPFPAVAGL YGKPTIINNV ETLTNVPMIV
RHGAVWYRQF GTEKSPGTKV FSVSGHVKRP GNYEAPFGTP LRELIFSPEY CQGMRGNHNV
KIVVPGGASA GWLTADDLDV TMDYEALAAK GSMLGSGGVI VLDERVNAVE VAYKMDEFFK
HESCGKCTPC REGTYFLVKV LHRITHGHGR QDDIPLLHDV YNQMAGNCFC LLGESAVVPI
RSALRLFPHE FERAIAQAGN GRHDIITLSV H