Gene Rcas_3382 details


Gene Information

Locus tag: Rcas_3382
Symbol:
ID: 5540881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4410002
End bp: 4411498
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 61%
IMG OID: 640895500
Product: proton-translocating NADH-quinone oxidoreductase, chain N
Protein accession: YP_001433450
Protein GI: 156743321
COG category: [C] Energy production and conversion
COG ID: [COG1007] NADH:ubiquinone oxidoreductase subunit 2 (chain N)
TIGRFAM ID: [TIGR01770] proton-translocating NADH-quinone oxidoreductase, chain N
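
The coordinate and length fields above agree with each other, which a short Python sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4410002, 4411498)
print(length)                  # 1497
print(protein_length(length))  # 498
```

Both results match the Gene Length and Protein Length fields reported for this record.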


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGAATG ACACACAGAT GACCGAAATC GTGATTCCTC CTGTTGACTG GCGCGTGGCG 
ACGCAACTCA GCATCATCTT TGGCTGGGCG TCGGTGTTGC TGGTCATTGC CTTGTTTGTG
CCACGTTCAC GTACCCGGAT CGTCGGGTAT CTGGCGATGG TGGGCGCGAT GGTTGCTGCG
GCAGTTGGTA TTCCGCTGTG GGGCGTCAAC GCCGAGACCT TCAGCGGCAT GCTGCGGCTA
GACTCCTATA GCCTGACGCT GAACTGGTTG TTTCTGGCAG CGGCTGCCAT AACCATGGTG
CTGTCGCTCG ACTATCTGCC ACGCCAGGGA ATCGAGCGAA GCGAGTATTA CGTGCTGGTG
CTGTTCGCCA CCGGTGGCAT GATGTTGCTG GCGCAGGGCG CGGACCTGAT CATTCTCTTT
CTGGGTCTGG AACTGCTGTC AATTGTGCTG TATGTGCTGA CCGGCTTCGC CTATCCGCGC
AACGCTTCGG AAGAAGCAGG GATGAAATAT CTGCTGATCG GCGCATTCGC GGGCGGGTTC
GTCGTGTTTG GCATTGCTCT GCTGTATGGA GCGACCGGCA GCATGAATCT GCGCGCCATT
GGTGAGACGC TGGCACAACA GACGTTGACT CTCGAAGAAC GCATCTATCT GCTGGCGGGT
GCGGCACTGG TCGTCGTTGG TTTTGGCTAC AAGGTTGCAA TGGCGCCATT CCATATGTGG
GCGCCGGATG TCTATGAAGG CGCACCGACG CCGATAGCCG GGCTGCTCTC GGTCGGTAGC
AAGGCAGCAG GGTTTGCGGC GCTCCTGCGG TTCCTGGTCG AGGCGCTGGC GGGCGAGTGG
CAGATCTGGG CGCCGGTGCT GGCGGTGCTG GCGATTGCAA CACTGGCGGT CGGGAATATC
GGTGCGCTGA CGCAACGCAA CGTCAAGCGC ATGCTGGCGT ATTCGAGCAT CGGTCACGCC
GGGTACATCC TGTTCGGCGT GATTGCCGCC GGCGCGCCGG GTGGCATTGC CGGGCAACGC
GGCGTTGAAG GCGTTCTTCT GTACCTGATT GCATACACCT TTACTAATCT CGGAGCGTTT
GGCGTATTGA TCGCTCTCGA ACATCGCGGC GAAGCAGCCT GGGATATGAG CGATCTGGCA
GGGTTGTGGA GTCGGCGTCC CTGGCTGGCG GTCGCCATGG CAGTTTGTAT GCTCTCGCTG
GCTGGCGTCC CGCCGACCGG TGGTTTCTGG GGGAAGTTCT ATGTGTTCAC TGCCGCCTGG
CTGTCGGGCA TGGGCTGGAT CACAGTCATT GGGGTCATTG TTGCCGCAAT TGCGGCATTC
TACTATCTCC GCATTGTGGC GCAGATGTTC ATGGCGGAAC CGGCGCGCGA GGTGCCCTTG
CCTATGGACC GCGCCCTGCG GGCAGGTCTT GCGCTCGCAA CGCTCGGTGT GCTGATCCTG
GGCTTTCTCC CAACGCCTGC GATTGACCTG GTGCAGCGGG TGGTGTTAGG GGGTTAG
 
Protein sequence
MWNDTQMTEI VIPPVDWRVA TQLSIIFGWA SVLLVIALFV PRSRTRIVGY LAMVGAMVAA 
AVGIPLWGVN AETFSGMLRL DSYSLTLNWL FLAAAAITMV LSLDYLPRQG IERSEYYVLV
LFATGGMMLL AQGADLIILF LGLELLSIVL YVLTGFAYPR NASEEAGMKY LLIGAFAGGF
VVFGIALLYG ATGSMNLRAI GETLAQQTLT LEERIYLLAG AALVVVGFGY KVAMAPFHMW
APDVYEGAPT PIAGLLSVGS KAAGFAALLR FLVEALAGEW QIWAPVLAVL AIATLAVGNI
GALTQRNVKR MLAYSSIGHA GYILFGVIAA GAPGGIAGQR GVEGVLLYLI AYTFTNLGAF
GVLIALEHRG EAAWDMSDLA GLWSRRPWLA VAMAVCMLSL AGVPPTGGFW GKFYVFTAAW
LSGMGWITVI GVIVAAIAAF YYLRIVAQMF MAEPAREVPL PMDRALRAGL ALATLGVLIL
GFLPTPAIDL VQRVVLGG
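
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The Python sketch below shows one way to do that and to recompute a GC fraction. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, copied from the listing above.
sample = "ATGTGGAATG ACACACAGAT GACCGAAATC"
seq = clean_sequence(sample)
print(len(seq))   # 30
print(seq[:3])    # ATG
print(f"{gc_fraction(seq):.2f}")
```

Applying the same two helpers to the full gene sequence would let a reader compare the result with the GC content field reported in the Gene Information section.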