Gene RoseRS_3675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3675 
Symbol 
ID5210654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4604959 
End bp4606212 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content59% 
IMG OID640597269 
ProductNADH dehydrogenase I, D subunit 
Protein accessionYP_001277980 
Protein GI148657775 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0132867 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCG CCGAAATCCG CGCACGCAAT CTGAGCATTC CCGCACCCAG TCAGATCACT 
CGTCCGGCGC TTGCCGGAGA GAAAGAGACC ATGGTGCTGA ATATGGGTCC CCACCACCCA
AGCACCCACG GCGTGCTGCG ACTGGTCGTT GAACTGGATG GCGAAACGGT TGTCGATGTT
GCGCCAGACA TCGGGTTCCT GCACACCGGC ATCGAAAAAA CGATGGAGAG CAAGACGTAC
CAGAAAGCGG TGGTGCTGAC CGACCGCACC GATTATCTGG CGCCGCTCTC CAATAATCTG
AGTTATGTGC TGGCAGTCGA AAAACTGCTC GGTTGCGAAG TTCCAGAGCG CGCTACCGTT
GCACGGGTGC TGCTGGTGGA ACTGCAACGG ATCGCCAGCC ATCTGGTATG GCTTGGCACC
CATGCGCTCG ATCTGGCGGC GATGAGCGTC TTCCTCTACG GTTTCCGTGA ACGCGAACAG
ATCCTCGATA TTTTCGAACT GGTCTCAGGC GCACGTATGA TGACCAGTTA CTTCCGGGTT
GGAGGGCTGG CGTATGATCT GCCTGCCGGG TTCGATGCAG CCGTCGAGGC ATTTCTGCAG
ATCATGCCGG GGCGGATCGA TGAATACGAA GCTCTGTTGA CCGACAATCC GCTGTGGATC
GAGCGCACGC AGGGGATCGG CGCGATCGAC AGCGAAGCGG CGATTGCCCT GGGATTGACC
GGACCGGGCT TGCGCGCCAC CGGAGTAGCG TGGGACCTGC GTAAAACGAT GCCGTACTGC
GGCTACGAAA CCTATTCGTT CGCCATTCCG ACCGCCACTC ACGGCGATAT TTATGACCGC
TACCTGGTGC GGATGGCGGA GATGCGCGAA AGCGTCTCTA TCTGCCGCCA GGCGTTGCAA
CGTCTGCGCG ATATAGGTCC CGGACCCTAC ATGACGTCGG ATCGCAAAAT CGCGCCGCCG
CCGAAGAGCG AAATCACGCA GAGCATGGAA GCGCTCATCC ACCATTTCAA ACTATGGACG
GAAGGATTCA AACCGCCGCG CGGCGACGCA CTGGCAGCAG TTGAATCACC GCGTGGAGAA
CTTGCAACCT ACATCGTCAG CGATGGCAGC GCCAAACCCT ACCGTGTCCA CTTCCGTGCG
CCTTCGTTCG TCAACCTGCA ATCGCTGCCC CACATGGCGC GCGGTCATCT TGTCGCCGAC
CTGGTGGCGC TGATTGCATC CCTCGACCCG GTACTCGGAG AAGTTGATCG ATAA
 
Protein sequence
MTVAEIRARN LSIPAPSQIT RPALAGEKET MVLNMGPHHP STHGVLRLVV ELDGETVVDV 
APDIGFLHTG IEKTMESKTY QKAVVLTDRT DYLAPLSNNL SYVLAVEKLL GCEVPERATV
ARVLLVELQR IASHLVWLGT HALDLAAMSV FLYGFREREQ ILDIFELVSG ARMMTSYFRV
GGLAYDLPAG FDAAVEAFLQ IMPGRIDEYE ALLTDNPLWI ERTQGIGAID SEAAIALGLT
GPGLRATGVA WDLRKTMPYC GYETYSFAIP TATHGDIYDR YLVRMAEMRE SVSICRQALQ
RLRDIGPGPY MTSDRKIAPP PKSEITQSME ALIHHFKLWT EGFKPPRGDA LAAVESPRGE
LATYIVSDGS AKPYRVHFRA PSFVNLQSLP HMARGHLVAD LVALIASLDP VLGEVDR