Gene Rsph17025_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2001 
Symbol 
ID5082365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2044038 
End bp2045333 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content67% 
IMG OID640483563 
ProductNADH dehydrogenase I subunit F 
Protein accessionYP_001168197 
Protein GI146278038 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.675506 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAGG ACCAGGACCG GATCTTCACC AACCTCTACG GGATGGGCGA CCGCAGCCTG 
AAGGGCGCGA TGGCGCGCGG CCAGTGGGAC GGGACGGCCG ACCTGCTCGC GCTCGGCCGC
GACCGGATCA TCGACATCGT GAAGACCTCC GGCCTGCGCG GCCGGGGCGG CGCGGGCTTC
CCGACCGGCC TCAAATGGTC CTTCATGCCC AAGCAGTCGG ACGGCCGCCC GTCCTACCTT
GTGATCAACG CCGACGAATC CGAGCCCGCG ACCTGCAAGG ACCGCGAGAT CATGCGGCAC
GACCCCCACA CGCTGATCGA GGGCGCGCTG CTCTCGGGCT TCGCGATGGG GGCGGTCGCG
GCCTACATCT ACATCCGCGG CGAATATATC CGCGAGAAGG AGGCCCTGCA GGCCGCCATC
GACGAGGCCT ATGACGCGGG CCTGATCGGC CGGAACGCCG CGAAGTCGGG CTACGATTTC
GACATCTACC TGCATCACGG CGCGGGCGCC TACATCTGCG GCGAAGAGAC CGCGCTGCTG
GAAAGCCTCG AAGGCAAGAA GGGGATGCCG CGGATGAAGC CGCCGTTCCC GGCCGGCTCG
GGCCTTTACG GCTGCCCGAC CACGGTGAAC AACGTGGAGT CCATTGCCGT CATTCCCGCG
ATCCTGCGCC GGGGCGGCGA GTGGTTCGCG GGCTTTGGCC GGCCGAACAA CGCGGGCGTG
AAGCTCTTTG CCATGTCGGG GCATGTGAAC ACGCCCTGCG TGATCGAGGA GAGCATGTCG
ATCTCGATGA AGGAGCTGAT CGAGAAGCAT GGCGGCGGCG TGCGCGGCGG CTGGAAGAAC
CTCAAGGCGG TGATCCCCGG CGGCGCCTCC TGCCCGATCA TCCCGGCCGA GCAATGCGAA
GATGCGGTGA TGGACTATGA CGGGATGCGC GAGCTGAAGT CGAGCTTCGG CACCGCCTGC
ATGATCGTGA TGGACCAGCA GACCGACGTC ATCAAGGCGG TCTGGCGGCT GGCCAAGTTC
TTCAAGCACG AAAGCTGCGG CCAGTGTACG CCCTGCCGCG AGGGCACGGG CTGGATGATG
CGGGTCATGG ACCGCCTCGT GCGCGGCGAG GCCGAGGTTG AAGAAATCGA CATGCTGCTC
TCGGTCACGA AGCAGGTCGA GGGCCACACG ATCTGCGCGC TCGGCGATGC GGCGGCCTGG
CCGATCCAGG GTCTGATCCG GCATTACCGC GAAGAGATCG AGGACCGGAT CAAGGCGAAG
AAGACCGGGC GCATGGGCGC CATGGCGGCG GAATGA
 
Protein sequence
MLKDQDRIFT NLYGMGDRSL KGAMARGQWD GTADLLALGR DRIIDIVKTS GLRGRGGAGF 
PTGLKWSFMP KQSDGRPSYL VINADESEPA TCKDREIMRH DPHTLIEGAL LSGFAMGAVA
AYIYIRGEYI REKEALQAAI DEAYDAGLIG RNAAKSGYDF DIYLHHGAGA YICGEETALL
ESLEGKKGMP RMKPPFPAGS GLYGCPTTVN NVESIAVIPA ILRRGGEWFA GFGRPNNAGV
KLFAMSGHVN TPCVIEESMS ISMKELIEKH GGGVRGGWKN LKAVIPGGAS CPIIPAEQCE
DAVMDYDGMR ELKSSFGTAC MIVMDQQTDV IKAVWRLAKF FKHESCGQCT PCREGTGWMM
RVMDRLVRGE AEVEEIDMLL SVTKQVEGHT ICALGDAAAW PIQGLIRHYR EEIEDRIKAK
KTGRMGAMAA E