Gene Rsph17025_1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1998 
Symbol 
ID5082362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2040466 
End bp2041503 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content64% 
IMG OID640483560 
ProductNADH dehydrogenase subunit H 
Protein accessionYP_001168194 
Protein GI146278035 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.222188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAGT TCATGAATTC GGGGATGGGC ATCATCCTCA CGATCGCGGC GCAGGGGCTT 
CTGGTCATAG CCTTCGTGAT GATCTCGCTT CTGTTCCTCG TCTATGGCGA CCGCAAGATC
TGGGCGGCGG TGCAGATGCG GCGCGGCCCG AACGTGGTGG GCGCCTTCGG CCTGCTGCAG
ACGGTGGCCG ATGCCGCGAA ATACGTCTTC AAGGAGGTCG TGGTTCCCGC GGGCGTGGAC
CGGCCGGTGT TCTTCCTTGC GCCGCTGCTC TCCTTCGTGC TGGCGGTGCT GGCCTGGGCC
GTGATCCCCT TCAGCCCCGG CTGGGTGCTG TCGGACATCA ACGTGGCGAT CCTGTTCGTC
TTCGCCGTCT CCTCGCTCGA GGTCTATGGC GTGATCATGG GCGGCTGGGC CTCGAACTCG
AAATATCCGT TCCTCGGCTC GCTGCGTTCG GCGGCGCAGA TGATCTCGTA CGAGGTGTCG
CTGGGGCTGA TCATCATCGG GATCATCATC TCGACGGGTT CGATGAACCT GACCCACATC
GTCGAGGCGC AGGCGGGCCC GTTCGGGATC TTCAACTGGT ACTGGCTGCC GCACCTGCCG
ATGGTGGCGC TGTTCTTCAT CTCGGCGCTG GCCGAGACCA ACCGCCCGCC CTTCGACCTG
CCGGAAGCGG AATCCGAACT CGTGGCCGGC TTCCAGGTCG AATACAGCTC GACCCCGTTC
CTGCTGTTCA TGGCGGGCGA ATATATCGCC ATCTTCCTGA TGTGCGCGCT GATGAGCCTT
CTGTTCTTCG GCGGCTGGCT CTCGCCCATT CCGGGGCTGC CCGACGGCGC GCTCTGGATG
GTGCTGAAGA TGGGCTTCTT CTTCTTCCTG TTCGCGATGG TGAAGGCCAT CGTGCCGCGC
TACCGCTACG ACCAGCTCAT GCGGATCGGC TGGAAGGTGT TCCTGCCCCT CAGCCTCGCC
TGGGTGGTTC TCGTGGCGTT CCTTGCGAAA TTCGAAGTGT TCGGCGGCTT CTGGGCCCGC
TGGGCGATGG GGGGCTGA
 
Protein sequence
MDEFMNSGMG IILTIAAQGL LVIAFVMISL LFLVYGDRKI WAAVQMRRGP NVVGAFGLLQ 
TVADAAKYVF KEVVVPAGVD RPVFFLAPLL SFVLAVLAWA VIPFSPGWVL SDINVAILFV
FAVSSLEVYG VIMGGWASNS KYPFLGSLRS AAQMISYEVS LGLIIIGIII STGSMNLTHI
VEAQAGPFGI FNWYWLPHLP MVALFFISAL AETNRPPFDL PEAESELVAG FQVEYSSTPF
LLFMAGEYIA IFLMCALMSL LFFGGWLSPI PGLPDGALWM VLKMGFFFFL FAMVKAIVPR
YRYDQLMRIG WKVFLPLSLA WVVLVAFLAK FEVFGGFWAR WAMGG