Gene Rsph17029_1184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1184 
Symbol 
ID4895770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1228872 
End bp1229897 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content64% 
IMG OID640111770 
ProductNADH dehydrogenase subunit H 
Protein accessionYP_001043066 
Protein GI126461952 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.503975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.452124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCCG GGATGGGTAT CATCCTCACG ATCGCGGCGC AGGGCCTTCT GGTCATCGCT 
TTCGTGATGA TCTCGCTGCT GTTCCTCGTC TATGGCGACC GGAAGATCTG GGCGGCGGTG
CAGCTGCGCC GCGGGCCGAA CGTCGTGGGC GCCTTCGGCC TGCTGCAGAC GGTGGCGGAT
GCGGCCAAAT ACATCTTCAA GGAAGTGGTG GTGCCCGCGG GCGTGGACCG CCCGGTCTTC
TTCCTCGCGC CGCTCATCTC CTTCGTGCTG GCCGTGCTCG CCTGGGCCGT GATCCCCTTC
AGCCCGGGCT GGGTGCTGTC GGACATCAAC GTGGCGATCC TCTTCGTCTT CGCCGCCTCC
TCGCTCGAGG TCTATGGCGT CATCATGGGC GGCTGGGCCT CGAACTCGAA ATATCCGTTC
CTGGGCAGCC TCCGCTCGGC CGCGCAGATG ATCTCCTACG AGGTCTCGCT CGGCCTCATC
ATCATCGGGA TCATCATCTC GACCGGCTCG ATGAACCTGA GCCATATCGT CGAGGCGCAG
GACGGCGCCT TCGGGCTCTT CAACTGGTAC TGGCTGCCGC ACCTGCCGAT GGTGGCGCTG
TTCTTCATCT CGGCGCTGGC CGAAACGAAC CGCCCGCCCT TCGACCTGCC GGAGGCGGAA
TCCGAACTGG TCGCGGGCTT CCAGGTGGAA TACAGCTCGA CGCCGTTCCT GCTGTTCATG
GCCGGCGAAT ATATCGCCAT CTTCCTCATG TGCGCGTTGA TGAGCCTGCT GTTCTTCGGC
GGCTGGCTCT CGCCCATCCC CGGACTGCCC GACGGCGTGT TCTGGATGGT GGCGAAGATG
GCCTTCTTCT TCTTCCTCTT CGCCATGGTG AAAGCCATCG TGCCGCGCTA CCGCTACGAC
CAGCTCATGC GGATCGGCTG GAAGGTCTTC CTTCCCTTCA GCCTCGGCTG GGTGGTTCTG
GTGGCGTTCC TTGCGAAATT CGAAGTGTTC GGCGGCTTCT GGGCCCGCTG GGCGATGGGA
GGCTGA
 
Protein sequence
MNSGMGIILT IAAQGLLVIA FVMISLLFLV YGDRKIWAAV QLRRGPNVVG AFGLLQTVAD 
AAKYIFKEVV VPAGVDRPVF FLAPLISFVL AVLAWAVIPF SPGWVLSDIN VAILFVFAAS
SLEVYGVIMG GWASNSKYPF LGSLRSAAQM ISYEVSLGLI IIGIIISTGS MNLSHIVEAQ
DGAFGLFNWY WLPHLPMVAL FFISALAETN RPPFDLPEAE SELVAGFQVE YSSTPFLLFM
AGEYIAIFLM CALMSLLFFG GWLSPIPGLP DGVFWMVAKM AFFFFLFAMV KAIVPRYRYD
QLMRIGWKVF LPFSLGWVVL VAFLAKFEVF GGFWARWAMG G