Gene Rsph17029_1742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1742 
Symbol 
ID4896647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1839669 
End bp1840625 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content70% 
IMG OID640112336 
Productrespiratory-chain NADH dehydrogenase, subunit 1 
Protein accessionYP_001043624 
Protein GI126462510 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0161254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTCC TTCTGATCGC CCTCTTTTCG CTGATCCTGC TGCTGGCGCT TCTCGGAGCG 
GCGGGCGTCT TCACCTGGGG CGAGCGGCGG CTTCTGGGCT TCCTGCAGGA GCGGCTCGGG
CCGAACCGCG TGGGGCCCTT CGGCTTCCTG CAATGGGTGG CCGACACGCT GAAGCTCCTC
ACCAAGGAGG ATGCGCCACC GGCCGGGGCC GATCTCGCGG CCTACCGGCT CGCGCCCGCG
CTTGCGGCCT TCCCGATGCT CGCGGGCTTC GGCGTGGTGG CCTTCGCCCC GCGCCTCGTG
ATCTCGGACC TCGACGTGGG CGTGCTCTTC GTCATGGGGA TGCTCGCGCT GACCGTCTGG
GCGCTGGTGC TGGGCGCCTG GGGCTCGCGC AACCGCTACG CCATGCTGGG CGGCCTGCGG
GCGGCCGCGC AGATGCTGGC CTACGAGAGC TTCCTCGGCC TCTCGCTCAT GGGCTGCGTG
CTGCTCGCGG GCAGCTTCCG CATGGGCGAC ATCGTGGCGG CGCAGGAGGG CGGGCTCTGG
TTCATCCTGC TTCAGCCGCT GGGGGCCGCG CTCTTCTTCC TCGCGGGCCT CGCCGCCGCC
CACCGCCTGC CCTTCGACCT GCAGGAGTCC GAGCAGGACC TCGTGGCGGG CTTCATGACG
GAATATTCGG GGATGAGTTT CGCGCTCTTC TTCCTCGGCG AATATCTGGC GATCCTCCTC
GTGGCCGCGC TCTTCACCAC GCTCTTCCTC GGCGGCTGGG CCGGGCCGAT CCTGCCGGGG
CCTGTCTGGT TCGGGCTGAA GGTGGCCGCG ATTTCGGTCG TCTTCGTCTG GCTGCGCGCG
GCCCTGCCGC GCCCGCGCTA CGACCAGCTC ATCTCCTTCG CCTGGAAGGT GGCGCTGCCG
CTGGCGCTCC TGAACCTCTT GGTCACCGCC TGGATCGCGG TGGGGAGAGC GGCATGA
 
Protein sequence
MSVLLIALFS LILLLALLGA AGVFTWGERR LLGFLQERLG PNRVGPFGFL QWVADTLKLL 
TKEDAPPAGA DLAAYRLAPA LAAFPMLAGF GVVAFAPRLV ISDLDVGVLF VMGMLALTVW
ALVLGAWGSR NRYAMLGGLR AAAQMLAYES FLGLSLMGCV LLAGSFRMGD IVAAQEGGLW
FILLQPLGAA LFFLAGLAAA HRLPFDLQES EQDLVAGFMT EYSGMSFALF FLGEYLAILL
VAALFTTLFL GGWAGPILPG PVWFGLKVAA ISVVFVWLRA ALPRPRYDQL ISFAWKVALP
LALLNLLVTA WIAVGRAA