Gene Rsph17025_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2004 
Symbol 
ID5082368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2046467 
End bp2047675 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content65% 
IMG OID640483566 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_001168200 
Protein GI146278041 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.639359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCA AGTTCGACGA CGTGCTGACC GGCGAGCAGA AGCTTCGCAA CTTCAACATC 
AACTTCGGCC CGCAACACCC TGCTGCGCAC GGTGTTCTTC GCCTCGTGCT GGAACTTGAC
GGCGAGGTGG TGGAACGCTG CGATCCGCAT ATCGGCCTTC TGCACCGCGG CACCGAGAAG
CTGATGGAGA CGCGGACCTA CCTGCAGAAC CTGCCCTATT TCGACCGGCT CGATTATGTG
GCGCCGATGA ACCAGGAACA TGCCTGGTGC CTCGCCATCG AGAGGCTGAC CGGGGTGCAG
GTGCCGCGGC GGGCCAGCCT GATCCGCGTG CTCTATTCCG AGATCGGGCG CGTTCTGAAC
CACCTGCTGA ACGTGACCAC GCAGGCCATG GACGTGGGCG CGCTGACCCC GCCGCTCTGG
GGCTTCGAGG AGCGCGAGAA GCTGATGGTG TTCTACGAGC GCGCCTCGGG GGCGCGGCTC
CACGCGGCCT ATTTCCGGCC CGGGGGCGTG CACCAGGACC TGACGCCGCG CCTGATCGAG
GACATCGAGG AGTGGGCCGA GCATTTCCCG AAGGTGCTCG ACGACCTAGA CGGGCTGCTG
ACCGAGAACC GGATCTTCAA GCAGCGCAAC GTCGATATCG GCGTCGTGAC CGAGAAGGAC
ATCCTCGACT GGGGCTTCTC GGGCGTGATG GTGCGCGGGT CGGGCCTCGC CTGGGACCTG
CGCCGCTCGC AGCCCTACGA ATGCTACGAC GAGTTCGACT TCCAGATCCC GGTGGGCAAG
AACGGCGATT GCTACGACCG CTACCTCTGC CGCATGGAAG AGATGCGCCA ATCCACCCGG
ATCATCCAGC AGTGCCTTGC CAAGCTGAGG GTGGAGAAGG GGGACGTGCT GGCGCGGGGC
AAGATCACGC CGCCGCCACG GGCCGAGATG AAGACCTCGA TGGAGGCGCT CATCCACCAC
TTCAAGCTTT ACACCGAAGG CTTCCACGTC CCCGCCGGTG AGGTCTATGC CGCCGTCGAG
GCGCCCAAGG GCGAGTTCGG CGTCTATCTG GTGGCCGACG GAACCAACCG GCCCTACCGC
GCCAAGATCC GCGCGCCGGG CTTCCTGCAT CTGCAAGCCA TCGACTACAT CGCCAAGGGC
CACCTGCTGG CCGATGTGTC CGCCATCATC GGCACGCTCG ACGTCGTGTT CGGGGAGATC
GACCGCTAA
 
Protein sequence
MDTKFDDVLT GEQKLRNFNI NFGPQHPAAH GVLRLVLELD GEVVERCDPH IGLLHRGTEK 
LMETRTYLQN LPYFDRLDYV APMNQEHAWC LAIERLTGVQ VPRRASLIRV LYSEIGRVLN
HLLNVTTQAM DVGALTPPLW GFEEREKLMV FYERASGARL HAAYFRPGGV HQDLTPRLIE
DIEEWAEHFP KVLDDLDGLL TENRIFKQRN VDIGVVTEKD ILDWGFSGVM VRGSGLAWDL
RRSQPYECYD EFDFQIPVGK NGDCYDRYLC RMEEMRQSTR IIQQCLAKLR VEKGDVLARG
KITPPPRAEM KTSMEALIHH FKLYTEGFHV PAGEVYAAVE APKGEFGVYL VADGTNRPYR
AKIRAPGFLH LQAIDYIAKG HLLADVSAII GTLDVVFGEI DR