Gene Rsph17029_3938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3938 
Symbol 
ID4898240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1071145 
End bp1072167 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content78% 
IMG OID640114541 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_001045788 
Protein GI126464675 
COG category[C] Energy production and conversion 
COG ID[COG1252] NADH dehydrogenase, FAD-containing subunit 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0505598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAGA GCCTGAGCCG CCGCCATTTC ACCCTTGGCC TAGCCGCCGC AGCCAGTCTC 
GCGACGGCAG GCGCCGCCTT CGCGCTTCGT CCGGGCGAGC GCGTCCTCGT CGTGGGCGGC
GGTCCCGCCG GGGCCGAGGC GGCGCTTGCG CTCAGGGCCG CTCATCCCCG GGCTTCGGTG
CTGCTCGTCG AGCGCGATCC GACGCGGCTT GCGCGCGAGC CCGATGAGGC GGGCCTTGCG
GGCTTCCTGC GCCCGCGCGC CGAGGCGGGG CTTGCGGCCC TGAAGGCCGC GGGCGTGGGT
CTCGCCCTCG ACGAGGTGGT GAGCGTCGAC TGGGCCGCGG GGCGGGCCGT CCTCTTCTCG
GGCCGCGATC TGGCCTTCGA CCGGCTGCTG CTCGCGCCCG GCACGGCGCC GCGCGACGAG
GCGATCCCGG GGCTCGATGC GGTGGCCCGT CACGCCTGGC CCGCCGCCTG GGGCAGCCCG
CGCGAGGCCC GACGTCTGCT CGCAGGTCTT CAGGCGCTGC CCGAGCGCGG CCATGTCGTC
CTGCGCCTGC CCGAGGGCGA GGCCGCCCAT CCCGCGGCGG CGCTCGGCCG GGCGCTGGCG
CTGGCGGGCC ATGTGGCGCG GCGGCCGGGC GCGCGGCTGA CGGTGCTCGA CGGCTCGAAG
GGCGCGGATC TCGCCCGCGC CTTCGCCGAC CGTGCCCCTG CCGAGGCGGC TGCCCGGGTG
GAGTGGGTCT CGGCCGCACA GGGCGGGCGG GTGCGCGCGG TGGATGCGCG GGCAGGGCTG
ATCGAGACCG AGGCGGGACC GATCCGCGCG GATGTGGTGA ATTTCGTGCC GGCGCTGCGG
GCGGGAACCA TCGCCGCGGC GGCGGGCCTG GCCGATGCGA GCGGCTGGTG CCCCTGCGAC
GCGGCGGGCC GGTCGGTCCT GCGGCCCGAG GCTCTGGTGC TGGGCGACGC GCGGAAGTCG
GCCCCGCGCA CCGTGGCCGA GGCGCTCCGG TCGGCGCGCG TCGCCACGGA TCACCTCGCC
TGA
 
Protein sequence
MMQSLSRRHF TLGLAAAASL ATAGAAFALR PGERVLVVGG GPAGAEAALA LRAAHPRASV 
LLVERDPTRL AREPDEAGLA GFLRPRAEAG LAALKAAGVG LALDEVVSVD WAAGRAVLFS
GRDLAFDRLL LAPGTAPRDE AIPGLDAVAR HAWPAAWGSP REARRLLAGL QALPERGHVV
LRLPEGEAAH PAAALGRALA LAGHVARRPG ARLTVLDGSK GADLARAFAD RAPAEAAARV
EWVSAAQGGR VRAVDARAGL IETEAGPIRA DVVNFVPALR AGTIAAAAGL ADASGWCPCD
AAGRSVLRPE ALVLGDARKS APRTVAEALR SARVATDHLA