Gene RSP_3921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3921 
Symbol 
ID4796497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_009007 
Strand
Start bp106191 
End bp107492 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content74% 
IMG OID640103033 
Producthypothetical protein 
Protein accessionYP_001033882 
Protein GI125654688 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGACG CCTCGCCGAT CCTGTCGCTT CCCTACATCC TGCCTTCCCA GGCGCAGAAA 
CATGTGACCC ACAACGAGGC GCTGCAGCGG CTCGATGTGC TGGTCCAGCC CGCCGTGCTC
GACCGCGACC GCTCCGCGCC GCCCGCCGCC CCGGCCGCGG GGGCGCGGCA TCTGGTGGGC
CCGGGCGCCG AAGGGGCCTG GGCAGGGCGG GAGGAGGCCT TTGCGGTCTG GGACGCGGAG
GCGGCGGTCT GGCGTTTCCT CGCCCCGCAG CCGGGCTGGC AGACCTTCGT GCTGGCCGAG
GGAGCGGGGC TCGTCTTCAC CGCCCAGGGC TGGCGCACGC TGATCGGCCT TCTGCCGGAA
TTTCCCTCGC TGGGCATCGC CACCTCGGCC GATGCCACCA ACCGCCTCGC GGTGGCGGGC
CCCGCCACGC TCTTCACCCA TGCGGGTGCG AGCCACCGGA TCAAGGTCAA CAAGGCCGCG
GAGGCCGAGA CGGCGAGCCT CCTGTTCCAG TCCGACTGGT CGGGCCGGGC CGAAATCGGG
CTTGCGGGCA GCGACGACTT CGCGCTGAAG GTCAGCCCGG ACGGCACTTC CTTCCGCACC
GCGCTCAGCG CCGACCGGGC GAGCGGGCGG GTGGCGCTGC CGCAGGGGGC GGTGGTGACG
GGCAGCCTCA CCGGAAGCGC GGTGCAGGCC TCGGCCGCCG ATGCGACCCC GGGCCGGCTC
TTGACGGTGG GGGCCTTCGG GCTGGGGGCG CCGGCGCCGC TCGTCGGCAA TGCCGGGGCG
GTGGACGGCG CGCTCGCGCC GGGCTTTTAC GGCTACGACA GCGCGCAGGG CAGCAGCGGC
GGCCCTGCGG GCGTGCAGGC GGGCCTTCTC CTTCACCAGA GCCGCGGGGC GGGCGAGGTG
CAGCTCTTTC TCGTGGAGGC GGGGGGCGGG GGCCTCATGC CGGGCATCCT CTTCTCGCGC
GCCCGCGGCG AGGGCGCCTG GTCGCCCTGG GTCGCGGGCG GGATCGTCGA GAGCGCGGGC
AACGCCAACG GCCGCTACAT CCGCCATCAG GACGGGACGC AGAGCTGCTG GCAGAAGGTG
ACCACCTCGG CCTCCGCCGA TGTGGTGGCC CCCTTTCCCG CCGCCTTCTC CACCGCCACG
GGCCTCGTCA CGGTCTCGAG CGTGGTCTCG AACGGAGCCC AGGCGCTCAG CCCGCGGCTG
ACCGGGCGGA CGACGACCAG CGTCGGCGTC TCGGTCTTCA GCGCCACGAA CACGCGCCTT
GCCGCGCAGG TCGAGCTGAT CTCGATGGGC CGCTGGTATT GA
 
Protein sequence
MSDASPILSL PYILPSQAQK HVTHNEALQR LDVLVQPAVL DRDRSAPPAA PAAGARHLVG 
PGAEGAWAGR EEAFAVWDAE AAVWRFLAPQ PGWQTFVLAE GAGLVFTAQG WRTLIGLLPE
FPSLGIATSA DATNRLAVAG PATLFTHAGA SHRIKVNKAA EAETASLLFQ SDWSGRAEIG
LAGSDDFALK VSPDGTSFRT ALSADRASGR VALPQGAVVT GSLTGSAVQA SAADATPGRL
LTVGAFGLGA PAPLVGNAGA VDGALAPGFY GYDSAQGSSG GPAGVQAGLL LHQSRGAGEV
QLFLVEAGGG GLMPGILFSR ARGEGAWSPW VAGGIVESAG NANGRYIRHQ DGTQSCWQKV
TTSASADVVA PFPAAFSTAT GLVTVSSVVS NGAQALSPRL TGRTTTSVGV SVFSATNTRL
AAQVELISMG RWY