Gene RSP_2234 details


Gene Information

Locus tag: RSP_2234
Symbol:
ID: 3719763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 850877
End bp: 852097
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 68%
IMG OID: 640070406
Product: DNA-binding protein
Protein accession: YP_352290
Protein GI: 77462786
COG category: [R] General function prediction only
COG ID: [COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif
TIGRFAM ID:
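
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention explicitly, but both are consistent with the reported 1221 bp and 406 aa. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


length = gene_length(850877, 852097)
print(length)                  # 1221
print(protein_length(length))  # 406
```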


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.62693
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAG ACCTCGCCCG CAAGCTGGCT ATCCTGTCGG ACGCCGCGAA ATACGATGCC 
TCCTGCGCCT CGAGCGGGGG CACACGGCGC GATTCGAAGG ATGGCAAGGG GCTGGGATCC
TCGGGCGGAA GCGGCATCTG CCACGCCTAT GCGCCGGACG GGCGCTGCAT CAGCCTTCTG
AAGATCCTGA TGACCAATTT CTGCATCTTC GACTGCGCCT ATTGCGTGAA CCGCGTCTCC
TCGCGGGTCG AGCGGGCGCG GTTCTCGGTC GAAGAGGTGG TGACGCTCAC CGTCGAATTC
TACCGGCGGA ACTATATCGA GGGGCTCTTC CTCTCGTCGG GCATCATCCG CTCGCCCGAT
GACACGATGG CCGACATGGT GCGCATCGCC AAGACCCTGC GCGAGCGCGA GCATTTCCGG
GGCTATATCC ACCTCAAGAC CATTCCCGAC GCCGCGCCCG AGCTGATCGA GCAGGCGGGC
CTCTATGCCG ACCGGCTGTC GATCAATGTG GAGCTGCCGA CCGAAGCCGG GCTCGACCGC
TTCGCGCCGG AGAAGTCGGC CACCGGCATC CGCAAGGCGA TGGCCGAGGT GCGGCTGAAG
CGCGAGGCCT CGCGCGAGCC GAGCTTCTCC GGCCGCAGAC CCTCGCGCTT CGCGCCCGCG
GGCCAGTCCA CGCAGATGAT CGTGGGAGCG GACGGGGCGG ACGATGCAGC CATCCTCGGC
AATGCCTCGA CGCTCTATGC CAACTACGGT CTGAGCCGGG TCTATTACTC GGCCTTCTCG
CCCATTCCCG ATGCCTCGAA GGCGCTGCCC CTCGTGCGTC CGCCGCTCCT GCGCGAGCAT
CGGCTCTATC AGGCGGACTG GCTCCTGCGG TTCTACGGCT TCGAGGTGGG CGAGATCGCG
GACAAGGGGA TGCTCGATCT CGAGGTCGAT CCGAAGCTCG CCTGGGCGCT GGCGCATCGC
GAGGCCTTTC CGATGGATGT GAACCGCGCC CCGCGCGAGA TGCTGCTGCG CGTGCCGGGC
TTCGGCACCA AGACGGTGGG CCGCATCCTT GCCGCGCGGG CGCACGGGGC GGTGCGCTAC
GAGCATCTGG TGGCGATGGG CGCGGTGGTG AAACAGGCGC GCCCCTTCAT CGTGGCCCCC
GGCTGGCGGC CGCAGGGGCT GGACGACGCC AGCCTGCGCG CGCGCTTCGT GCCGCCGCCG
GAACAGTTGA GCCTCTTCTG A
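
The record reports a GC content of 68% for this gene. One possible way to recompute that figure from the sequence block is sketched below; it assumes the block is plain DNA text with spaces and line breaks used only for layout. The helper names are hypothetical, and the example call uses only the first row of the gene sequence, so its result is not the whole-gene value.

```python
def clean_sequence(raw):
    """Remove layout whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw):
    """Percentage of G and C bases in a DNA string."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence only; paste the full block to
# compare against the GC content reported in the record.
first_row = "ATGAAGAAAG ACCTCGCCCG CAAGCTGGCT ATCCTGTCGG ACGCCGCGAA ATACGATGCC"
print(round(gc_percent(first_row), 1))
```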
 
Protein sequence
MKKDLARKLA ILSDAAKYDA SCASSGGTRR DSKDGKGLGS SGGSGICHAY APDGRCISLL 
KILMTNFCIF DCAYCVNRVS SRVERARFSV EEVVTLTVEF YRRNYIEGLF LSSGIIRSPD
DTMADMVRIA KTLREREHFR GYIHLKTIPD AAPELIEQAG LYADRLSINV ELPTEAGLDR
FAPEKSATGI RKAMAEVRLK REASREPSFS GRRPSRFAPA GQSTQMIVGA DGADDAAILG
NASTLYANYG LSRVYYSAFS PIPDASKALP LVRPPLLREH RLYQADWLLR FYGFEVGEIA
DKGMLDLEVD PKLAWALAHR EAFPMDVNRA PREMLLRVPG FGTKTVGRIL AARAHGAVRY
EHLVAMGAVV KQARPFIVAP GWRPQGLDDA SLRARFVPPP EQLSLF
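
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The sketch below is one way to do that with the standard library only. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, no special start handling is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna):
    """Translate DNA in frame 1, stopping at the first stop codon.

    Layout whitespace is ignored and trailing bases that do not
    form a complete codon are dropped.
    """
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base groups of the gene sequence.
print(translate("ATGAAGAAAG ACCTCGCCCG"))  # MKKDLA
```

Passing the complete gene sequence block to `translate` and comparing the result with the protein block (after removing its spaces) gives a quick consistency check between the two.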