Gene Rsph17029_0909 details


Gene Information

Locus tag: Rsph17029_0909
Symbol:
ID: 4895547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 934584
End bp: 935804
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 67%
IMG OID: 640111494
Product: radical SAM domain-containing protein
Protein accession: YP_001042792
Protein GI: 126461678
COG category: [R] General function prediction only
COG ID: [COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif
TIGRFAM ID:
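
The length fields above are consistent with the coordinates: a 1-based inclusive start/end pair spans end - start + 1 bases, and a complete CDS that ends in a stop codon encodes one residue fewer than it has codons. A minimal Python sketch of that arithmetic follows. The function names `gene_length` and `protein_length` are hypothetical, chosen for illustration, and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(934584, 935804)
print(length)                  # 1221
print(protein_length(length))  # 406
```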


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.338541
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.891076
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAG ATCTCGCCCG CAAGCTGGCC ATCCTGTCGG ACGCCGCGAA ATACGATGCC 
TCCTGCGCCT CGAGCGGGGG CACAAGGCGC GATTCGAAGG ATGGCAAGGG GCTGGGATCC
TCGGGCGGAA GCGGCATCTG CCACGCCTAT GCGCCGGACG GGCGTTGCAT CAGCCTTCTG
AAGATCCTGA TGACCAATTT CTGCATCTTC GACTGCGCCT ATTGCGTGAA CCGCGTCTCC
TCGCGGGTCG AGCGGGCGCG GTTCTCGGTC GAAGAGGTGG TGACGCTCAC CGTCGAATTC
TACCGGCGGA ACTATATCGA GGGGCTCTTC CTCTCGTCGG GCATCATCCG CTCGCCCGAT
GACACGATGG CTGACATGGT GCGCATCGCC AAGACCCTGC GCGAGCGCGA GCATTTCCGG
GGCTACATCC ACCTCAAGAC CATTCCCGAC GCCGCGCCCG AGCTGATCGA GCAGGCGGGC
CTCTATGCCG ACCGGCTGTC GATCAATGTG GAGCTGCCGA CCGAAGCCGG GCTCGACCGC
TTCGCGCCGG AGAAGTCGGC GACCGGCATC CGCAAGGCGA TGGCCGAGGT GCGGCTGAAG
CGCGAGGCCT CGCGCGAGCC GAGCTTCTCC GGCCGCAGAC CCTCGCGCTT CGCGCCGGCC
GGTCAGTCCA CCCAGATGAT CGTGGGGGCT GACGGGGCGG ACGATGCGGC CATCCTCGGC
AATGCCTCGA CGCTTTATGC CAACTACGGT CTGAGCCGGG TCTATTACTC GGCCTTCTCA
CCCATTCCCG ATGCCTCGAA GGCGCTGCCC CTCGTGCGTC CGCCGCTCCT GCGCGAGCAT
CGGCTCTATC AGGCCGACTG GCTTCTGCGC TTCTACGGCT TCGAGGTGGG CGAGATCGCG
GACAAGGGGA TGCTCGATCT CGAGGTCGAT CCGAAGCTCG CCTGGGCGCT GGCGCATCGC
GAGGCCTTTC CGGTGGATGT GAACTGCGCC CCGCGCGAGA TGCTGCTGCG GGTGCCGGGC
TTCGGCACCA AGACGGTGGG CCGGATCCTT GCCGCGCGGG CGCACGGGGC GGTGCGCTAC
GAGCATCTGG TGGCGATGGG CGCGGTGCTG AAACAGGCGC GCCCCTTCAT CGTGGCCCCT
GGCTGGCGGC CGCAGGGGCT GGACGACGCC AGCCTGCGCG CGCGCTTCGT GCCGCCGCCG
GAACAGTTGA GCCTCTTCTG A
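
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a sketch of how a GC content figure can be derived from such a listing, the hypothetical helper `gc_content` below (not part of the source database) strips the whitespace and returns the fraction of G and C bases. It is demonstrated on the first two blocks only.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence: 10 of 20 bases are G or C.
print(gc_content("ATGAAGAAAG ATCTCGCCCG"))  # 0.5
```

Passing the full listing as one string, line breaks included, gives the value for the whole gene.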
 
Protein sequence
MKKDLARKLA ILSDAAKYDA SCASSGGTRR DSKDGKGLGS SGGSGICHAY APDGRCISLL 
KILMTNFCIF DCAYCVNRVS SRVERARFSV EEVVTLTVEF YRRNYIEGLF LSSGIIRSPD
DTMADMVRIA KTLREREHFR GYIHLKTIPD AAPELIEQAG LYADRLSINV ELPTEAGLDR
FAPEKSATGI RKAMAEVRLK REASREPSFS GRRPSRFAPA GQSTQMIVGA DGADDAAILG
NASTLYANYG LSRVYYSAFS PIPDASKALP LVRPPLLREH RLYQADWLLR FYGFEVGEIA
DKGMLDLEVD PKLAWALAHR EAFPVDVNCA PREMLLRVPG FGTKTVGRIL AARAHGAVRY
EHLVAMGAVL KQARPFIVAP GWRPQGLDDA SLRARFVPPP EQLSLF
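
The protein sequence can be reproduced from the gene sequence with translation table 11. A minimal sketch, assuming the amino acid assignments of table 11 match the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical and chosen for illustration.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAAGAAAG ATCTCGCCCG CAAGCTGGCC"))  # MKKDLARKLA
# The last line of the gene sequence is in frame and ends in the TGA stop.
print(translate("GAACAGTTGA GCCTCTTCTG A"))           # EQLSLF
```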