Gene Rsph17029_3345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3345 
Symbol 
ID4898407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp398702 
End bp400513 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content73% 
IMG OID640113944 
Productnuclease 
Protein accessionYP_001045213 
Protein GI126464100 
COG category[K] Transcription 
COG ID[COG1475] Predicted transcriptional regulators 
TIGRFAM ID[TIGR00180] ParB-like partition proteins 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.166956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0526297 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAGG ACTTCATCTC CCGAGGCGAT CTGCGCCTCA TCCCGCTCGC CGAGCTCCGG 
CTCTCGACCC TGAACAGCCG TCAGGAGATC GCGGCCGAGG AGGTCGAGGC CATGGCCGAG
AGCCTCGCCG TGGCGGGGCT TCTCCAGAAC CTGATCGGCC ATCTGACGCC TTCCGGCGTC
GAGATCGTGG GCGGCGGCAC CCGGCTCCGC GCGCTGCAGC GCCTCGCGGC CGAGGGCTGG
AGCCGGCACC CGGATCTCAT TCCGATAGAT CCGGTGCCGG TGAAGGTGAC GGCCGACCTG
CAGGAGGCGG TGGCCTGGGC GGGCACCGAG AACAGCGCCC GCTCGGCGCT GCACCCGGCC
GACGAGGTCC GCGCCTATGC CGCGATGCGC GAGCGCGGTG CGAGCCTGTC GCGGATTGCG
CGCAGCTTCG CGCGCTCCGA GGCCCATGTC GAGCGGCGCC TCAAGCTCGC GGATCTGCCG
GCCGAGGCGC TGGCGGCACT CCGCGCGAAC GAGATCTCGC TCGAGATGGC GAAGGCGCTG
ACGCTGGCGC CGAGCGGCGC GCGCTGCCTC GAAGTTCTCA CCTCGGTGCG GGGCCGCGAC
GTGCGGCCCG AGCAGGTGCG GCGCGAGCTC ACGCCCGGCA CCGTGCCCTC GACCGACCGA
CGGGCGGTCT TCGTGGGGAT CGAGGCCTAT CAGGCGGCCG GCGGAACCTC TCAGCGCGAT
CTCTTCGCCG ACCGGACGCT GCTGGAGGAC GAGGCGCTCC TCGACCGGCT CTTCGCCGAG
AAGGGGGCGG CCGAGGCGGA GCGGATCCGC GCGGAAGAGG GCTGGGAATG GGCGACATGG
GTGCCGGAAG AATATGTCTC CTGGACCGTC ACGCAGAAGC TGGTGCGGCT CTACGCGCGG
CCGGGGAAGC TCTCGGAAGG AGAGGAAGCG GAGCTCGCGG CGCTCGAGGA GCGCGAGACC
GAGGACGCCC TCGACGAGGC CGGCCGCGCG CGCCTCACGG AGCTCGAGGC TCGCAGAGAG
GGCGGCTTCA CCGACGCGCA GCGCGCTTCG GCCGGGATCT TCGTCTATTG CAGCAGCCGG
GACGGGCTCT CGGTCGAGCG CGCCTATCAG CAGCCGCGGG CGGTCCCGCG CGGCGCGGCC
GAGGCCGCGC CCGACCTGCC GCAATCGCTG ATCGAGGACC TGCACCGGAT CCGGCTCGGG
GCGCTGCAGG CGCGGCTGAT GGATCAGTCC GAGCTCATGC TCGACCTGCT GGCCTTCTCG
CTCGGCGGCG GCCTCCGCCC CTGGGCGCGC CCGCTCGCGG TCTCGCCGAC CGACCAGCCC
ATCGCGCCGG AGAAGGCCGA CGGCACGCGT TACCCGCCGC GGCTGGCGGC ACGGCTCGAA
CCGAATACGA GCCTCGGCCC GGACGGCACC CCGGCCGAGT TCGAGGCCTT CCGGGCGCTG
GGGAAGAAGC ACCGCAACCA GATCCTGACC GAGGCGCTGG CGCGAACCTT CTGCACCGGC
AGCTCCGGCC TCTCGGCGGC GCTCGCGCGC CAGCTCGGGG TGGAGGTGCG CCGGATCTGG
ACGCCGACCG CCCAGGGCTT CCTCGGGCGC TGCAGCGCGG GCTATCTCGA CCGGCTCTGG
AGCGAGCTCG TGCCGGCGGC CGAGGCGGAT CAGAGTTTCC AGAAGCTGAA GAAGGGGGAG
AAGGCGAAGC GCCTCGAGGC GCTCTTCGCC GACCCCGCCA CCCGCGAGGC CCTCGGCCTC
AACCGCGAGG ACTGCGCGAA GATCGACGCG TGGGTGCCGG CCGAGCTCGG CTTTCCGGAG
GTGACAGAAT GA
 
Protein sequence
MAKDFISRGD LRLIPLAELR LSTLNSRQEI AAEEVEAMAE SLAVAGLLQN LIGHLTPSGV 
EIVGGGTRLR ALQRLAAEGW SRHPDLIPID PVPVKVTADL QEAVAWAGTE NSARSALHPA
DEVRAYAAMR ERGASLSRIA RSFARSEAHV ERRLKLADLP AEALAALRAN EISLEMAKAL
TLAPSGARCL EVLTSVRGRD VRPEQVRREL TPGTVPSTDR RAVFVGIEAY QAAGGTSQRD
LFADRTLLED EALLDRLFAE KGAAEAERIR AEEGWEWATW VPEEYVSWTV TQKLVRLYAR
PGKLSEGEEA ELAALEERET EDALDEAGRA RLTELEARRE GGFTDAQRAS AGIFVYCSSR
DGLSVERAYQ QPRAVPRGAA EAAPDLPQSL IEDLHRIRLG ALQARLMDQS ELMLDLLAFS
LGGGLRPWAR PLAVSPTDQP IAPEKADGTR YPPRLAARLE PNTSLGPDGT PAEFEAFRAL
GKKHRNQILT EALARTFCTG SSGLSAALAR QLGVEVRRIW TPTAQGFLGR CSAGYLDRLW
SELVPAAEAD QSFQKLKKGE KAKRLEALFA DPATREALGL NREDCAKIDA WVPAELGFPE
VTE