Gene Rsph17029_3778 details

Gene Information

Locus tag: Rsph17029_3778
Symbol:
ID: 4898584
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 903191
End bp: 905170
Gene Length: 1980 bp
Protein Length: 659 aa
Translation table: 11
GC content: 69%
IMG OID: 640114382
Product: hypothetical protein
Protein accession: YP_001045630
Protein GI: 126464517
COG category: [R] General function prediction only
COG ID: [COG3211] Predicted phosphatase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
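
The start, end, gene length, and protein length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon (the record lists it as not spliced, and the gene sequence ends in TGA). The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(903191, 905170)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the values in this record, the sketch gives 1980 bp and 659 aa, matching the Gene Length and Protein Length fields.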


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.913258
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATC TGCGTCAGAG CCAGATCTTC CGCACCTCGA AGCTGGAGGA GGCCGACGGG 
CCGGGCCTCA ATCCGACCGC CACTCCCACG ATGGGCGACA TCATCGCCGC GCGCTTCTCG
CGCCGCGGCT TCCTCAAGGG CTCGATGGCC TCCGCTGCCA TCGCGGCGAC CGTCTCTCCG
GTGGCGCTCC TTGCCGCGGG CGAGGCGCGT GCGCAGGGCA GCTCCGCCTT CAGCTTCCCC
GAGGTCGAGG CCGGCGTCGA TGCCGACCAC CATGTGGCCG AGGGCTACGA TGCCGATGTC
CTGCTGCGCT GGGGCGACAA GGTCTTCGCC GATGCGCCGG AGTTCGATCC GCGGGCCCAG
AGCGAGGCGG CGCAGGAGCG CCAGTTCGGC TACAACAACG ACTTCGTGGG GTTCATTCCG
CTCGACGGGG CGACCGACCG CGGCCTTCTG GTCGTGAACC ACGAATATAC CAACGAACAT
CTGATGTTCC CGAACGTCGT CACCCTGAAG GACGGCGAGA TGGTGGTGGC CGATGCCACC
GCGGACCGGG CCGACATCGA GATGGCGGCC CACGGCGGCA CCGTGATCGA ACTGCGCAAG
GTGGACGGCA AATGGGCGCC GGTGCTCGAC GGGCGTCTGA ACCGCCGCAT CACCGCCAAG
ACCCGGATGC AGCTCACGGG CCCGGCGGCG GGTCATGACC GGCTGAAGAC CTCGGAGGAT
CCATCCGGCG CCGAAGTGCT CGGCACGATC AACAACTGTG CGGGCGGCGT CACCCCGTGG
GGCACCTACA TCATGGCCGA GGAGAACATC CACGGTTACT TCCTGGGCGA CCTGCCGGCC
GATCATCCGG AGGCGCGCAA CCACGGGCGG CTGGGCGTGC CCGGCGCCTC CTACCAGTGG
GGCAGGTTCC ACAAGCGTTT CGACGTGGGT CAGGAGCCCA ACGAGCCGAA CCGCTTCGGC
TGGATCGTCG AGGTCGATGT GATGGACCCC ACTTCGGTGC CGAAGAAGCG GACGGCCCTC
GGGCGCTTCA AGCACGAGGG CGCGGAAAGC GTCGTGGCGA AGGACGGCCG CGTCGTCTTC
TATCTCGGCG ACGACGAGCG CTTCGATTAT GTCTACAAGT TCGTCACCAA CGGCCGCTAC
AACCCCGACG ACCGCGCGGC CAACATGGAC CTCCTCGACG AGGGCACGCT CCATGTCGCC
CGGTTCGAGG CCGACGGCTC GATGCGGTGG ATCCCGCTCG TCCATGGCGA GGGGCCGCTC
ACGGCCGAGA ACGGTTTCGA AAGCCAGGCC GACGTGCTGA TCGAGACGCG CCGCGCGGCG
GATCTCCTCG AGGCCACGCC CATGGACCGG CCCGAGGACA TCCAGCCCAA CCCGCAGACC
GGCCGCGCCT ATGTCATGCT GACCAACAAC ACCAAGCGCA CCGAGGCCGA TGCCGCCAAC
CCGCGCGTGA AGAACGCCTT CGGCCATATC ATCGAGATCC TCGAGGCCGA CGGAGATTTC
ACGGCCACGA CCGGCCGGTG GGAGATCCTG CTCCAGTGCG GCGACCCGGC GGTGGCCGAG
GTGGGCGCGA CCTTCTCGAC CGAGACCACG AAGAACGGCT GGTTCGGCAT GCCGGACAAT
GCCGCGGTGG ATGCCGACGG CCGCCTCTGG GTCTCGACCG ACGGCAACTC GATGGCCGAT
ACCGGCCGGA CCGACGGCCT CTGGGCGGTG GACACCGAGG GCGATGCGCG CGGGACCTCG
CGCCTCTTCT ACCGGGTGCC GGTCGGGGCC GAACTCTGCG GCCCCTGCCC GACCGAGGAC
ATGAGCACCT TCTTCGTTGC GGTCCAGCAT CCGGGCGACG GCGGCGAGGA CTGGGAGGGC
CACGGCCGCC TGTCCTACTA CGAGGATCTC TCCACCCGCT GGCCGGATTT CAAGGACGAC
ATGCCGGTGC GCCCGGCCGT CGTGGCGATC ACCCGGCAGG GCGGCGGCCG CATCGGCTGA
 
Protein sequence
MTDLRQSQIF RTSKLEEADG PGLNPTATPT MGDIIAARFS RRGFLKGSMA SAAIAATVSP 
VALLAAGEAR AQGSSAFSFP EVEAGVDADH HVAEGYDADV LLRWGDKVFA DAPEFDPRAQ
SEAAQERQFG YNNDFVGFIP LDGATDRGLL VVNHEYTNEH LMFPNVVTLK DGEMVVADAT
ADRADIEMAA HGGTVIELRK VDGKWAPVLD GRLNRRITAK TRMQLTGPAA GHDRLKTSED
PSGAEVLGTI NNCAGGVTPW GTYIMAEENI HGYFLGDLPA DHPEARNHGR LGVPGASYQW
GRFHKRFDVG QEPNEPNRFG WIVEVDVMDP TSVPKKRTAL GRFKHEGAES VVAKDGRVVF
YLGDDERFDY VYKFVTNGRY NPDDRAANMD LLDEGTLHVA RFEADGSMRW IPLVHGEGPL
TAENGFESQA DVLIETRRAA DLLEATPMDR PEDIQPNPQT GRAYVMLTNN TKRTEADAAN
PRVKNAFGHI IEILEADGDF TATTGRWEIL LQCGDPAVAE VGATFSTETT KNGWFGMPDN
AAVDADGRLW VSTDGNSMAD TGRTDGLWAV DTEGDARGTS RLFYRVPVGA ELCGPCPTED
MSTFFVAVQH PGDGGEDWEG HGRLSYYEDL STRWPDFKDD MPVRPAVVAI TRQGGGRIG
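
The sequences above are printed in blocks of 10 characters separated by spaces. The following is a minimal sketch, assuming the sequence text has been pasted into a plain string, for stripping that layout and computing a GC percentage. The function names are illustrative and not part of the source database, and the method the database used for its GC content field is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short demo input.
    demo = clean_sequence("ATGACCGATC TGCGTCAGAG")
    print(len(demo))
    print(gc_percent(demo))
```

Running `clean_sequence` on the full gene sequence and taking `len` of the result is another way to confirm the 1980 bp gene length.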