Gene Rsph17025_4052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4052 
Symbol 
ID5086225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp91268 
End bp94330 
Gene Length3063 bp 
Protein Length1020 aa 
Translation table11 
GC content70% 
IMG OID640485615 
Producthypothetical protein 
Protein accessionYP_001170209 
Protein GI146280052 
COG category[S] Function unknown 
COG ID[COG4457] Uncharacterized protein conserved in bacteria, putative virulence factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0348037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.215609 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCCG ACAGGAAAGA ACGCCTCCGC CTTCTGGTCA ATTGGCACGA CGAGATCACG 
CTGGTGCCCT TCTCGGGCAT CCAGATCCTC GATTTCGGCT TCCGCATGGA TGCGCTGACG
CTGCGCCCGC TGCGCTTCAT CGAGCGCACG GTCAGCGCCG GGCCCGACCG GAGCGAGCGG
ATGCTGATCC CGCTCAGCGG CCGCGAGGAG CATGACGCGC CGATCGAATC CGACGCCCGT
CCCGACGACG ACGAATATTC CATCCGCCCC ACCGCCGCGC TCGAGCCCTT CCTGGCCAAG
TGGGTGCCGG TGCCGGTGCT GCGCATCAAG AGCGAGCGCG GGCCGGGGGG CGAGGAGCGG
TTCGATCCGG GCCCCTCGAG CTGGGCGCGG ATGCGCACGG TGGAGCTGGC CGAGCCCGAT
CCCGAGACCG GCTTCACCCA CCGGGTGCAG CTGGCCCTCG ACACCACCCT CGTCGCCCAG
GACCAGAGCC GCCACTATGT CGCCCCCGAG CGCGCCGACG CCGAGAAGCC GCGCGACTTC
CGCTTCGTCT CGGATCCGGC GGTGATGGAC TGGTTCCTGC GCCGGCTCGA GGAGGGCGAC
GACGGCACGA TGATCGACCT GCAGCTCTGG GCCTCGGACT GGCTGAAGGA GCTGTTCCTC
GCCTTCAAGC GCGCCGAGCG GCCGGGCCGC ACCGTCACCG AGGACAGCCT GCCGCACCAG
TTCGAGCACT GGGCGCGCTA CCTGGCCTAT CTGCAGACCA TCGACCACGC CGTCCGGGTG
CCGCGGATGC GCTTCGTCAA CACCGTCTCC GAGCGCGATG CGGTGACGCC GGTCGATGTG
GATCTGGTGC TCGACGTGGG CAATTCGCGC ACCTGCGGCA TCCTGATCGA GCGCTTTCCC
GGCGAGGGGC GGGTGGATCT GGTGCGCTCC TTCCCGCTCG AGATCCGCGA CCTCTCGCGC
CCCGAGCTGC ATTATTCCGG CCTGTTCGAG AGCCGCGTGG AATTCGCCGA GCTGAAGTTC
GGCGAGGATC ATTTCGCCAG CCGCTCGGGC CGGCGCAACG CCTTCGTCTG GCCGGGTTTC
GTCCGGGTGG GCCCCGAGGC GCTGCGGCTG ATCCAGGGCG AGGAGGGCAC CGAGACCTCC
TCGGGCCTCT CCTCGCCCAA GCGCTACATC TGGGACGACG AGGCGCGCCA GCAGGACTGG
CGCTTCCACA ACCACCACGA TCCCAACAAC CTGCCCAAGT CGGTGCGCGC GGCCATGCGC
CAGCTGAACG AGGCGGGCGA CGTGCTCGAA CAGGTGCGCT ACGAGGAGAG CCGGCGCCTG
CGCCCGCGCG GCCGGACCGC GCCGGCGCGC GCCATCCGGC CGCGCTTCTC GCGCTCGGCG
CTCTTCGGCT TCATGCTGGC CGAGATCATC AGCCATGCGC TTGTGCAGGT GAACGACCCG
GCCTCGCGGT CGCGGCGGGC GCAGTCGGAC CTGCCGCGGC GGCTGAACCG GATCATCCTG
ACCCTGCCCA CCGCCACCTC GGTGCAGGAG CAGGCCATCA TCCGCTCGCG CGCCGAGGGG
GCGCTCCGTC TGGTCTGGAG CACGCTGGGC GTGGCCGACA CCGAGACGAG CATCTCGCGA
AGGCCCGAGC TGATCGTGGA ATGGGACGAG GCGAGCTGCA CGCAGCTCGT CTATCTCTAC
AGCGAGCTGA CGCAGAAGTT CGACGGCAAC ATCAACGCCT TCCTCCAGCT CAAGGGCCAT
CCGCGGCGGC GGGCGGGCGC GGCCGAACCC GCGCCGAGCC TGCGGCTGGC CTGCATCGAC
ATCGGCGGCG GCACCACCGA CCTGATGATC TGCACCTACT GGGGCGAGGC CAACCGGGTG
CTGCATCCCG AGCAGACCTT CCGCGAGGGC TTTCGCGTGG CGGGCGACGA TCTGGTGCAG
CGGGTGATCT CGGCCATCAT CCTGCCGCGG CTGCAGGCCT CGATCGAGGC GGCGGGCGGG
CGCTATGTCG GCGAGAAGGT GCGCGAGCTC TTTGCCGGCG ACATCGGCGG GCAGGACCAG
CAGGTGGTGC AGAAGCGCCG CCAGTTCGCG CTGCGGGTGC TGATGCCGCT TTCGGTGGCG
ATCCTCGCCC ATTGCGAGAC CGCCGACGAA TTCGACCGCT TCGACCTCGA GGTGGGGTCG
GCGCTGCGGC CGGCGGTCTC GCAGGAGATC CTCGGCTATC TCGAGGGGGC GGCGCGCGAT
CTGGGCGCCG CGGGCTGGAG CCTGGCCGAC GTGGTGCTGA CGGTCTCGCG CGAGGATGTG
GACGCCATCG CCCGCGAGGT GTTCCAGAAG GTGCTGGGCA ACATGGCCGA GGTGATCGAC
CATCTGGGCG TGGATGTGGT GCTGCTGACG GGCCGCCCCT CGCGGCTGCC CGCGGTGCGC
GCCATCGTCG AGGAGATGCT GGTGGTGCCG CCTCACCGGC TGGTCTCGAT GCACCGCTAC
AAGACCGGCC GCTGGTATCC GTTCCGCGAC CCGATCACCC AGAGGATCGG CGACCCCAAG
AGCACGGTGG CGGTGGGGGG GATGCTCATC GCCCTGTCGG AAAGCCGGAT CCCGAACTTC
AAGGTCTCGA CCGGGGCCTT CCGCATGCGC TCGACCGCGC GCTTCGTCGG CGAGATGGAC
AGCAACGGCC AGATCCGGGA CGAGCGGATC ATGTTCTCGG ATCTCGATCT CGACGCGGCC
CGGCCCGGCA CGCAGCAGAC CGCGCTGGTG CGGATGTTCG CCCCGATCCA CATCGGCTCG
CGCCAGCTTC CGCTCGAACG CTGGACGACG ACGCCGCTGT TCCGGCTCGA CTATGCCAAC
GCGGCCGCGC AGCGGCGGCC CTCGCCCATC CTCGTGACCT TCGAGAAGGC CGAGTTCGAC
GACGGCGAGG CCGAAACCTC GGAGGACCGG CTGCGGCGCG AGGCGCAGCG CGAGTTCCTG
AGGATCACCG AGGTGGAGGA CGGCGCCGGG GACGGCATGA AGACCTCCGA CCTGTGCCTC
AAGCTGCACA CGCTGGGGTT GGACGACGAA TACTGGATCG ACACCGGGGT CTTCCAGTAC
TGA
 
Protein sequence
MIADRKERLR LLVNWHDEIT LVPFSGIQIL DFGFRMDALT LRPLRFIERT VSAGPDRSER 
MLIPLSGREE HDAPIESDAR PDDDEYSIRP TAALEPFLAK WVPVPVLRIK SERGPGGEER
FDPGPSSWAR MRTVELAEPD PETGFTHRVQ LALDTTLVAQ DQSRHYVAPE RADAEKPRDF
RFVSDPAVMD WFLRRLEEGD DGTMIDLQLW ASDWLKELFL AFKRAERPGR TVTEDSLPHQ
FEHWARYLAY LQTIDHAVRV PRMRFVNTVS ERDAVTPVDV DLVLDVGNSR TCGILIERFP
GEGRVDLVRS FPLEIRDLSR PELHYSGLFE SRVEFAELKF GEDHFASRSG RRNAFVWPGF
VRVGPEALRL IQGEEGTETS SGLSSPKRYI WDDEARQQDW RFHNHHDPNN LPKSVRAAMR
QLNEAGDVLE QVRYEESRRL RPRGRTAPAR AIRPRFSRSA LFGFMLAEII SHALVQVNDP
ASRSRRAQSD LPRRLNRIIL TLPTATSVQE QAIIRSRAEG ALRLVWSTLG VADTETSISR
RPELIVEWDE ASCTQLVYLY SELTQKFDGN INAFLQLKGH PRRRAGAAEP APSLRLACID
IGGGTTDLMI CTYWGEANRV LHPEQTFREG FRVAGDDLVQ RVISAIILPR LQASIEAAGG
RYVGEKVREL FAGDIGGQDQ QVVQKRRQFA LRVLMPLSVA ILAHCETADE FDRFDLEVGS
ALRPAVSQEI LGYLEGAARD LGAAGWSLAD VVLTVSREDV DAIAREVFQK VLGNMAEVID
HLGVDVVLLT GRPSRLPAVR AIVEEMLVVP PHRLVSMHRY KTGRWYPFRD PITQRIGDPK
STVAVGGMLI ALSESRIPNF KVSTGAFRMR STARFVGEMD SNGQIRDERI MFSDLDLDAA
RPGTQQTALV RMFAPIHIGS RQLPLERWTT TPLFRLDYAN AAAQRRPSPI LVTFEKAEFD
DGEAETSEDR LRREAQREFL RITEVEDGAG DGMKTSDLCL KLHTLGLDDE YWIDTGVFQY