Gene Rsph17029_1848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1848 
Symbol 
ID4897233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1948540 
End bp1951446 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content60% 
IMG OID640112440 
Producthypothetical protein 
Protein accessionYP_001043724 
Protein GI126462610 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA AGCTCATCGA AGTAGCCATT CCACTTGAGG CGATTAATGC GGCGTCGGCG 
CGAGAAAAAT CGATCCGACA TGGGCATCCT TCCACCCTGC ACCTTTGGTG GGCGCGGCGG
CCGCTCGCGG CCTGTCGGGC GGTGTTGTTC GCGCAGCTGG TGGACGATCC CTCGTCGCGG
GTGGACGAGC TGATGGCCGA CCCGAAGCTG CGGGCGCAGG CCGAGGTGGA GCTGCCCGCG
CGGCTGGCGG CCTGGGAGAA GAGCAAGGCG GCTGCACAGG GCGCAGCGGC GAATGCGCCA
GAGCCGACGC TGGAGGATGT GGCGGTCGAG ATTGAGCGCA GGCGGCTGTT TGCCATCATC
GAGGACTTGG TGAAGTGGGA GAACTCGACC AACGAGGAGG TGCTGGAACG CGCGCGGGCC
GAGATCCGGC GCAGCTGCGG CGGCGTGTTG CCCCCTGTCT ATGACCCGTT CTCGGGCGGC
GGGTCGATCC CGCTCGAGGC GCAGCGGTTG GGCCTACCCG CCTATGGGTC TGACCTGAAC
CCGGTGGCGG TGATGATCGG AAAGGCGATG ATCGAGATCC CGCCGAAATT CAAGAACATG
CCGCCCATCC ACCCCGGCAT CAAAGAGCGG TCGTTCTACC GCAACGCCGA AGGGCTGGCC
GAAGACGTGA AATATTACGG CGAATGGATG CGGGAGAAGG CATGGGATCG CATCGGGCAT
CTCTACCCGC AGGTGGACTT GCCGAAGGAG TACGGCGGCG GCAAGGCGAC GGTGATCGCC
TGGATCTGGG CGCGGACGGT GCCGAGCCCG GACCCGGCCT TTTCTGAAGT GCAGGTACCG
ATCGCGTCCA GTTTCCTGCT AAGCGCGAAA CCGGGAAAAG AAGCTTGGGT CGAGCCGACT
GTCGAACGAG CAACCAAGAG AATCACATAC CGGATACGCC AAAAGGGCAC AAAGGCTGAA
ATTGCAGAGG CGAAGAATGG CACAAAGGCG GGACGTGGCG CGAACTTCCG TTGCCTGATT
TCGGACACCG CCATTACACC CGACTATGTC AAGCGCATGG GCCGCGATGG GCGGATGGGC
CAGACGCTTC TAGCTATCGT GGCTGAGGGA AAAGGTGGGC GTGCTTACGT AGCGCCAACT
AACGACCAGG TCGCCACCGC CATCTCGGCA GAACCCACTT GGCGACCTGA GGCCGCGTTG
CCGAATGATC CACGGAATTT CTGGACAGTA GATTACGGAC TCACAAATTT TGGCGATCTA
TTCACAAGCC GCCAACTTGT CGCGCTGAAT ACATTAAGCG AACTGGTACA TGAAGTCCGT
GCAAAGATAG AAAGCGATGC AGCTTCTTCT GGCCTTATCG CTGATGGGAT GCCACTTCGG
GATGGGGGCA AAGGAGCCCT TGCATATGCC GAAGCCGTAA GTATCTATCT CGGCTTCCTT
ATTGGACAGG TCGCCAACCA TTGTTCAACC ATCTGTGGAT GGAACAGTCC TAACCAGCAG
ATGCGATCCA CTTTTTCGCG GCAATCCCTA CCAATGACGT GGGATTTTGC CGAGGTAAAC
GTATTTAGCG AATCCAGCGG AAGCTACCAC AGCCTCTTCA CAAGGATGGT GAAGGGATTT
GAAGTGCTTG GGGCAAGCGA TGAGAAATCT GCGATCACGC AAAGTGATGC GCAGGGCGTC
CAGTACCCCC TAGACACTGC GATCTCTACG GATCCGCCGT ACTACGACAA TATTGGCTAC
GCAGACTTGT CTGATTTCTT CTTCTGCTGG ATGAAGCCAG CATTGAGGGC AATATATCCC
GATTTGTTTT CTCTGATAAC AACACCAAAG GCGGAAGAAT TGGTGGCAAC CCCTTATCGT
CATGGAGGAA AGGATGCCGC AGAAGCCTTT TTTCTTGATG GAATGAGTCG GGCCATCGCA
CGAATGGCGG AGGCTGGAAG TGGTGCCTTT CCAGCCACAA TCTACTACGC ATTTAAGCAG
AGTGAGATTG AACAAGAGGG TATCAGCTCA ACAGGCTGGG CGACATTCGT TCAATCGGTC
ATGGACGCCG GCTATTCTGT GGTGGGAACC TGGCCTCTCC GTACAGAAAA GCCGGGACGG
ATGATTGCGG TTGGGACAAA CGCGCTCGCA AATTCTGTCG TGCTCGTCTG CCGCAAGAAG
GATGCCAAGG CTGACACGAT CACTCGCGCC GAGTTCATTC GCGCTCTGAA ACGCGAACTG
CCCCCGGCCA TCGCCGAGCT TCAAGCCGCC AGCATCGCTC CGGCCGACAT GCCGCAGTCG
GCTATCGGCC CCGGCATGGG CGTCTTCTCG CGCTACCGCG CCGTGCTCGA GGCCGACGAC
AGTGCGATGA CGGTCAAGAC CGCGCTGCAG CTGATCAATG CCGAGCTCGA CGAATATCTC
GGCGGCATCC AGGGCGAGTT CGACGCCGAT ACTCGCTTCG CCATCACCTG GTTCGAACAG
AACGGCATGG GCAAGGGAGA CTTCGGCGCC GCCGACAGCC TCGCCCGCGC CCGCGGCATC
GCAGTCGACA GCGTGAAGCA TGCCGGGATC GTCGAAAGCG CGGCGGGAAA GGTGCGCCTG
TTGAAGCGCG ACGAGCTCGA TCCCGATTGG GCGCCCGAGG AGGACGGACA TCTGACCGTC
TGGGAATGCC TGCAGCACCT CGTGCGCCTG CACGAAAAGG AGGGCCTGTC TCACGACACT
GCGGCGCTGC TGAAACGCTT CGGGCCCCAG GCCGAGGCAG TGAAGGATCT GGCCTACTGC
CTCTACGACA TCGCCGCCAA CAAGCGGCGC GAGGCCTCCG AGGCCACGGT CTACAACGCC
CTGATCGCCG ACTGGTCAGA GCTGAGCCAG ATGGCCGCCA CGGTTTCGCT TGAAGGGCGG
AACCGGCAAA CGCGATTTGA ACTGTAA
 
Protein sequence
MKKKLIEVAI PLEAINAASA REKSIRHGHP STLHLWWARR PLAACRAVLF AQLVDDPSSR 
VDELMADPKL RAQAEVELPA RLAAWEKSKA AAQGAAANAP EPTLEDVAVE IERRRLFAII
EDLVKWENST NEEVLERARA EIRRSCGGVL PPVYDPFSGG GSIPLEAQRL GLPAYGSDLN
PVAVMIGKAM IEIPPKFKNM PPIHPGIKER SFYRNAEGLA EDVKYYGEWM REKAWDRIGH
LYPQVDLPKE YGGGKATVIA WIWARTVPSP DPAFSEVQVP IASSFLLSAK PGKEAWVEPT
VERATKRITY RIRQKGTKAE IAEAKNGTKA GRGANFRCLI SDTAITPDYV KRMGRDGRMG
QTLLAIVAEG KGGRAYVAPT NDQVATAISA EPTWRPEAAL PNDPRNFWTV DYGLTNFGDL
FTSRQLVALN TLSELVHEVR AKIESDAASS GLIADGMPLR DGGKGALAYA EAVSIYLGFL
IGQVANHCST ICGWNSPNQQ MRSTFSRQSL PMTWDFAEVN VFSESSGSYH SLFTRMVKGF
EVLGASDEKS AITQSDAQGV QYPLDTAIST DPPYYDNIGY ADLSDFFFCW MKPALRAIYP
DLFSLITTPK AEELVATPYR HGGKDAAEAF FLDGMSRAIA RMAEAGSGAF PATIYYAFKQ
SEIEQEGISS TGWATFVQSV MDAGYSVVGT WPLRTEKPGR MIAVGTNALA NSVVLVCRKK
DAKADTITRA EFIRALKREL PPAIAELQAA SIAPADMPQS AIGPGMGVFS RYRAVLEADD
SAMTVKTALQ LINAELDEYL GGIQGEFDAD TRFAITWFEQ NGMGKGDFGA ADSLARARGI
AVDSVKHAGI VESAAGKVRL LKRDELDPDW APEEDGHLTV WECLQHLVRL HEKEGLSHDT
AALLKRFGPQ AEAVKDLAYC LYDIAANKRR EASEATVYNA LIADWSELSQ MAATVSLEGR
NRQTRFEL