Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_3731 |
Symbol | |
ID | 5085597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009429 |
Strand | - |
Start bp | 633098 |
End bp | 636622 |
Gene Length | 3525 bp |
Protein Length | 1174 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640485294 |
Product | hypothetical protein |
Protein accession | YP_001169903 |
Protein GI | 146279745 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACCA CGAAACTCAA GAAGTTCGCT CAGTTTGCAC GGCGCGCGCT CATCGAACAG GTCGCAGCAA GGCTGGAGGC CGTCCGCGGG GAAGGCTCTG CAGCTCAGCG CGAGCATCCG GTGGCGTTCA AGAAGCTCCA GGCGAGCATC GCCGAGCACG GTGCTGACGA TGTCATCGAG CGGGTCGCCT ACATCTGGTT CAACCGCTTT ACCGCCCTGC GCTTCCTTGA TGTGAACGAA CTCAGCGCCG TTCGCGTAGT CTCTCCTTTG CCTGGGCAGT TCCAGCCTGA GATCCTTGCG GATGCGAAGG CCGGGAACAT CAACGAACAG CTCGTGCCTG AGGCGACCCG CCGCAAGGTC CGCGACCTGC TGGAGGGGCT GACCCCCAGC CAGGATCCTC AGGCCGAGGC GTATCGACTG CTTCTCGTCG CCATCTGCAA CTCCTGGCAG ACCACCATGC CGTTCATGTT CGAACGGATC GATGACTACA CCGAGCTTCT GCTGCCCGCC GACCTGCTCT CCAACGCCTC GATCCTCGCC TATCTGCGCG AGGCCATGAC GCCGGACGCC TGTCAGGATG TCGAGATCAT CGGCTGGCTC TACCAGTTCT ACATCTCGGA GAAGAAGGAT CAGGTCTTCG CCGCGCTGAA GAAGAACAAG AAGATCGAGG CCGAGAACAT CCCGGCCGCG ACCCAGCTCT TCACCCCGCA CTGGATCGTC AGGTATCTGG TCGAGAACTC CCTTGGGAGG CTCTGGCTGC TGAACCGGCC GGGGTCCAAG CTGGCCGAGC GGATGGACTA CTACATCGCG CCGAAAGAGC CCGAGACGGA CTTCCTGAAG ATCGGCAAGC CCGAGGAAAT CAAGGTCTGC GATCCGGCCT GCGGGTCGGG GCACATGCTG ACCTATGCCT TCGACCTGCT CTACGCGATC TACGAGGAAG AGGGGTATGA GGCCGACAAG ATCCCCGCGC TGATCTTGGC GAACAACCTG ACGGGGGTCG AGATCGACGA CCGCGCGGGG GCGCTGGCGG CCTTTGCGCT GGCGATGAAG GCGGCGGCAA AGCTGGGGCG GCGGCGGTTC CTGCGGATGG AGGCGAAGCC TGACATCTGC GTTTTGCAGG ACGTGAAGTT CACCGACGCG GAATTGCGGG ACGTGGCCGC CGTGGTCGGC AAAGACCTCT TCACCGACGA ACTGCGCGAA ACGCTGGGGC AGTTCGAGCA GGCCAAGAAC TTCGGCTCGC TGATCGTGCC GAAACTGCGC GACCCAGCCG AGACACTGCG AGTGGTGGTG GCTCGGGATT TTGACGCCGA CCTGCTGCTG CGCCCGGTTC TGGAGCGCCT TGAAAAAGTG CTGTGCATGG CCGAGGCGCT GTCGCCGAAG TATCATGTTG TGGTGGCCAA CCCCCCGTAT ATGGGCAGCA AAGGGATGAA CAAGACCCTC GAAAGCTGGG CCAAGAAACA CTTCAAGGAT AGTCGCGCGG ACCTGTATGC GATGTTCATG GAAAGGTCAT TGGCGCTTTG TGTGCGTCAT GGCTTCACTG CAATGATCAA CATGCAGTCT TGGATGTTTC TTTCAACGTT TGACAGCTTG AGAAGCAAAC TCCTGACGCA GAACGCCATT GTAAGCATGG CTCACCTGGG GGAGCGAGGG TTCGATACGA TTGGGGGCGC CGTCGTCTCA ACGACGGCTT TCGTATTGGG CAAGGCCTAC AGCCCTAGAA CGGAAGGGGT TTTTCTCCGC CTCGTTTCTG GTCGCAACGA GGCAGAAAAA TCGACTGCGG CACGCGCTGC CGTGAGCGGT GTAAACTCAG ATGTGAAATT TTGTTCTTCA GCTGCGGACT TCCAGAAGCT TCCGGGTAAT CCAATTGCGT ATTGGGCATC GGAAGCGTTT CTGCGCCTAT TTGATGAAGG CAAGCCCATC ACAGACTTCG TATCGTCTCG CGATGGCCTA ACCACTGGGG ACAACGAAGC TTTCATCAGG TATTACTGGG AGCCAGCCTT TGAAAGCGTT GGGTTTGGAT ACGCTGACGC AAAAGACTTC TGGAAATCAG GTCGGAAATT CGCCCCCCTT ATAAAAGGTG GCTCATATCG GAAATGGTTT GGAAACCTTT CCAATGTAAT CACCTATGAC AAACAGCACT ATGAAGTGCT TGCAGTGAGC GGAAACAAAC TTCCCTCAAG AGAGAAGTAT TTTCAACGAA ACCTGAACTG GACTCGTATT TCTTCGCCAA GCGGATCGTT TCGATATACC GAACCTGGTT GCATCTTCGA AAGCGCAAGT CTTTGCACAT ATTCTGACTC AGATCAGGAT ATGTTTTATG CTCTGGCAGG TGCGAACAGC TGTATCACAA TTCCACAGCT TGATCTAGTG AACCCAACTA CGAACCTGCT GTCCGGTTAT TTTGACATGC TTCACTTGCC GGAACATGAC GCCGCAGCTC GCTCAATAGC AGCAGGAAAT GCCAAGCAAC TTGTGAGCTT GGCCAAATCC GACTGGGACG CCTACGAAAC CTCTTGGGAT TTCACTACGC TCCCGCTGCT CTCGCCCGAT CACCGGGCCG GGACGCTTGA GGCCACCTTT GCACGCCTGC GCGCCCACTG GCAGGGCATG ACAGACGAGA TGAAGCGGCT CGAGGAAGAG AACAACCGCA TCTTCATCAA CGCCTATGGT CTGCAGGACG AGCTGACCCC CGAAGTGCCG ATCAAGGAGA TCACCCTCAC CTGCAACCCC GCCTATTCAA TCAGCGGCGA CAAGACCGAA ACCGAGCGCG AGGACGCGCT TCGCGTTTGC ACGATGCAGG AGTTCCTGAG CTATGCCGTG GGCTGCATGT TCGGCCGCTA CAGCCTCGAT GCGCCGGGCC TGATCCTCGC CAATCAGGGC GAAACGCTGG CCGAATACCT CGCCCGCGTG CCCGAGCCGA GCTTGCTGCC CGACGAAGAC AACGTCATCC CCGTGCTGTC GGAGGAGTTC GAGAGCTGGT TCCCCGATGA CATCGCGGAC CGCTTCCGCA AGTTCCTGCG CGTGACCTTC GGCGATGCGT ATTTCCGCGA GAACCTCGCC TTCATCGAGG CGCAGGTGGG CGACATCCGC AAATACTTCT CAAAGGCGTT CTACGACGAC CACGTGAAGC GCTACAAGAA GCGTCCGATC TACTGGATGT TCTCCAGCCC CAAGGGCACG TTCCAAGCGC TGATCTACAT GCACCGCTAC CGGCCCGACA CCGTCTCGGT CCTGCGGAAC GAGTATGTGG TCGAGTTCAT CCGCAAACTC GAGGCCGAGC GTGCAAAGCT GGCAAAGCAG TCCGACGACC CCTCGGCCAC GCAGGCACAG CGCGCCAAGG CGGAGAAAGC CATTGGCACC ATCGTGAAGC AGATAGCAGA GCTCGAGGAA TGGGAGCGTG AAGTCATCTT CCCTCTGGCG CAGGAGAAAA AGAAAATCGA TCTCGACGAG GGCGTCAAAC GGAACTACCC CCGCTTCGGC GCGGCGTTGA AACCCATCAA GGGCCTGGAG GAGGCGGATG AGTGA
|
Protein sequence | MDTTKLKKFA QFARRALIEQ VAARLEAVRG EGSAAQREHP VAFKKLQASI AEHGADDVIE RVAYIWFNRF TALRFLDVNE LSAVRVVSPL PGQFQPEILA DAKAGNINEQ LVPEATRRKV RDLLEGLTPS QDPQAEAYRL LLVAICNSWQ TTMPFMFERI DDYTELLLPA DLLSNASILA YLREAMTPDA CQDVEIIGWL YQFYISEKKD QVFAALKKNK KIEAENIPAA TQLFTPHWIV RYLVENSLGR LWLLNRPGSK LAERMDYYIA PKEPETDFLK IGKPEEIKVC DPACGSGHML TYAFDLLYAI YEEEGYEADK IPALILANNL TGVEIDDRAG ALAAFALAMK AAAKLGRRRF LRMEAKPDIC VLQDVKFTDA ELRDVAAVVG KDLFTDELRE TLGQFEQAKN FGSLIVPKLR DPAETLRVVV ARDFDADLLL RPVLERLEKV LCMAEALSPK YHVVVANPPY MGSKGMNKTL ESWAKKHFKD SRADLYAMFM ERSLALCVRH GFTAMINMQS WMFLSTFDSL RSKLLTQNAI VSMAHLGERG FDTIGGAVVS TTAFVLGKAY SPRTEGVFLR LVSGRNEAEK STAARAAVSG VNSDVKFCSS AADFQKLPGN PIAYWASEAF LRLFDEGKPI TDFVSSRDGL TTGDNEAFIR YYWEPAFESV GFGYADAKDF WKSGRKFAPL IKGGSYRKWF GNLSNVITYD KQHYEVLAVS GNKLPSREKY FQRNLNWTRI SSPSGSFRYT EPGCIFESAS LCTYSDSDQD MFYALAGANS CITIPQLDLV NPTTNLLSGY FDMLHLPEHD AAARSIAAGN AKQLVSLAKS DWDAYETSWD FTTLPLLSPD HRAGTLEATF ARLRAHWQGM TDEMKRLEEE NNRIFINAYG LQDELTPEVP IKEITLTCNP AYSISGDKTE TEREDALRVC TMQEFLSYAV GCMFGRYSLD APGLILANQG ETLAEYLARV PEPSLLPDED NVIPVLSEEF ESWFPDDIAD RFRKFLRVTF GDAYFRENLA FIEAQVGDIR KYFSKAFYDD HVKRYKKRPI YWMFSSPKGT FQALIYMHRY RPDTVSVLRN EYVVEFIRKL EAERAKLAKQ SDDPSATQAQ RAKAEKAIGT IVKQIAELEE WEREVIFPLA QEKKKIDLDE GVKRNYPRFG AALKPIKGLE EADE
|
| |