Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3192 |
Symbol | |
ID | 4898706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 229955 |
End bp | 232741 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640113793 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001045063 |
Protein GI | 126463950 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.245022 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACGGCGG CGGAGCAATT CCAGCTGCTG GCGAAGATGG GAGAATCCCT GGCCGCCGGC GTCCAGGGTT TCCACCTCGC GATCGAGCAC GGTCACTTCT TCCTGCGCGA GACGCAGACC GGCCTCAGCC GCCTCGCGAC CGGCGCCGAT GATCCCCTGC TGCCGCTGAT CGCAGAAGGA ATCGACCGGG CCGACAGCGT CGATCTGGCG GTGGCCTTCG CGATGGAGTC CGGCGTCCAG TTGGTCGAGC CCTGGTTCCG GGACCTGCTT GGCCGCGGCG GCCGGCTGCG GATAGTCGTC GGCGACTACA TGGACGTGAC AGAACCGGCG GCGCTGCACC GGCTCGCCGA TCTCGAGGGC GCGCAGTTGC GGGTGTTCGA GACCGGCACC GGAACCTTCC ATCCCAAGGC CTGGCTCTTC CGCGCGGCCG ACCGGCAGGG GGCGGCGATC GTCGGGAGTT CCAACCTGTC GCGGACGGCG CTGACCACCG GCATCGAATG GAACCTGCAT TCCGAAGGCG CCGCCGATGT TGTCGCCCCG GCCTTCGAGG CACTGCTCGC CCATCCGCAG ACGCGACCTC TGACCCCCGA TTGGATCGCC GCCTATGCCG CCCGGCGCCG GGCGACTCCG CTGACCGATC TCGCGCAGCG CATTGTCTCG GACGAACCGC AGGCCCCGCC GCCCGAGCCG CACACCATCC AGCGCGCCGC TTTGGCCGCG CTGACGGCCA CGCGGCGGGC GGGTCACCGT ACCGGTCTCG TCGTGCTGGC GACCGGTCTG GGCAAGACCT GGCTCGCCGC CTTCGACAGC CGGCACTTTG CCCGGGTCCT CTTCGTCGCG CATCGCGAAG AGATCCTGAC CCAGGCCATG TCCTCCTTCC GCCGCATCCG CCCCGAGGCG CGGTTCGGCC GCTACGACGG CACCGAGAAG GACGATGGCG CCGAGATCCT CTTCGCCTCG ATCCAGACCC TGGGTCGGGC CAACCACCTG CGCCGCTTCG CGCCCGACGC CTTCGACTAC ATCGTGGTGG ACGAGTTCCA CCACGCTTCG GCCGGCAGCT ATCGCGGCCT TCTGGACCAT TTCACCCCGC GGTTCCTGCT CGGCCTGACC GCCACGCCGG ACCGGTCGGA CGGCGCCGAT CTGCTGGCGC TCTGCGGCGA CAACCTCGTC TATCACTGCG ATCTCTTTGA AGGGATCGAG GCCGGACTCC TGTCGCCGTT CCGCTACCTC GGCGTGCCCG ACGAGGTGGA CTACGCCCAG ATCCCTTGGC GGTCGAACCA GTTCGACCCC GAGGCGCTGG AGGCCGCGCT GGCGACGGAG GCCCGCGCCC GCAACGCGCT CGACCAGTTC CACCGCCACC GCGACGGCCC GGCGATCGGC TTCTGCTGCT CGGTACGGCA TGCCGAATTC ATGGCTACCT TCTTCCAGGC GCAGGGTCTG CGGGCCGTCG CCGTCCATTC CGGCCCAGGC TCTGCCCCGC GGGCGACCTC GCTCGACCGG CTGGGCCACG GCGAGATCGA CATTCTATTT GCGGTCGACA TGTTCAACGA GGGCGTCGAC GTGCCGAACA TCGGGACGGT GATGATGCTG CGCCCGACGG AAAGCGTGAT CCTCTGGCTC CAGCAACTTG GCCGGGGCCT GCGGCGGGTC GAGGGGAAGC TGCTGCGCGT CATCGACTAC ATCGGCAACC ACCGCGTCTT CCTGACCAAG CTGCGCGCGC TGCTGGCCGC AGGCCCCGGC GACCGCTCGC TGGCGCAACG GCTGGAGCAG GCGGCGGCCG GCACTCTGGC GCTGCCGCCG GGGTGCAGCG TCACCTACGA CCTGCGGGTG ATCGACATCC TGCGCGACCT CTTGCGGCCG AAGTCTGGGA TGGAGGATCT CGAAGCGCAA TACCGCGACT TCCGCCTCCG CCACGGCCAG CGTCCAACAG CAGCCGATAT CGCCCGGATG GGCTTCGATC CGGCCCGGAA CGGGCATGGC GGCTGGTTCG ACTTCGTCCG CGACATGGGC AATCCGATCG ATGCCCGCGC TCAGACGACG GTTGCGGGGT TGCTGCGGCA GATCGAGACG GACCGTACGC TGACGCCTCA AGCGATCGCG GCGCTGGAAA GCCTCCGCTC TGGCCGAAGC GCGTCGGAGG AAGGCCTGGC CTATTGGGAC CTGAACCCGC TCCTCCGCCG GGACGGGAGC CACCTGACGC TGACCCGGCC GGACCCGGAG GGCACCGCGG CGGCGATGAT CGTCGAACTG CTCGAGTGGC GCGCCGGGCA ACTGAGGGAG CCCGTGCTGA AGGAGCCGGC GGCGCCGTTC ATCACGGCGG GTTCGGAGCT CTGGCGCGAA TACCTACGGG AAGCGATCCC GCCACTCTTC GGCGCCACCT TCAACACCGG CAGCTGGAAC GCCGGGATCG TGCGCCTGGA GCACGACCTG ATCCTGTTGA CGACCCTGAA GAAGGGCAGC CTCTCGGCCG GGAACCACTA CAAGGACGGC TTCCTCGCTC CCGACCGCAT GCAGTGGCAG AGCCAGACCC AAACGCGTCG TGACAGCCAG ATCGGCCGAA TGCTCGCGGG GACCGAGCCC GGCGCGCGCG TCCACCTCTT CGTCCGATCG GGCAAGCTGA GAAACGGCAA GGCGGCACCC TTCCTCTACT GCGGCCAGCC CGAGTTCCTT GGCTGGGACG GCGAGAAGCC GATCACCGTC ACCTGGCGCC TGCGGGAGGC GGTGCCCCCG CACCTGCGGA CTGGGCTGGG TATCGCCTCA GACCGGAGAC ACACAGAAGA AAGCTGA
|
Protein sequence | MTAAEQFQLL AKMGESLAAG VQGFHLAIEH GHFFLRETQT GLSRLATGAD DPLLPLIAEG IDRADSVDLA VAFAMESGVQ LVEPWFRDLL GRGGRLRIVV GDYMDVTEPA ALHRLADLEG AQLRVFETGT GTFHPKAWLF RAADRQGAAI VGSSNLSRTA LTTGIEWNLH SEGAADVVAP AFEALLAHPQ TRPLTPDWIA AYAARRRATP LTDLAQRIVS DEPQAPPPEP HTIQRAALAA LTATRRAGHR TGLVVLATGL GKTWLAAFDS RHFARVLFVA HREEILTQAM SSFRRIRPEA RFGRYDGTEK DDGAEILFAS IQTLGRANHL RRFAPDAFDY IVVDEFHHAS AGSYRGLLDH FTPRFLLGLT ATPDRSDGAD LLALCGDNLV YHCDLFEGIE AGLLSPFRYL GVPDEVDYAQ IPWRSNQFDP EALEAALATE ARARNALDQF HRHRDGPAIG FCCSVRHAEF MATFFQAQGL RAVAVHSGPG SAPRATSLDR LGHGEIDILF AVDMFNEGVD VPNIGTVMML RPTESVILWL QQLGRGLRRV EGKLLRVIDY IGNHRVFLTK LRALLAAGPG DRSLAQRLEQ AAAGTLALPP GCSVTYDLRV IDILRDLLRP KSGMEDLEAQ YRDFRLRHGQ RPTAADIARM GFDPARNGHG GWFDFVRDMG NPIDARAQTT VAGLLRQIET DRTLTPQAIA ALESLRSGRS ASEEGLAYWD LNPLLRRDGS HLTLTRPDPE GTAAAMIVEL LEWRAGQLRE PVLKEPAAPF ITAGSELWRE YLREAIPPLF GATFNTGSWN AGIVRLEHDL ILLTTLKKGS LSAGNHYKDG FLAPDRMQWQ SQTQTRRDSQ IGRMLAGTEP GARVHLFVRS GKLRNGKAAP FLYCGQPEFL GWDGEKPITV TWRLREAVPP HLRTGLGIAS DRRHTEES
|
| |