Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3386 |
Symbol | |
ID | 4898492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 440422 |
End bp | 441657 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640113985 |
Product | type II restriction endonuclease, putative |
Protein accession | YP_001045254 |
Protein GI | 126464141 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.174346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTACG GCTCTCTCTC GGATCATTTC ACCGGGATCG TCGCCAAGCG CCTTTCAACG GTCGAGGCGG ATACGGCGCG GTCGAATCAG CACGAGTTCA ATGGTACGAA CGAGCTGCGC CGGCTGCTGG GCGGCGAGCG CATCGAGCGC AGGCCGTCGC GCTTCATCTG GCTCGGCGGC GAGAACGAGG GGATTACCGA CGACGCGCCA GTCACATGGT ACGACGCTCG GGAGCGCCAT CCGACGCGAT CGGAATGGCG GCTCTATTTC CAAGCAAACG CCGTGACCGA GGTCGCGCAG GCCGGTGACC TTCTGGTCGT GGCCCGCCGC CCGAGCGGCG ATCTGATGTT CATTGTCGCA CCGCAGAGTT CCACTCTCGA GAACCAGATC GCCTGGCTCT TCGGGCTGGA TCACGGACTT GGCGCCGGCT TCCGCTACGA GGGTTTCGAG GGAGCCGGCG ATCGCGGCCT CGACTTCGTC AGCAATTATG TTCTTGAAGA GATCGGCATC GAGCCCGAGG TGCCGGAGGC CGATCGGCTG GATGAAATCG TCGGCCGGTT CGGGGCGCAG TTCCCCAGCT CGCGGACATT CTCCGCGCTG GCCCGGCAGA ACCTGCCCGA AGTCGATCCA CGCGACGATG CCGATGCAGC TCTGCTCGCA TGGATCGAGT TCGAGGAGGC CCTTTTCCGG CGCTTGGAGC GCCATATCGT TGCCGCGCGC TTGGAGGTTG GCTTCCTCAC GGATGGCACG GCCGATGTCG ACGGCTTTCT GCAGTTCTCA CTGTCGGTAC AGAACCGCAG AAAAAGCCGC ATGGGCCTGT CGCTCGAGAA CCATGTCGAG GAGATGTTGA CGGTACTGGG CCTCAGATAC GCACGGGGAG CGCGGACCGA GGGGAATTCG AAGCCGGATT TCCTGTTCCC CGGCGTCGCC GAGTACGCCG ATCCGGGCTA CAGCGCTGAC CGCCTGTCCA TGCTGGGAGT GAAGTCCACG CTCAAGGATC GCTGGCGCCA GGTGCTTGCA GAAGCCGCAC GCATCGACCG CAAGCACCTG CTGACGCTGG AGCCCGGCAT CTCCACCCAC CAGACGAACG AGATGATCCG TCACTCCCTG CAGCTCGTCG TGCCACGGGG CCTGCATACG ACCTACACAC CGGAACAGGC TGGGTGGCTC ATGACCGTTC GCGGTTTCCT CAATCTGGTG GCAGCACGGG AAGCTGCACG ACCCTATCGC GAATGA
|
Protein sequence | MRYGSLSDHF TGIVAKRLST VEADTARSNQ HEFNGTNELR RLLGGERIER RPSRFIWLGG ENEGITDDAP VTWYDARERH PTRSEWRLYF QANAVTEVAQ AGDLLVVARR PSGDLMFIVA PQSSTLENQI AWLFGLDHGL GAGFRYEGFE GAGDRGLDFV SNYVLEEIGI EPEVPEADRL DEIVGRFGAQ FPSSRTFSAL ARQNLPEVDP RDDADAALLA WIEFEEALFR RLERHIVAAR LEVGFLTDGT ADVDGFLQFS LSVQNRRKSR MGLSLENHVE EMLTVLGLRY ARGARTEGNS KPDFLFPGVA EYADPGYSAD RLSMLGVKST LKDRWRQVLA EAARIDRKHL LTLEPGISTH QTNEMIRHSL QLVVPRGLHT TYTPEQAGWL MTVRGFLNLV AAREAARPYR E
|
| |