Gene Information |
Locus tag | Rsph17029_3668 |
Symbol | |
ID | 4898658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 768911 |
End bp | 770437 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640114276 |
Product | sulfatase |
Protein accession | YP_001045530 |
Protein GI | 126464417 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.902858 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0745402 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCGCG CGGCGCTGAG CCTCCTCCTC CTTGGCCTTG TGCTGGCGCT GCCCGGCCAC CCGTGGGGCG CGTGGCGCTT TCCGCTCGAG CTTCCGGCGC TGGTGGCGCT TCTGGCGCTC CGGCCGGGAC GCGCGCGGGG GCTGCGGGCG CTGCTGGTTG CGGCTCTCGG GGCGGTCACG GTGCTGAAGG CGCTCGATCT CGCGCTGCAG CTGGCGTTCG GGCGGCCGTT CGATCCGGTG GCCGACCTGC CGCTTCTCGC CTCGGCCCTC GATCTGGGCG AGGGGACGCT GGGGGCGGGA GGGGCACTGC TGGCCGCGGT GCTCATCGGG GCGGTGCTGG CCGGTCTGGC GGCGGGCCTC TGGTGGGCGA GCGGGGTCTG GGCCGGACGC GGACGCCCGC TTGCTTGCGG GACGGCGGCG GCGCTGGTCC TTGCGCTGGT GGCGGCGGAT CTCGCTTGGC CCGACCGGGT GCCGGGCGCG GCCTTCACCA CGCGGCTCGT CTGGGACCAT GCCGTGACGG CCCGGCAGAC GCGCGCCGAT CTGGCGGCCT TCCGCGCGGC GGCCCGCACC GATCCGTGGG CCGGACGCAC GGATGCCTTC GCGCGCCTGG GGCCGGCGGA GCTTCGCATC CTCTTCGTCG AAAGCTACGG GCGGGCGAGC TTCGACAATC CGCTCTATGC CTCCCATGCC GCCCTCCTGC GCGCGGCAGA GAGCGGGATC GCGGCGCAGG GCCTCGCCAT GCGGTCGGGC TGGCTCGGCT CGCCGGTTGC GGGCGGGCAG AGCTGGCTTG CCCATGCCAC GCTGGCCTCT GGTCTGCGGA TCGACGGCGC CATCCGCTAC CGCGCCCTGA TCGCGAGCCC GCGCAAGACC CTGTTCGAGC TGGCGCGAGC CGCGGGGCGG GAGACGCTGG CCGTCATGCC GGGCATCACC CGCGCCTGGC CCGAGGGCGT GAGGCTCGGC TTCTCGCACA TTCTGGATGC CGAGGGGCTG GGCTACCGGG GGCGTCCCTT CAACTGGGTC ACCATGCCCG ACCAGTTCAC CCTGACGGCC TTCGACCGGC TCGGGCCCGA GGCGTCGCGG GTGGCGCAGA TCGTGCTTCT CTCGAGCCAC GCGCCCTGGG TACCCGAGCC GCGGCTGGTG CCGTGGGAGG CGGTGGACGA CGGCCGGATC TTCGACGATC AGGCTGCGGC GGGCGATCCG CCGGAGGTGG TCTGGCGCGA TCCCGACCGG GTGCGGGCCG CCTATCGCCA GTCGCTCGCC TATGCGCTGC GGACGGCAAC GGCCCATGCG GCGCGGTCGG GCGCGGGTGC CTTGACGCTG ATCCTCGGGG ATCATCCGCC CGCCCCCTTC GTCTCGGGCA TCGCGGGGCG GGACGTGCCG GCCCATCTGA TCGGCCCGCC CGAGCTTCTG GCCGCTTTCG ACGGCTGGGG CTGGACCGCG GGCCTGATCC CGGCGCCGGA CCTGCCCGTC CTGCCGATGG AGGGCTTCCG CGACCGGTTC CTGACCGCCC TCTCCGGCCC GCCATGA
Protein sequence | MARAALSLLL LGLVLALPGH PWGAWRFPLE LPALVALLAL RPGRARGLRA LLVAALGAVT VLKALDLALQ LAFGRPFDPV ADLPLLASAL DLGEGTLGAG GALLAAVLIG AVLAGLAAGL WWASGVWAGR GRPLACGTAA ALVLALVAAD LAWPDRVPGA AFTTRLVWDH AVTARQTRAD LAAFRAAART DPWAGRTDAF ARLGPAELRI LFVESYGRAS FDNPLYASHA ALLRAAESGI AAQGLAMRSG WLGSPVAGGQ SWLAHATLAS GLRIDGAIRY RALIASPRKT LFELARAAGR ETLAVMPGIT RAWPEGVRLG FSHILDAEGL GYRGRPFNWV TMPDQFTLTA FDRLGPEASR VAQIVLLSSH APWVPEPRLV PWEAVDDGRI FDDQAAAGDP PEVVWRDPDR VRAAYRQSLA YALRTATAHA ARSGAGALTL ILGDHPPAPF VSGIAGRDVP AHLIGPPELL AAFDGWGWTA GLIPAPDLPV LPMEGFRDRF LTALSGPP
| |