Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2764 |
Symbol | |
ID | 4897256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2906587 |
End bp | 2908290 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640113366 |
Product | heparinase II/III family protein |
Protein accession | YP_001044638 |
Protein GI | 126463524 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTGACG GCAGGGTGGC GCTCCTCGAC CGGCTTGCCG CGCGGCGCGC GCTCCTGCAC CGGCCTGCCC GCCGCTTCGC CACGATCCCC GAACCGGCCT CGCTCGGCCT GCCCGAGCGC GGACGCTGGC TGCTGGCCGG CAGCTTCCTC GTCGAGGGGC GTCTGGTGCA GGCACCGGGC GCATCCCCCT GGGAGGTGCC GGGGGCCGGG CCCGGGCAGA CGGCCGAGGC CCACGGCTTT GCCTGGCTCG ACGATCTGGC GGGGCTGGGA GACCGGCCCG CACGGGCGCT GGCGCGGGAC TGGACCTTCC GCTGGATCGC CCGCTTCGGC CGGGGCCGCG GACCCGGCTG GACGCCTGCG CTCGCGGCGC GCCGTCTCGG CCGCTGGATG AGCCATGCGG CCTTGCTGAC CGAGGCGCAG GGCCGCGATC AGGCGACGCT GGCCCGCGCC GCCTCGCAGA CGCTGCGCTT TCTCGACCGC CGCTGGACGA CGGCGCGCGG CCCGGCCCGG ATCGAGACGC TTGCGGCCAT CCTGCGCGCG GGCCTCGCGC TCGAGGGGAT GGAGCGCCAT GTCGGCGCCG CCGCCACCGC ATTGGGCCGC GAGGCCGAGG CGCAGGTCGA TGCCCATGGC GCCATCGCCT CGCGCAGCCC CGAGGATCTG GCCCTGCTTT TTGCGCTGCT GGCCGAGGCC GAGGCCGCCC TGACGGCCGC GGGCCGCCCC GTGGCCGAGC CGCACCGGGC CGCCCTGCGC CGGATCGCGC CGGTGCTGCG GACCCTGCGC CATTCCGACG GCGGGCTGGC CCGCTTCCAC GGCGGCGGCC GGACCGTGGC CGAACGGGTG GAGCGCGCCC TCGCGGCCGC CACAGCGCCG CCGCTGCGGA CCGAGGGCGC CGCGATGGGC TATCTGCGCC TGTCGGTGGG CCGGACGAGC GTGATCGTCG ATGCCGCCGA TCCGCCCGCG GGTCCGCACG GCCATGCTTC GACGCTCGGC TTCGAGATGA GTTCGGGGCG CAGGCCCGTC ATCATCTCCT GCGGTTCGGG CCGCGCCTGG GGGGCGGAAT GGCACCGGGC CGGGCGCGCG ACTCCCTCGC ATTCGACCCT CGCCATCGAG GGATTTTCCT CCTCGCGGCT CGGCCGGTCG GGCGAGGAGA TGACCGAGCG CGCCCGCGTC CTTTCCGCAC ACCGCCAGCG CAGCGCGGCG GGCGCGCAGC TCAGCCTCGT CCATGCCGGC TGGGCCGAGA CGCACGGGCT GACCCACCGG CGCGAGATTC TGCTGGCGCC GGACGGACGC AGCCTCTCGG GCGCGGATAC GCTGGCCGCA CTGACCGCGG CCGAGAAGAA ACGGCTCGAG ACCGTGCTGA AGAGCGCCGG AGGGCAGGGC GTGCGCTTCG CGCTGCGCTT TCACCTCCAT CCCGACGTGC GGCCCACGGT CGAGGGCGGT GGGCTCTGCG TGGGTCTCAG GCTCGCCAGC GGCGAGGGCT GGAGCTTCCA GTTCGAGGGC GAGGCGCGCC TGACGCTCGA TCCCTCGGTC TATCTCGACC GCACCCATCT GTTGCCGCGG GCGACGAAAC AGATCGTGCT GCATGGGGTT CTTGTGAGTT CCGAAGCCCG GATCGGCTGG ACCTTCGCGA AGACAGAGGA TACTCCGCTG GCCATTCGTG ACCTGGACCG GCTCGATCTG CCCGACCCGC CCGATCGACC GTGA
|
Protein sequence | MSDGRVALLD RLAARRALLH RPARRFATIP EPASLGLPER GRWLLAGSFL VEGRLVQAPG ASPWEVPGAG PGQTAEAHGF AWLDDLAGLG DRPARALARD WTFRWIARFG RGRGPGWTPA LAARRLGRWM SHAALLTEAQ GRDQATLARA ASQTLRFLDR RWTTARGPAR IETLAAILRA GLALEGMERH VGAAATALGR EAEAQVDAHG AIASRSPEDL ALLFALLAEA EAALTAAGRP VAEPHRAALR RIAPVLRTLR HSDGGLARFH GGGRTVAERV ERALAAATAP PLRTEGAAMG YLRLSVGRTS VIVDAADPPA GPHGHASTLG FEMSSGRRPV IISCGSGRAW GAEWHRAGRA TPSHSTLAIE GFSSSRLGRS GEEMTERARV LSAHRQRSAA GAQLSLVHAG WAETHGLTHR REILLAPDGR SLSGADTLAA LTAAEKKRLE TVLKSAGGQG VRFALRFHLH PDVRPTVEGG GLCVGLRLAS GEGWSFQFEG EARLTLDPSV YLDRTHLLPR ATKQIVLHGV LVSSEARIGW TFAKTEDTPL AIRDLDRLDL PDPPDRP
|
| |