Gene Information |
Locus tag | Dshi_3032 |
Symbol | |
ID | 5710884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3199429 |
End bp | 3201147 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641268959 |
Product | heparinase II/III family protein |
Protein accession | YP_001534366 |
Protein GI | 159045572 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
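The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3199429
END_BP = 3201147
PROTEIN_LENGTH_AA = 572


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len: int) -> int:
    """Codon count minus the terminal stop codon (unspliced CDS assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                           # 1719
print(expected_protein_length(length))  # 572
```

Under these assumptions both derived values agree with the Gene Length (1719 bp) and Protein Length (572 aa) fields of the record.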
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.310136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGGGCA GTCTGCAACA GGACCTTTCG CGGTCGACGG GCTCGCGTTC CCTCGGCCAC AGGGTCATGG ACGGGTTCCA TGCCCGGCTG GCGCGGTTCG GGCGTCCGGC CACGGCCTTC GTGTCGACGC CGGAGCCGAC GAGCTTCGGC GATCCGGCGC GGGGGCGGCA GTTGCTGGCG GGCAATCACC TGATCAATGG CGAGGTGATC ACCGCGCCGG GCGTGTCGCT GTGGGAGATT GCCGAGAACG CGCCGGGGCG CGATGCGCTG CTGCTGCATG ATTTCAAATG GCTGGATGAT CTCGCGGCCC TGGGCGGCCC GGCGGCACAA GGGCTGGCGC AGACCGCGCT GGGCGAGTGG ATCGCGCGTT ACGGGACCGG GCGTGGGCCG GGCTGGACCC CGGACCTGAC CGCGCGGCGG ATGATGCGCT GGATTTCCCA TGCGGTGATG TTGCTGGGCG GGGAAGGATC GCGCCTGTCG CCGGGGTTCT TCCGGTCGCT CGGGCAGCAG GGCGTATTTC TCAGCCGCCG CTGGAAGACC GCCGCGTCGG GGCGCGCGCG GTTCGAGGCG CTGGCCGGGA TGATTTACGC GGGGATGAGC CTGCGCGGGC TGGAGGGGCG GGTGGATGCC GCCGCCGCCG CCCTCGACCG GGCCTGCGCG GCGGAAATCG ACGGGCAGGG GGGGCTCGCC TCTCGCAACC CCGAAGACAT GCTGGAGGTG CTGCTCCTGC TGACCTGGGC CGCGGACGCG TTGGAGGAGC GGGGCCGCAG TCCGAGCGTG GAGCAGGTGC GCGCCATCGC CCGGATCGCG CCGTCCTTGC GTGCGCTGCG TCACGGCGAT GGCGGGCTGG TACGTTGCCA CGGCGGCGGG CGCGGCGCGG CAGAGCAGGT GGCGCAGGCG CTGGCCCGAA CCGGGATTGT CGCGACCTCG GATGCGACGC TGGTGATGGG CTATGCCCGG CTGGCGGCGG GCAGCACCAC GCTGATCCTC GATGCCGCGG TGCCGCCGCG GGGGGCAGGG GCGCTCTCGG CCCATGCCAG CACGCTCGCC TTCGAGATGA CCTCGGGCCG GTTCCCGATC GTGATCAATT GCGGCTCGGG GGCGAGTTTC GGGGCGGAGT GGCGCCGGGC CGGGCGTGCG ACTGCGTCCC ATTCCACCCT GGGGATCGAC GGCACCTCTT CCTCCCGGCT GGAGCCGCGG GCGCGGCGGC GGGGCCAGGA GGAGGCCCTT CTCGACGTGC CCACGCGGGT GACGGTGGAG GCGTTCGATA CGTCGCCGCA TCTGCGCGGG CTGCTGCTGA CCCATAACGG TTATGACCCG ACCCACGGTT TGACCCATGC AAGGCAGGTG CAGTTGAGGG CCGATGGGCG GATGCTGGAG GGCACCGATA CGCTGGCTGC GCTGGAGCCC GGCGCACGCC TGCGCTGCGA CCGGGCGATG ACGGCGGCGG GCGGCGCGGG GCTGGCCTTC ACGGTGCGGT TCCACCTGGC CCCCGAGGTG GAGGCGCAGG TCGACATGGG GGGCACCGCG GTGTCGTTGC TGCCGGGCGG TGACGAGGTG TGGATCTTCC GGGCGCGGGG GGCGCGGATG GCGCTTGCCC CCAGCGTCGC GCTCGAACGC GGGCGGCTGA AGCCCCGCGC TACCCGGCAG ATCGTGCTGA CCGGGCGGCT TCTGGATTAC GCCACGGAGC TGGACTGGTC GTTCACCCGG GTGAGCTGA
Protein sequence | MAGSLQQDLS RSTGSRSLGH RVMDGFHARL ARFGRPATAF VSTPEPTSFG DPARGRQLLA GNHLINGEVI TAPGVSLWEI AENAPGRDAL LLHDFKWLDD LAALGGPAAQ GLAQTALGEW IARYGTGRGP GWTPDLTARR MMRWISHAVM LLGGEGSRLS PGFFRSLGQQ GVFLSRRWKT AASGRARFEA LAGMIYAGMS LRGLEGRVDA AAAALDRACA AEIDGQGGLA SRNPEDMLEV LLLLTWAADA LEERGRSPSV EQVRAIARIA PSLRALRHGD GGLVRCHGGG RGAAEQVAQA LARTGIVATS DATLVMGYAR LAAGSTTLIL DAAVPPRGAG ALSAHASTLA FEMTSGRFPI VINCGSGASF GAEWRRAGRA TASHSTLGID GTSSSRLEPR ARRRGQEEAL LDVPTRVTVE AFDTSPHLRG LLLTHNGYDP THGLTHARQV QLRADGRMLE GTDTLAALEP GARLRCDRAM TAAGGAGLAF TVRFHLAPEV EAQVDMGGTA VSLLPGGDEV WIFRARGARM ALAPSVALER GRLKPRATRQ IVLTGRLLDY ATELDWSFTR VS