Gene Dshi_3032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3032 
Symbol 
ID5710884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3199429 
End bp3201147 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content73% 
IMG OID641268959 
Productheparinase II/III family protein 
Protein accessionYP_001534366 
Protein GI159045572 
COG category[S] Function unknown 
COG ID[COG5360] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.310136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGCA GTCTGCAACA GGACCTTTCG CGGTCGACGG GCTCGCGTTC CCTCGGCCAC 
AGGGTCATGG ACGGGTTCCA TGCCCGGCTG GCGCGGTTCG GGCGTCCGGC CACGGCCTTC
GTGTCGACGC CGGAGCCGAC GAGCTTCGGC GATCCGGCGC GGGGGCGGCA GTTGCTGGCG
GGCAATCACC TGATCAATGG CGAGGTGATC ACCGCGCCGG GCGTGTCGCT GTGGGAGATT
GCCGAGAACG CGCCGGGGCG CGATGCGCTG CTGCTGCATG ATTTCAAATG GCTGGATGAT
CTCGCGGCCC TGGGCGGCCC GGCGGCACAA GGGCTGGCGC AGACCGCGCT GGGCGAGTGG
ATCGCGCGTT ACGGGACCGG GCGTGGGCCG GGCTGGACCC CGGACCTGAC CGCGCGGCGG
ATGATGCGCT GGATTTCCCA TGCGGTGATG TTGCTGGGCG GGGAAGGATC GCGCCTGTCG
CCGGGGTTCT TCCGGTCGCT CGGGCAGCAG GGCGTATTTC TCAGCCGCCG CTGGAAGACC
GCCGCGTCGG GGCGCGCGCG GTTCGAGGCG CTGGCCGGGA TGATTTACGC GGGGATGAGC
CTGCGCGGGC TGGAGGGGCG GGTGGATGCC GCCGCCGCCG CCCTCGACCG GGCCTGCGCG
GCGGAAATCG ACGGGCAGGG GGGGCTCGCC TCTCGCAACC CCGAAGACAT GCTGGAGGTG
CTGCTCCTGC TGACCTGGGC CGCGGACGCG TTGGAGGAGC GGGGCCGCAG TCCGAGCGTG
GAGCAGGTGC GCGCCATCGC CCGGATCGCG CCGTCCTTGC GTGCGCTGCG TCACGGCGAT
GGCGGGCTGG TACGTTGCCA CGGCGGCGGG CGCGGCGCGG CAGAGCAGGT GGCGCAGGCG
CTGGCCCGAA CCGGGATTGT CGCGACCTCG GATGCGACGC TGGTGATGGG CTATGCCCGG
CTGGCGGCGG GCAGCACCAC GCTGATCCTC GATGCCGCGG TGCCGCCGCG GGGGGCAGGG
GCGCTCTCGG CCCATGCCAG CACGCTCGCC TTCGAGATGA CCTCGGGCCG GTTCCCGATC
GTGATCAATT GCGGCTCGGG GGCGAGTTTC GGGGCGGAGT GGCGCCGGGC CGGGCGTGCG
ACTGCGTCCC ATTCCACCCT GGGGATCGAC GGCACCTCTT CCTCCCGGCT GGAGCCGCGG
GCGCGGCGGC GGGGCCAGGA GGAGGCCCTT CTCGACGTGC CCACGCGGGT GACGGTGGAG
GCGTTCGATA CGTCGCCGCA TCTGCGCGGG CTGCTGCTGA CCCATAACGG TTATGACCCG
ACCCACGGTT TGACCCATGC AAGGCAGGTG CAGTTGAGGG CCGATGGGCG GATGCTGGAG
GGCACCGATA CGCTGGCTGC GCTGGAGCCC GGCGCACGCC TGCGCTGCGA CCGGGCGATG
ACGGCGGCGG GCGGCGCGGG GCTGGCCTTC ACGGTGCGGT TCCACCTGGC CCCCGAGGTG
GAGGCGCAGG TCGACATGGG GGGCACCGCG GTGTCGTTGC TGCCGGGCGG TGACGAGGTG
TGGATCTTCC GGGCGCGGGG GGCGCGGATG GCGCTTGCCC CCAGCGTCGC GCTCGAACGC
GGGCGGCTGA AGCCCCGCGC TACCCGGCAG ATCGTGCTGA CCGGGCGGCT TCTGGATTAC
GCCACGGAGC TGGACTGGTC GTTCACCCGG GTGAGCTGA
 
Protein sequence
MAGSLQQDLS RSTGSRSLGH RVMDGFHARL ARFGRPATAF VSTPEPTSFG DPARGRQLLA 
GNHLINGEVI TAPGVSLWEI AENAPGRDAL LLHDFKWLDD LAALGGPAAQ GLAQTALGEW
IARYGTGRGP GWTPDLTARR MMRWISHAVM LLGGEGSRLS PGFFRSLGQQ GVFLSRRWKT
AASGRARFEA LAGMIYAGMS LRGLEGRVDA AAAALDRACA AEIDGQGGLA SRNPEDMLEV
LLLLTWAADA LEERGRSPSV EQVRAIARIA PSLRALRHGD GGLVRCHGGG RGAAEQVAQA
LARTGIVATS DATLVMGYAR LAAGSTTLIL DAAVPPRGAG ALSAHASTLA FEMTSGRFPI
VINCGSGASF GAEWRRAGRA TASHSTLGID GTSSSRLEPR ARRRGQEEAL LDVPTRVTVE
AFDTSPHLRG LLLTHNGYDP THGLTHARQV QLRADGRMLE GTDTLAALEP GARLRCDRAM
TAAGGAGLAF TVRFHLAPEV EAQVDMGGTA VSLLPGGDEV WIFRARGARM ALAPSVALER
GRLKPRATRQ IVLTGRLLDY ATELDWSFTR VS