Gene Dfer_4034 details


Gene Information

Locus tag: Dfer_4034
Symbol:
ID: 8227632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 4885082
End bp: 4886974
Gene Length: 1893 bp
Protein Length: 630 aa
Translation table: 11
GC content: 50%
IMG OID: 644931877
Product: Heparinase II/III family protein
Protein accession: YP_003088402
Protein GI: 255037781
COG category:
COG ID:
TIGRFAM ID:
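
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(4885082, 4886974)
print(length)                     # 1893
print(protein_length_aa(length))  # 630
```

Under these assumptions the computed values agree with the listed Gene Length (1893 bp) and Protein Length (630 aa).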


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0114283
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGC TCCGGGACTT GTCGCTGCTT TGGCATATTG TTCGCCAAAT GGGCTGGGGA 
TATGTACTTT TTCGCGCTGG TTATTGGCTA GAAGGCAAAA CCGGCTTGCT CCGACTTCGT
TTTCCCAAAC GATCAACCAG GCGATCGTTC ATTAGCCTCC GGGAATGGCG AGATCGCAAC
CCTGCATTTC TCTTCGATCC TCATGAGGTG AATGTGAGCA GGAATGCGGG GCTTCACGAG
TTGAAGTGGC GGGTTAGCCA AATCAGGCGC CGGAGGTTTC AGTATTTCAG CAGTCAATGG
TTTGATGTTC ACGATTGGCA TACTAATCCG CTGACAGGCT ATCGGTACAA TGTGAATGAG
CATTGGTCGC TAGTGGATGA CTTTTCGGTA ACGGCGGGAG ATATCAAATA TGTCTGGGAA
AAATCGAGGT TTACATTTCT GTATGACCTC ATTCGATATG ACTATCATTT CAATGAGGAT
CAATCTGCGG AGGCTTTCAC ACTGATTTCC GATTGGATCG CTCAAAATCC CGTCAATCGC
GGCCCGAACT GGAAATGCAG CCAGGAAATC GCCCTGCGTG TGTTAAACTG GACTTTTGCA
TTGCATTACT TTCGAGGCTC GCACGCATTG AACCAAAGTC TACTCGATCA AATACTGTGC
AGCATTTACG ACCAAATCCG GCATGTTGCC CAAAACATCC GTTTTTCCAG GACTGCCGTC
CGCAACAACC ATATCCTTAC CGAAGCAATG GGCCTTTTTA CAACCGGGTT AACATTCCCG
TTTTTCCCCG AAAGTCCGGA ATGGGAAAAG GCAGGCAAGG AGTTATTTGA AAGAGAAATC
ATCACGCAAA TCGCGCCGGA TGGAACGTAT CTGCAATTTT CCATGAACTA TCACCGGGTG
GTAGTGCAAC TGCTCACCTG GGGAATCCGG CTGGGCGAGG TGCATGGCGA TTGTCTCTGC
GAAACCGTTT ATGCACAGGC GAAGGCTTCG CTGGTGTTTC TGAGAACCTG CCAGGATGAT
AGCAATGGTC AACTTCCTCA CTACGGGCAT TGCGACGGCG CCCTGTTTTT TCCGCTATCG
GAGTGTGAGT TCAGGGATTT TCGCCCTCAA ATGGCTGCTT TGGCAAATGT GCAGGGTGTT
GATTTGGGTT ATCAGACGGG AAAATGGTTG GAGGAAAGCC ATTGGTTGTG TGGACATCGC
CGACCGATAG CGGCACAAAT CAATCCCCGC TGCTTGTCAA CTTTCCGGGA TGGGGGCTAC
CATGTTATCC GAAATGCCGG CACAATTACA TTCCTGCGCT GCGGAAGTTA CAAGAGCCGG
CCATTTCAGG CAGATAACAA TCATCTCGAT ATCTGGATAA ATGGTCGAAA TATCCTGCGC
GACGCCGGAA CGTGGCTTTA CAATGCCGAT GAGGAATCGA CGCGGTATTT TTCCGGCACC
CGCTCGCATA ACACGGTTAC GATCGGTGAT TTCGACCAAA TGCAGAAAGA CCGGCGTTTC
ATTTGGACGC ACTGGATCAC CAGCGCCAAT GGCCAATGCG GCCGCGGCAA CGATGAATGC
TGGATTGAAG CCGAGTTCGA GGGGTTTCGC CATGTAGGCA AGGATATCGT CCACAAGCGG
CGTGTCATAA AGCAGGTTGG CCGGCTCCAC TGGGTAATCG AGGACTGGGT AGCGAACGTG
CCGACCGGCT TGCAAGTCCG GCAGCGATGG CATCCCGATG AACATTTCAG CCGCGATTTC
TGCATCCGGA CGTTCGACGA GCACGGACAT GAAATTCTTC CTGTCGTCGA CTCGGGCTGG
TATTCAGCTT TTTATGGTCA TAAAATCCAA AGTAATATGC TGGTTTTCGA AACTTCCGGG
AATTATTTCA AAACCATCAT CGAAAAAATA TAA
 
Protein sequence
MKMLRDLSLL WHIVRQMGWG YVLFRAGYWL EGKTGLLRLR FPKRSTRRSF ISLREWRDRN 
PAFLFDPHEV NVSRNAGLHE LKWRVSQIRR RRFQYFSSQW FDVHDWHTNP LTGYRYNVNE
HWSLVDDFSV TAGDIKYVWE KSRFTFLYDL IRYDYHFNED QSAEAFTLIS DWIAQNPVNR
GPNWKCSQEI ALRVLNWTFA LHYFRGSHAL NQSLLDQILC SIYDQIRHVA QNIRFSRTAV
RNNHILTEAM GLFTTGLTFP FFPESPEWEK AGKELFEREI ITQIAPDGTY LQFSMNYHRV
VVQLLTWGIR LGEVHGDCLC ETVYAQAKAS LVFLRTCQDD SNGQLPHYGH CDGALFFPLS
ECEFRDFRPQ MAALANVQGV DLGYQTGKWL EESHWLCGHR RPIAAQINPR CLSTFRDGGY
HVIRNAGTIT FLRCGSYKSR PFQADNNHLD IWINGRNILR DAGTWLYNAD EESTRYFSGT
RSHNTVTIGD FDQMQKDRRF IWTHWITSAN GQCGRGNDEC WIEAEFEGFR HVGKDIVHKR
RVIKQVGRLH WVIEDWVANV PTGLQVRQRW HPDEHFSRDF CIRTFDEHGH EILPVVDSGW
YSAFYGHKIQ SNMLVFETSG NYFKTIIEKI
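
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is illustrative only: it uses the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in which codons may act as starts), and it translates just the first 60 bases of the gene sequence listed above. The names `CODON_TABLE`, `translate`, and `FIRST_60_BASES` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the same
# amino acid and stop assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the 10-base block layout.
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from the record (60 bases).
FIRST_60_BASES = (
    "ATGAAAATGC TCCGGGACTT GTCGCTGCTT TGGCATATTG TTCGCCAAAT GGGCTGGGGA"
)
print(translate(FIRST_60_BASES))  # MKMLRDLSLLWHIVRQMGWG
```

The printed residues match the first 20 residues of the protein sequence shown above. Passing the full gene sequence to `translate` would allow the same comparison over all 630 residues.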