Gene Information |
Locus tag | Dshi_0053 |
Symbol | |
ID | 5711675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 49943 |
End bp | 51163 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641265947 |
Product | HI0933 family protein |
Protein accession | YP_001531403 |
Protein GI | 159042609 |
COG category | [R] General function prediction only |
COG ID | [COG2081] Predicted flavoproteins |
TIGRFAM ID | [TIGR00275] flavoprotein, HI0933 family |
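The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but its numbers are consistent with both. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 49943
end_bp = 51163
protein_len_aa = 406

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_len_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which encodes no residue.
codons = gene_len_bp // 3
expected_protein_len = codons - 1

print(gene_len_bp)           # 1221
print(gene_len_bp % 3)       # 0, i.e. a whole number of codons
print(expected_protein_len)  # 406
```

Under these assumptions the 1221 bp gene holds 407 codons, of which 406 encode amino acids, matching the listed protein length.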
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000000000927745 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGGTA TCGCCCCCTT GCCCGAGATC GAGACGGATG CGCTGGTCGT GGGGGCCGGT CCCGCGGGGC TGATGGCCGC CGAGCAGCTG GCGCAGGCGG GGTTTTCCGT GCGTATCGCC GAGCAGATGC CCAGTGCCGG GCGCAAGTTC CTGATGGCGG GCAAGAGCGG GCTGAACCTG ACCAAGGATG AAGAGATGCC CGCGTTCCTG GGGGCTTATG GCGGGGCGGC GGCGTGGCTG GCGCCGATGC TGGAGGCGTT CGGGCCCGAC GCGGTGCAGG ACTGGGCCCG GGGGCTGGGC CAGCCGGTCT TCACCGGGTC CACGGGGCGG GTGTTTCCCG AGGCGATGAA GGCGTCGCCG CTGCTCCGCG CGTGGCTGGC GCGGCTGGCG GGCCTGGGCG TGCGGCTCGA CACGCGCTGG CGCTGTCTCG GGTGGCAGGA CGGGGCGCTG CGGTTCGAGA CGCCCGCGGG GCCGGTGCGG GTGCGGGCGC GGGCGGTGGT GCTGGGTCTG GGCGGGGCGA GCTGGCGGCG GCTTGGCTCG GACGGGGCCT GGGCCGGGTG GATCGGGGCC GCGTGCGCGC CGTTCGCCCC GGCGAATGTG GGGCTGCGGG TGGATTGGAG CCCGCATATG GCGCGCCATT TCGGGGCGCC GGTGAAGGGG GCCGCGTTGT CGTCGGGCGG GGTGGTGTCG CGCGGCGAGG TCGTGGTCTC GGCCCGGGGG CTGGAGGGCG GCGGGCTCTA TCCGCTGTGC CCGGCCCTGC GGGAGGGCGC GGGGCTGCGG GTGGATCTGT GCCCGGACCT GGAGGTCGGG GCGCTGGCCG CGCGGCTGGC CCGGGTGCCG GCCAAGGCCA GCGGGGCCAG CCGGTTGCGC AAGGGCGCGG GCCTGTCGCC GGTCAAGCAG GCGCTGGTGC AGGAATGCGC GCGCCCCCTG TCGCGCGATC CGGCGGATTT GGCGCGGGTT CTCAAGGATT TGGGGGTGCC GCACCAGGGG GTGCGCCCGC TGGACGAGGC GATTTCGGTG GCCGGGGGTG TCGCGCGCGC GGCGCTGGAC GACCGGTTGA TGCTGCGCGA CCGGCCCGGC GTGTTCGCCT GCGGCGAGAT GCTGGACTGG GAGGCGCCGA CGGGGGGCTA CCTGCTCACG GGCTGTTTCG CGACGGGGCG TTGGGCCGGG CTGGGGGCGG TGGACTGGCT GCGGGGTGCT CAGGCGGCGG CGCGGGCGTA G
|
Protein sequence | MTGIAPLPEI ETDALVVGAG PAGLMAAEQL AQAGFSVRIA EQMPSAGRKF LMAGKSGLNL TKDEEMPAFL GAYGGAAAWL APMLEAFGPD AVQDWARGLG QPVFTGSTGR VFPEAMKASP LLRAWLARLA GLGVRLDTRW RCLGWQDGAL RFETPAGPVR VRARAVVLGL GGASWRRLGS DGAWAGWIGA ACAPFAPANV GLRVDWSPHM ARHFGAPVKG AALSSGGVVS RGEVVVSARG LEGGGLYPLC PALREGAGLR VDLCPDLEVG ALAARLARVP AKASGASRLR KGAGLSPVKQ ALVQECARPL SRDPADLARV LKDLGVPHQG VRPLDEAISV AGGVARAALD DRLMLRDRPG VFACGEMLDW EAPTGGYLLT GCFATGRWAG LGAVDWLRGA QAAARA
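As a rough consistency check between the two sequences, the sketch below removes the spaces that split each sequence into blocks of ten and translates the first 30 bases of the gene. It assumes that translation table 11 assigns internal codons the same amino acids as the standard code and differs only in which start codons are permitted. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tool.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into blocks of ten."""
    return "".join(grouped.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons starting at position 0; '*' marks a stop."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence and first block of the protein.
gene_prefix = clean("ATGACAGGTA TCGCCCCCTT GCCCGAGATC")
protein_prefix = clean("MTGIAPLPEI")

# Last two blocks of the gene sequence; the final three bases are the stop codon.
gene_suffix = clean("CGCGGGCGTA G")

print(translate(gene_prefix))       # MTGIAPLPEI
print(translate(gene_suffix[-3:]))  # *
```

Passing the full gene sequence through `clean` and then `gc_fraction` would give the GC fraction for the whole gene; the short prefix used here is not representative of that figure.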