Gene Dshi_0053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0053 
Symbol 
ID5711675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp49943 
End bp51163 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content76% 
IMG OID641265947 
ProductHI0933 family protein 
Protein accessionYP_001531403 
Protein GI159042609 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000927745 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAGGTA TCGCCCCCTT GCCCGAGATC GAGACGGATG CGCTGGTCGT GGGGGCCGGT 
CCCGCGGGGC TGATGGCCGC CGAGCAGCTG GCGCAGGCGG GGTTTTCCGT GCGTATCGCC
GAGCAGATGC CCAGTGCCGG GCGCAAGTTC CTGATGGCGG GCAAGAGCGG GCTGAACCTG
ACCAAGGATG AAGAGATGCC CGCGTTCCTG GGGGCTTATG GCGGGGCGGC GGCGTGGCTG
GCGCCGATGC TGGAGGCGTT CGGGCCCGAC GCGGTGCAGG ACTGGGCCCG GGGGCTGGGC
CAGCCGGTCT TCACCGGGTC CACGGGGCGG GTGTTTCCCG AGGCGATGAA GGCGTCGCCG
CTGCTCCGCG CGTGGCTGGC GCGGCTGGCG GGCCTGGGCG TGCGGCTCGA CACGCGCTGG
CGCTGTCTCG GGTGGCAGGA CGGGGCGCTG CGGTTCGAGA CGCCCGCGGG GCCGGTGCGG
GTGCGGGCGC GGGCGGTGGT GCTGGGTCTG GGCGGGGCGA GCTGGCGGCG GCTTGGCTCG
GACGGGGCCT GGGCCGGGTG GATCGGGGCC GCGTGCGCGC CGTTCGCCCC GGCGAATGTG
GGGCTGCGGG TGGATTGGAG CCCGCATATG GCGCGCCATT TCGGGGCGCC GGTGAAGGGG
GCCGCGTTGT CGTCGGGCGG GGTGGTGTCG CGCGGCGAGG TCGTGGTCTC GGCCCGGGGG
CTGGAGGGCG GCGGGCTCTA TCCGCTGTGC CCGGCCCTGC GGGAGGGCGC GGGGCTGCGG
GTGGATCTGT GCCCGGACCT GGAGGTCGGG GCGCTGGCCG CGCGGCTGGC CCGGGTGCCG
GCCAAGGCCA GCGGGGCCAG CCGGTTGCGC AAGGGCGCGG GCCTGTCGCC GGTCAAGCAG
GCGCTGGTGC AGGAATGCGC GCGCCCCCTG TCGCGCGATC CGGCGGATTT GGCGCGGGTT
CTCAAGGATT TGGGGGTGCC GCACCAGGGG GTGCGCCCGC TGGACGAGGC GATTTCGGTG
GCCGGGGGTG TCGCGCGCGC GGCGCTGGAC GACCGGTTGA TGCTGCGCGA CCGGCCCGGC
GTGTTCGCCT GCGGCGAGAT GCTGGACTGG GAGGCGCCGA CGGGGGGCTA CCTGCTCACG
GGCTGTTTCG CGACGGGGCG TTGGGCCGGG CTGGGGGCGG TGGACTGGCT GCGGGGTGCT
CAGGCGGCGG CGCGGGCGTA G
 
Protein sequence
MTGIAPLPEI ETDALVVGAG PAGLMAAEQL AQAGFSVRIA EQMPSAGRKF LMAGKSGLNL 
TKDEEMPAFL GAYGGAAAWL APMLEAFGPD AVQDWARGLG QPVFTGSTGR VFPEAMKASP
LLRAWLARLA GLGVRLDTRW RCLGWQDGAL RFETPAGPVR VRARAVVLGL GGASWRRLGS
DGAWAGWIGA ACAPFAPANV GLRVDWSPHM ARHFGAPVKG AALSSGGVVS RGEVVVSARG
LEGGGLYPLC PALREGAGLR VDLCPDLEVG ALAARLARVP AKASGASRLR KGAGLSPVKQ
ALVQECARPL SRDPADLARV LKDLGVPHQG VRPLDEAISV AGGVARAALD DRLMLRDRPG
VFACGEMLDW EAPTGGYLLT GCFATGRWAG LGAVDWLRGA QAAARA