Gene Dshi_3840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3840 
Symbol 
ID5714369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp48774 
End bp50414 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content73% 
IMG OID641276755 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_001542051 
Protein GI159046380 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGGC GGCGCAGCGG GGCTCAGGCC CTTGTGGAGA GCCTGTTTTC GGCGGGCGTG 
GACACGGTCT ACGGCGTGCC GGGCGAGGAG ACCACGGCGC TGATGGCCGC CCTTCACGAC
AGCGAGATGG CGTTCGTGCT GTGCCGGCAC GAACAGGCGG CGGCCTTCAT GGCCGGGGTG
CATGGGCGGC TGACCGGGCG ACCGGCGGCG TGTCTCGCCA CGCTGGGGCC GGGGGCGACG
AACCTGGTGA CCGGGGTGGC CGATGCCACG CTGGATTTCG TGCCGATGAT TGCCATCACG
GGGCAGGGGG GGTGCGGCCG CCTTGGCCGG GAGAGCCACC AGATCATCGA CCTGGAGGCG
CTGTTTGCGC CGGTGACGAA ACAGAGCCGG ACCTTGCTGG AGGCCGACGC GGTGCCCGGG
GCCGTGGCCG AGGCCGTGCG CCTGGCCCGG GCGGAAAAGC CGGGCGCGGT GCATCTGTGC
CTGCCCGAGG ACGTGGCGGA CGCGCAGACC GCGCTGCGCG ATGTGCCGGT GCCCCAGGTC
CTGCCCAGCC CCCCCGCGCC GGAGGCGATT GCCCAGGCGC TGACCCTGTT GACCCGGGCG
GAACGGCCCA TCCTGCTGGC CGGGGCCGGG GTGATCCGGG CCGGGGCGAC GGCGGAGCTG
CGCGCCTTTG CCGAGGCGAC CGGGATCGCG GTGGTCACGA GCTTCATGGC CGGGGGCGTG
CTGCCGCCGG AGCACGAGTT GACCCTGTTC ACCGTGGGCC AGCCCGAAGG CGATTATGTG
GATCTGTCCT TCGAGGCGGC GGATCTGATC GTGTCGGTGG GGTTCGACCC GGTGGAATAC
CCTGCCGCGG ATCTGAGCCG GGACGGGGCG ATACCGGTGC TGCATCTCGG GGCCGGCCCG
GCCCCGGCGG ATGCGGGCTG GCACGTCGCC GGGCAGGTGG TCGCGGGCTT GCCCGAGACC
CTCGCGGCGC TCGCCGAGGC ACTTGAGGCG CGGCGCTGGG ACATGCCGCC GGCCTTTGCC
GGGGTGCAGG CGGGGATGCG CAAAGCCTTT GCAAGAGCCT ATTCCACGTC GTCGGAGGGG
CCGGTCGCGC CCCAGGATAT CTGCGCCGAG ATCACCCGGC AGTTGCGCGC CGAGGACACG
GTCCTGTCCG GCGTGGGGCT GCACAAGCTG TGGATCGCGC GGCATGTGCT GCCGCGGCGG
CCCGGGCAGG TGATCATTCC CAACGGGCTG GCGGGGATGG GCCTTGCCCT GCCCGGGGCG
GTTGCGGCGG CGCGGTTGCA GCAGGCGGGC CGGGTGCTCG CGATCTGCGG CGACGGGGAC
GTGATGATGA ACGTCCAGGA CATGGAGACG GCGGCGCGGC TGGGTCTCGA CCTGACGGTG
ATGGTCTGGG AGGACGGCGG CTATGGCCTG ATCGACGCGC ATCAGCAGAA GGCCGGGGAC
GACTCGACCT TCGGCTTCGG CACGCCCGAC TGGGGGCGGC TCGCGCGCGC CTTCGGCTGG
AGCCATGCGC CGGTCGCGGG CCTGTCGGAG TTGGGCGAGA TCCTGCGCGC GGGCCATGAC
AGCGCCGGGC CGACCCTGGT CTCCGTGCCG GTGGATTATG CCGCCGGAGG GGGCTTGCCG
GGGGTGCGCC CCGCGGCGTG A
 
Protein sequence
MAGRRSGAQA LVESLFSAGV DTVYGVPGEE TTALMAALHD SEMAFVLCRH EQAAAFMAGV 
HGRLTGRPAA CLATLGPGAT NLVTGVADAT LDFVPMIAIT GQGGCGRLGR ESHQIIDLEA
LFAPVTKQSR TLLEADAVPG AVAEAVRLAR AEKPGAVHLC LPEDVADAQT ALRDVPVPQV
LPSPPAPEAI AQALTLLTRA ERPILLAGAG VIRAGATAEL RAFAEATGIA VVTSFMAGGV
LPPEHELTLF TVGQPEGDYV DLSFEAADLI VSVGFDPVEY PAADLSRDGA IPVLHLGAGP
APADAGWHVA GQVVAGLPET LAALAEALEA RRWDMPPAFA GVQAGMRKAF ARAYSTSSEG
PVAPQDICAE ITRQLRAEDT VLSGVGLHKL WIARHVLPRR PGQVIIPNGL AGMGLALPGA
VAAARLQQAG RVLAICGDGD VMMNVQDMET AARLGLDLTV MVWEDGGYGL IDAHQQKAGD
DSTFGFGTPD WGRLARAFGW SHAPVAGLSE LGEILRAGHD SAGPTLVSVP VDYAAGGGLP
GVRPAA