Gene Daro_3701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3701 
Symbol 
ID3567913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3978823 
End bp3979836 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content67% 
IMG OID637682174 
Producttransketolase, central region:transketolase, C-terminal 
Protein accessionYP_286900 
Protein GI71909313 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.86607 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACAC TCACCCTGAA CGATGCCATC GGCCTGGCCC TGGCCGAGGA AATGCGCCGT 
GACCACAAGG TCATTGCCTT CGGCGAAGGT ATTGCCACCA AGCGCCATGA ACTCGTTACC
GAATTCGGCG CCCTGCGTGT CCGCAACACG CCGCTGGCCG AGGGGATCAT CGCCGGCACG
GCAGCCGGTG CCGCCGCCGG AGGCCTGCGC CCGGTCGCCG ATCTGCTCTT CGCTCCCTTC
CTCTGCTATG CCATGGACGA GCTGGTCAAC AGCGCCGGCA AACTGCGCTA CATGTCGGGC
GGCCAATTCA GCTTCCCGCT GGTCGCGCTG GCCATGACGG GGGCCGGCTG GGGCGTTGGC
GCCCAGCACA ACCACAACGT CGAAGCCTGG TTCGTGCATA GCCCCGGCCT CAAGGTCGTC
ATGCCAAGCA ACCCGGCCGA CGCCCGCGCG CTGCTCAAGA CGGCCATCCG CGACGACAAC
CCGGTCGTTT TCCTGCTTGA CATCGGCCTG CTCTATCAAC CCGGCGAAGT GCCAAGCGAA
GCCGTGCCGA TACCGCTCGG TCAGGCGACC ACGGTTCGCG CCGGCACGGA TGTCAGCCTC
ATTTCCTACG GCAAGACCGT GCATCACTGC GCGCAGGCGG CAGGAAGCCT GGCGGCCGAA
GGAATCGCCG CCGAAGTCAT CGACCTGCGC AGCCTGAAGC CGCTCGACGA GGCTGCCATC
CTCGCCACCG CCCGGAAGAC CGGGCGCGTC GTCGTCGTCC ATGAAGCCAA CCGCCTGTGC
GGTGTCGGCG CCGAAATCGC CGCGCTAATC GCCGAACAGG CCTTTGCCAG CCTCAAGGCG
CCCGTTGTCC GCCTCGGCGG CCCGGACGCC CCGGTGCCAT CCAGCTTCCC GCTCGAACAG
GCCACCGTGC CACAAGCCGA TGCCATTGCT GCCGCGGCAA GGCAACTTTG CGCATCGCGT
CGAGCCTTAA CCCACCCCCC AACGGAGAAC TCAAAATGCG CCTTACCACA CTGA
 
Protein sequence
MPTLTLNDAI GLALAEEMRR DHKVIAFGEG IATKRHELVT EFGALRVRNT PLAEGIIAGT 
AAGAAAGGLR PVADLLFAPF LCYAMDELVN SAGKLRYMSG GQFSFPLVAL AMTGAGWGVG
AQHNHNVEAW FVHSPGLKVV MPSNPADARA LLKTAIRDDN PVVFLLDIGL LYQPGEVPSE
AVPIPLGQAT TVRAGTDVSL ISYGKTVHHC AQAAGSLAAE GIAAEVIDLR SLKPLDEAAI
LATARKTGRV VVVHEANRLC GVGAEIAALI AEQAFASLKA PVVRLGGPDA PVPSSFPLEQ
ATVPQADAIA AAARQLCASR RALTHPPTEN SKCALPH