Gene Daro_3626 details


Gene Information

Locus tag: Daro_3626
Symbol:
ID: 3567992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3899298
End bp: 3901370
Gene Length: 2073 bp
Protein Length: 690 aa
Translation table: 11
GC content: 66%
IMG OID: 637682099
Product: transketolase
Protein accession: YP_286825
Protein GI: 71909238
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0021] Transketolase
TIGRFAM ID: [TIGR00232] transketolase, bacterial and yeast
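
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3899298
end_bp = 3901370

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 2073
print(protein_length_aa)   # 690
```

Under these assumptions the computed values agree with the reported Gene Length (2073 bp) and Protein Length (690 aa).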


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value: 0.560894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.611761
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCGCA ACCAGACCGA AATCACCTTC CGCCGCGAAC TGGCCAACGC CGTCCGCTTT 
CTGGCCATCG ATGCCGTCAA TCAGGCCAAG TCCGGACACC CTGGCGCGCC GATGGGCATG
GCCGACATCG CCGAAGTGCT GTGGCGTGAC CATCTCAAGC ACAATCCAGC CAACCCGCAG
TGGGCGGACC GCGACCGCTT CGTACTGTCG AACGGTCACG GCTCGATGCT GCTCTACGCG
CTGCTTCACC TGACCGGCTA CGACCTCAAT ATCGATGACC TCAAGAACTT CCGCCAGTTC
GGCAGCCGCA CCGCCGGCCA TCCGGAAGTC GGCCACACGC CCGGCGTCGA GACGACCACC
GGGCCGCTCG GCCAGGGCCT GACCAACGCC GTCGGCATGG CGCTGGCCGA AAAGCTGCTG
GCCCAGCGCT ACAACCGGCC AGGTTGCGAG ATCGTCGATC ACCGGACTTG GGTTTTCGTC
GGTGACGGCT GTCTGATGGA AGGCATCAGC CACGAAGCCT GTTCGCTGGC CGGCGTCTGG
GGCCTCGACA AGCTGACCTG CTTCTACGAC GACAACGGGA TTTCCATCGA CGGCCACGTC
AAAGGCTGGT TCCGCGACGA CACCCCGGCC CGCTTCCGCG CCTACGGCTG GCATGTTGTC
GGGCCGATTG ACGGCCATGA CTCGGTGGCG CTGTCCGCGG CCATTGCCGA AGCCAAGGCC
GTCACCAGCC AACCGACCCT GATCGTCTGC CGCACGCAGA TCGGTTGGGG TTCGCCGAAC
AAGGCCGGTT CGCACGACGT TCATGGCGCG CCGCTCGGCG CCGATGAAAC GGCCGCCACC
CGTGCCGCGC TTGGCTGGTT GCATCCGCCG TTTGAAGTGC CGGACAGCCT GCGCGCCGCG
TGGAATGCCC AGTCTACTGG CGCCACTGCC GAAGCCTCAT GGCTGGCCAG GTTCGCCGCT
TACCGCACCG AGTATCCGGA ACTCGCCGCC GAATTCGAAC GCACGCAGGC CGGCGGTCTG
CCGGAAAAAT GGCCGGAAAT TAAAAGCGAA CTGCTGGCCA CGGCCGGCCG CAAGGAAGGC
GCCGTCGCCA CCCGCAAGTC GTCGCAGAAC TGCCTCGATT GGCTGGTCGA CCGCGTGCCC
GAACTACTTG GCGGCTCGGC CGACCTGACC GGCTCCAACC TGACCGCCGG CAAAGGCAGC
GTGGCCCTGC ATGAAGTACC CCAAGGGCAC TTCCTTCGGG GCGCTGGTGA GCGGCAGGCC
AACTACATTT CCTACGGCGT CCGCGAATTC GGCATGACGG CGATCATGAA CGGTGTCGCC
CTGCACGGCG GGTTGATTCC CTACGGCGGC ACCTTCGCCG TCTTCTCCGA CTACGCCCGC
AACGCGATCC GGATGAGCGC CCTGATGCAG CAGCGCGTCG TCCATGTCCT GACCCACGAT
TCCATCGGCC TCGGCGAGGA TGGCCCGACC CACCAGCCGG TCGAGCACGC CAGCAGCCTG
CGCATCATTC CCGGCCTCGA CCTGTGGCGC CCGTGCGACG AGCTGGAAAC AGCCATCGCC
TGGGGCGCCG CGCTCGAGCG CCAAAACGGA CCTTCCACCC TCTTTTTGTC GCGTCAAAAC
CTGCCGCAAT ACGGCGGCGC GGCGAGCCGG GCGGAAGGTG CCAGCCGCGG CGGCTACGTG
CTCTCCGAAG CCGACGGCCC GCTGCAGGCA GTGATCATCG CCACCGGCTC GGAAGTCGCC
ATCGCCATGC AGGCGCAGGC CATTCTGAAA ACCGGCGGCG TTGCAGTGCG TGTCGTCTCG
ATGCCCTGCA CGCGGCGCTT CGACCAGCAG CCTTCGACGT GGAAGAAGCT CGTGCTGCCG
CCGGAAGTCT GCCGCGTCGC CATCGAAGCC GGCCAGACCG ATTTCTGGCG AAAGTACGTT
GGCCTCGACG GCGACGTGCT CGGCCTCGAC GAATTCGGCG CCTCGGCCCC GGCCCCGGTG
CTTTACGAAC ACTACGGCCT GACCGCGGAC AACCTGGCGC AGACGGTGTT GCGCACCATC
GTCAGTGCCG GGGGCAGTGA TGGTGACTTC TGA
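
The GC content field in the Gene Information section could be recomputed from the gene sequence above. The sketch below is one way to do so, assuming the sequence is supplied as text with the same space-separated blocks of ten bases; the function name `gc_fraction` is hypothetical. Only the first row of the sequence is used here for brevity, so the result is not expected to equal the whole-gene figure.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, copied as displayed.
first_row = "ATGGATCGCA ACCAGACCGA AATCACCTTC CGCCGCGAAC TGGCCAACGC CGTCCGCTTT"

first_row_gc = gc_fraction(first_row)
print(f"{first_row_gc:.0%}")
```

Passing the full 2073 bp sequence instead of `first_row` would give the value to compare against the reported GC content.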
 
Protein sequence
MDRNQTEITF RRELANAVRF LAIDAVNQAK SGHPGAPMGM ADIAEVLWRD HLKHNPANPQ 
WADRDRFVLS NGHGSMLLYA LLHLTGYDLN IDDLKNFRQF GSRTAGHPEV GHTPGVETTT
GPLGQGLTNA VGMALAEKLL AQRYNRPGCE IVDHRTWVFV GDGCLMEGIS HEACSLAGVW
GLDKLTCFYD DNGISIDGHV KGWFRDDTPA RFRAYGWHVV GPIDGHDSVA LSAAIAEAKA
VTSQPTLIVC RTQIGWGSPN KAGSHDVHGA PLGADETAAT RAALGWLHPP FEVPDSLRAA
WNAQSTGATA EASWLARFAA YRTEYPELAA EFERTQAGGL PEKWPEIKSE LLATAGRKEG
AVATRKSSQN CLDWLVDRVP ELLGGSADLT GSNLTAGKGS VALHEVPQGH FLRGAGERQA
NYISYGVREF GMTAIMNGVA LHGGLIPYGG TFAVFSDYAR NAIRMSALMQ QRVVHVLTHD
SIGLGEDGPT HQPVEHASSL RIIPGLDLWR PCDELETAIA WGAALERQNG PSTLFLSRQN
LPQYGGAASR AEGASRGGYV LSEADGPLQA VIIATGSEVA IAMQAQAILK TGGVAVRVVS
MPCTRRFDQQ PSTWKKLVLP PEVCRVAIEA GQTDFWRKYV GLDGDVLGLD EFGASAPAPV
LYEHYGLTAD NLAQTVLRTI VSAGGSDGDF
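
The protein sequence can be related back to the gene sequence by codon translation. The sketch below is a minimal illustration rather than the database's own method: it builds the standard codon table, whose amino acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons), and translates the first and last rows of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA, ignoring whitespace and stopping at a stop codon."""
    bases = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied as displayed.
# Each full row holds 60 bases, so every row starts in frame.
first_row = "ATGGATCGCA ACCAGACCGA AATCACCTTC CGCCGCGAAC TGGCCAACGC CGTCCGCTTT"
last_row = "GTCAGTGCCG GGGGCAGTGA TGGTGACTTC TGA"

print(translate(first_row))  # MDRNQTEITFRRELANAVRF
print(translate(last_row))   # VSAGGSDGDF
```

The two outputs correspond to the first twenty and last ten residues of the protein sequence shown above.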