Gene TM1040_0376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0376 
Symbol 
ID4078609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp384690 
End bp386501 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content63% 
IMG OID638005671 
Productphosphogluconate dehydratase 
Protein accessionYP_612371 
Protein GI99080217 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR01196] 6-phosphogluconate dehydratase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCA ATGCAACACT CGCCGCCGTC ACTGACCGTA TCATCGAACG CAGCCGCCCC 
ACGCGCGAGG CTTATCTGGC GCGGATGCGC GCCGCCGCCT CCAAAGGCCC GGCCCGCGCG
CATCTGAGTT GCTCCGGTCA GGCCCACGCC TATGCCGCCA CCGGCCCGGA TCAGGAGACA
CTGGCCACCG GTACAGGCGG GCACCTCGGC ATCGTCACCG CCTACAACGA CATGCTCTCG
GCGCATCAGC CGTTTGAAAC CTACCCCATG CTGATCCGTG AAGCGGTGCG CGAAGCGGGC
GGCACCGCGC AGGTGGCCGG TGGCGTGCCC GCCATGTGCG ATGGCATCAC GCAGGGCGAA
GCCGGCATGG AGCTGTCGCT CTTTTCGCGC GACAGTATTG CGATGTCCGT CGGCATCGCC
CTGTCGCACA ACGTCTTTGA CGCCACTGTG TTTCTGGGCG TTTGCGACAA GATCGTTCCC
GGCCTTGTGA TCGGCGCCCA GGCCTTTGGC CATTTGCCCG CCGTGTTTCT GCCCGCAGGA
CCGATGACCT CTGGTCTGTC AAACGACGAA AAAGCGCAGA TTCGCCAGAA GTTTGCCAAG
GGTGAGATCG GCCGGGACGA GCTACTCAAG GCCGAAATGG CCGCCTATCA CGGCCCCGGC
ACATGTACGT TCTACGGCAC CGCCAACACC AACCAGATGT TGATGGAGTT CATGGGGCTG
CACCTGCCCG GGTCGTCCTT TGTCAATCCC GGCACCGAGT TGCGCGACGC CCTCACCCGC
GAAGGCGCCA AACGCGCGCT GTCGCTCTCG GCGCTTGGCA ATCACTACAC GCCGACCTGC
GATATTCTTG ATGAAAAGGC CTTTGTGAAC GGCATGGTCG GGCTGATGGC GACCGGTGGT
TCTACCAACC TGCTGATCCA TCTTATTGCC ATGGCCCGCG CGGGCGGCAT CATCCTCGAC
TGGCAGGATT TCTCCGAGAT CTCCGATGTG GTGCCGCTGC TGGCGCGCGT CTATCCCAAC
GGGCTGGCGG ATGTGAACCA CTTCCACGCC GCTGGTGGGC TTGGATACAT GATCGGAGAA
CTCCTCGAGA ATGGCTATCT GCACCCCGAC ACAAAAACCG TAACCGGCGA GGGGCTTGGC
GCCTATCTGA TGGAGCCTTT CCTCGACGGT GGCGCGCTCA CATGGCGCAA GGGCACCACG
GAATCCCTGA ACGACAAGAT CGTGCGCCCT GCCTCGGATC CGTTCCAGAA GACCGGTGGC
CTTGCACGCC TGACCGGCAA CCTCGGCACC GGGGTGATGA AGATCTCTGC CGTAGCCGAA
GAACACCGCG TCGTTGAGGC ACCCGTGCGC GTTTTCCACG ACCAGGACGA GGCCAAGGCG
GCCTTCAAGG CGGGCGAGCT TGATGAAGGC GACGTTGTGA TCGTGGTCCG TTTCCAAGGC
CCCAAGGCCA ATGGCATGCC GGAGCTGCAC TCGATGACAC CCTTCCTGTC GATCATGCAG
GGGCGGGGCC AGAAGGTGGC GCTTGTGACC GATGGTCGGA TGTCGGGTGC CTCTGGCAAG
GTTCCCTCCG CGATCCATGT CGTCCCCGAG GCCCTCGATG GCGGCGCAAT CGCTAAGCTG
CAGGATGGCG ACATTGTGCG GGTGGATGCG ATCTCGGGCA ATCTCGAAGT CCTCACAGAA
GGCGTGCTAG ACCGCCCGGC GGCAACCGCC GATCTTACCT CTTATCAACA CGGCACTGGG
CGCGAGCTCT TTGCCCTGTT TCGCAGTTCC GTGACCTCTG CCGACACCGG CGCAACCGTA
TTTGGAGTTT GA
 
Protein sequence
MSLNATLAAV TDRIIERSRP TREAYLARMR AAASKGPARA HLSCSGQAHA YAATGPDQET 
LATGTGGHLG IVTAYNDMLS AHQPFETYPM LIREAVREAG GTAQVAGGVP AMCDGITQGE
AGMELSLFSR DSIAMSVGIA LSHNVFDATV FLGVCDKIVP GLVIGAQAFG HLPAVFLPAG
PMTSGLSNDE KAQIRQKFAK GEIGRDELLK AEMAAYHGPG TCTFYGTANT NQMLMEFMGL
HLPGSSFVNP GTELRDALTR EGAKRALSLS ALGNHYTPTC DILDEKAFVN GMVGLMATGG
STNLLIHLIA MARAGGIILD WQDFSEISDV VPLLARVYPN GLADVNHFHA AGGLGYMIGE
LLENGYLHPD TKTVTGEGLG AYLMEPFLDG GALTWRKGTT ESLNDKIVRP ASDPFQKTGG
LARLTGNLGT GVMKISAVAE EHRVVEAPVR VFHDQDEAKA AFKAGELDEG DVVIVVRFQG
PKANGMPELH SMTPFLSIMQ GRGQKVALVT DGRMSGASGK VPSAIHVVPE ALDGGAIAKL
QDGDIVRVDA ISGNLEVLTE GVLDRPAATA DLTSYQHGTG RELFALFRSS VTSADTGATV
FGV