Gene Anae109_4443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4443 
Symbol 
ID5375788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp5202638 
End bp5203648 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content71% 
IMG OID640845971 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_001381605 
Protein GI153007280 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGTCC TCGTCACCGG CGGCTCCGGG TTCATCGGCG CGAACCTCGT CCGCCTCCTG 
CTCGTCGAGC GGCCGGGGTG GCGCGTCGTC AACCTCGACG CCCTCACCTA CGCCGGGAAC
GCCGAGAACC TGGCGGAGCT CGACGGGCAC GCGCGCTACC GCTTCGTGCG GGGGGACATC
TGCAACGGCG AGCTCGTCGC CGACGTGCTC GAGACGGAGA GGATCGACGC GGTGCTGCAC
CTCGCCGCCG AGAGCCACGT CGATCGCTCG ATCCTGTCGC CGCCCGTGTT CATCGAGACG
AACGTGCGCG GCACGCAGGT GCTGCTCGAG GCGGCGCGCG AGCTCGGGGT GAGGCGCTTC
GTCCACGTCT CCACCGACGA GGTGTACGGC TCCCTCGGCC CGAGCGGCCT GTTCACGGAG
GAGACGCCGC TCGACCCCTC CTCGCCGTAC TCCGCCTCGA AGGCCTCGAG CGACCTGCTC
GCGCTCGCCT ACGCGCGCAC GTTCGAGCTG CCGGTGGTGG TCACGCGCTG CTCGAACAAC
TACGGCCCCT ACCAGTTCCC GGAGAAGCTC ATCCCGCTCG CGATCGCCAA CGCGCTGCGG
GACCTGCCGC TGCCGGTGTA CGGCGACGGC CTGCACGTGC GCGACTGGAT CCACGTGGAG
GATCACTGCC GCGGGCTCCT CGCCGCGCTG GAGAAGGGCG AGAGCGGGCA GGTCTACAAC
CTGGGCGCGT CGAGCGAGCG GCACAACCTC GACGTCGTGA AGCAGGTGCT GCGGCTCGTC
GGGAAGCCCG AGTCGCTCAT CCAGCACGTG GCCGACCGGC CGGGGCACGA CCGTCGCTAC
GCCATCGACT CGACCAAGGC GCGGACGGTG CTCGGCTGGG CGCCGCGCCA CCGGTTCGAG
GAGGCGCTCG CGGCGACGGT GCGCTGGTAC GTGGAGCGCC GGCCGTGGTG GGAGCGGATC
ATCTCCGGCG AGTACCTCGC GTACTACGAG AAGCAGTACG GAGCCGGCTA G
 
Protein sequence
MNVLVTGGSG FIGANLVRLL LVERPGWRVV NLDALTYAGN AENLAELDGH ARYRFVRGDI 
CNGELVADVL ETERIDAVLH LAAESHVDRS ILSPPVFIET NVRGTQVLLE AARELGVRRF
VHVSTDEVYG SLGPSGLFTE ETPLDPSSPY SASKASSDLL ALAYARTFEL PVVVTRCSNN
YGPYQFPEKL IPLAIANALR DLPLPVYGDG LHVRDWIHVE DHCRGLLAAL EKGESGQVYN
LGASSERHNL DVVKQVLRLV GKPESLIQHV ADRPGHDRRY AIDSTKARTV LGWAPRHRFE
EALAATVRWY VERRPWWERI ISGEYLAYYE KQYGAG