Gene Tmz1t_2239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2239 
Symbol 
ID7083671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2522728 
End bp2523804 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content67% 
IMG OID643699258 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002355874 
Protein GI217970640 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00478334 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCACCC AGACCGAAAA CCTCAACGTC CTCGCCTTCG ACCACATGCC CTCGCCGGAC 
GAGGTGAAGG CGCGCGTGCC GCTGACCGAG CGCGCCGCGG CTGCGGTGGT GGCCGGGCGC
AAGGCGGTGA TGGACATCCT CGATCGCAAG GATCCGCGCG TGTTCGTGGT GGTCGGGCCG
TGCTCGATCC ATGACCCAGT GGCCGGGCTG GATTATGCGC GACGGCTGAA GGCGCTCGCC
GACGAGGTCT CCGACGTGCT GCTGCTGGTG ATGCGGGTGT ATTTCGAGAA GCCGCGCACC
TCGACCGGGT GGAAGGGCTA CATCAACGAT CCGTTCATGG ACGACTCCTT CCGCATCGAC
GTGGGCATGG AGCGTGCGCG CAAATTCCTC CTCGACGTGT GCGAGCTCGG CCTGCCCACG
GCCACCGAGG CGCTCGACCC GATCGCACCG CAGTATTACG GCGACCTCAT CGCCTGGACC
GCGATCGGCG CGCGCACCTC CGAGTCGCAG ACCCATCGCG AGATGGCCTC GGGCCTGTCG
ACGCCGGTCG GCTTCAAGAA CGCCACCGAT GGCGACCTCG AGGTGGCGAT CAACGCGATC
ATTTCGGCCG GCAGTCCGCA CAGCTTCCTC GGCATCAACA GCCAGGGCCA GTCGGCGGTT
ACCCGCACGC GCGGCAACCG TTACGGCCAC GTGGTGCTGC GCGGCGGCGG CGGCCGGCCC
AACTACGACA CGGTGTCGGT GTCGCTGGCC GAGCAGGCGC TCGCGAAGGC CAAGCTGGCG
AAGAACATCG TGGTCGATTG CTCGCACGCC AACTCGTGGA AGAAGCCCGA ATACCAGCCC
CTGGTGATGA AGGACGTGAT GCATCAGATC CGCGAGGGCA ACCAGTCGAT CGTCGGCCTG
ATGATCGAGA GCAATATCGA AGCCGGCAAC CAGCCGATTC CGGCCGACCT GTCGCAGCTC
AAGTACGGCT GTTCGGTCAC CGATGCCTGT GTCGATTGGG CGACGACCGA GGACATGATC
CGCAAGTCCG CCGCCGTGCT GCGCGACGTG CTGCCGAAGC GGGAGCGGCG CGCATGA
 
Protein sequence
MPTQTENLNV LAFDHMPSPD EVKARVPLTE RAAAAVVAGR KAVMDILDRK DPRVFVVVGP 
CSIHDPVAGL DYARRLKALA DEVSDVLLLV MRVYFEKPRT STGWKGYIND PFMDDSFRID
VGMERARKFL LDVCELGLPT ATEALDPIAP QYYGDLIAWT AIGARTSESQ THREMASGLS
TPVGFKNATD GDLEVAINAI ISAGSPHSFL GINSQGQSAV TRTRGNRYGH VVLRGGGGRP
NYDTVSVSLA EQALAKAKLA KNIVVDCSHA NSWKKPEYQP LVMKDVMHQI REGNQSIVGL
MIESNIEAGN QPIPADLSQL KYGCSVTDAC VDWATTEDMI RKSAAVLRDV LPKRERRA