Gene Tfu_1039 details

Gene Information

Locus tag: Tfu_1039
Symbol:
ID: 3579591
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobifida fusca YX
Kingdom: Bacteria
Replicon accession: NC_007333
Strand:
Start bp: 1217385
End bp: 1218431
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 68%
IMG OID: 637684734
Product: 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase
Protein accession: YP_289100
Protein GI: 72161443
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
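
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start/End coordinates are 1-based and inclusive and that the gene length counts the stop codon; neither convention is stated on this page, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1217385, 1218431)
print(length_bp)                  # 1047
print(protein_length(length_bp))  # 348
```

Under these assumptions the results match the listed gene length (1047 bp) and protein length (348 aa).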


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCATCG TTATGGCGCC CGACGCCCCT TCCGACACCA TCGACTCCAT CGTTGACCTC 
GTCGCCTCGG TGGGTGGTGA GGCCTACGTG ACCCGGGGGG TGAGCCGGAC CATCATCGGC
CTGGTAGGCG ACGTGGAACG GTTCGAGACG CTTAATCTGC GTGCCCTCCC CGGCGTCGCC
GACATCCTGC GCATCTCCAC TCCCTACAAG CTGGTCAGCC GGGAAAACAC GACTGAGCGG
TCGGTCGTGC AGGTTGCTGG GGTACCGATC GGCGGCGACC ATATGACACT CATCGCCGGT
CCCTGCGCGG TGGAAACCCC GGAGCAGACT TTGGAAGCGG CGCTGATGGC GAAGCGTGCG
GGTGCCGCGC TGCTGCGCGG CGGCGCCTAC AAGCCGCGGA CCTCCCCCTA CGCGTTCCAG
GGCCTGGGCG AGACCGGTCT GAAGATCCTG TCGGATGTGC GTGCGGAGAC CGGCCTGCCG
ATTGTGACCG AAGTGGTGGA CGCCTCCGAC GTGGAGCTGG TCGCCTCCTA CGCCGACATG
CTGCAGATCG GCACCCGCAA CATGCAGAAC TTCGCGCTGC TGCAAGCCGT GGGCGACGCG
GGCAAGCCGG TGCTGCTCAA ACGCGGGATG AGCAGCACGA TCGAGGAGTG GCTGATGGCC
GCCGAGTACA TTGCGCAGCG CGGCAACTTG AACATCGTGC TGTGCGAGCG CGGCATCCGC
ACGTTCGAGA AAGCGACCCG TAACACGCTG GACGTGAGCG CGGTCGCCGT GGCGCAGCGG
CTGTCGCACC TGCCCGTGGT GGTGGACCCG TCGCATTCGG GCGGCAAGCG GGAACTGGTG
CTGCCGCTGT CGCGTGCGGC GATCGCGGTG GGCGCGGACG GCCTCATCGT CGACGTGCAC
CCTGCTCCGG AGACGGCACT GTGCGACGGG CCGCAGGCGC TCACGCACGC AGACTTGGCC
GAGCTGGCGC ACGTGGTGAC GGCGCTGCCG CCGCTCGTGG GCCGCACGCT CACGCCCAGC
GTGGCGCAGG TGGGCGCCGG CGTGTAA
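
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be compared with the values in the record. The sketch below shows one way to do that; the helper names are hypothetical, and how this database rounds its own GC figure is not stated here.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases for display."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence (six blocks of ten bases).
first_line = "ATGGTCATCG TTATGGCGCC CGACGCCCCT TCCGACACCA TCGACTCCAT CGTTGACCTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence through the same two functions gives a length and a GC percentage that can be checked against the listed 1047 bp and 68%.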
 
Protein sequence
MVIVMAPDAP SDTIDSIVDL VASVGGEAYV TRGVSRTIIG LVGDVERFET LNLRALPGVA 
DILRISTPYK LVSRENTTER SVVQVAGVPI GGDHMTLIAG PCAVETPEQT LEAALMAKRA
GAALLRGGAY KPRTSPYAFQ GLGETGLKIL SDVRAETGLP IVTEVVDASD VELVASYADM
LQIGTRNMQN FALLQAVGDA GKPVLLKRGM SSTIEEWLMA AEYIAQRGNL NIVLCERGIR
TFEKATRNTL DVSAVAVAQR LSHLPVVVDP SHSGGKRELV LPLSRAAIAV GADGLIVDVH
PAPETALCDG PQALTHADLA ELAHVVTALP PLVGRTLTPS VAQVGAGV
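
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration, not the method used by this database. It builds the codon table from the amino-acid string of the standard genetic code, which translation table 11 shares; table 11 differs in its set of permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop display spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence, both in frame.
print(translate("ATGGTCATCG TTATGGCGCC CGACGCCCCT"))  # MVIVMAPDAP
print(translate("GTGGCGCAGG TGGGCGCCGG CGTGTAA"))     # VAQVGAGV
```

Both fragments agree with the first and last blocks of the protein sequence above. The last line of the gene sequence can be translated on its own because the 1020 bases before it are a multiple of three.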