Gene Teth514_1425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_1425 
Symbol 
ID5877816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp1460253 
End bp1461269 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content39% 
IMG OID641541774 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionYP_001663050 
Protein GI167040065 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
[TIGR01362] 3-deoxy-8-phosphooctulonate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00105429 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAATTG TTATGAATAT TGATGCCACT GATAAGCAAA TCTCTGATAT TACAAATCTT 
TTAACATCTC TCGGTTTAGG TTACCACATT TCAAAAGGTG AAGAAAAAAT AGTCATAGGT
GTTATAGGTG ATAAGAAAAA ATTAGAAGGT AAGGCTATAG AAATGATGGA GGGAGTAGAA
AAAGTCATTC CTATTGTGGA GCCTTACAAA CTCGCCAGCA GAATTTTCAA ACCAGAGCCT
ACCATTGTGG AAGTAGAGGA TGTAAAAATA GGTGGAAACA ACATAGTTAT AATGGCAGGA
CCTTGTGCAG TAGAAAGCAG AGAACAACTT TTTGAAAGTG CTATGGCGGT TAAAAAAGCG
GGAGCTCAAT TTTTAAGAGG AGGGGCATTT AAACCGAGAA CATCTCCTTA TTCTTTCCAA
GGTCTTGAAG AAGAGGGTTT AAAGATGCTT AAGGAAGCAA AAGAACTCAT TGGACTTAAG
ATTGTGACAG AGGTCATGGA TGTACATTCA GTTGAACTAG TGGCACAATA CGCGGATGTT
CTACAGATAG GTGCGAGAAA TATGCAGAAT TTCCCATTGC TTAAAGCAGT GGGTAGAATT
AACAAGCCTG TTTTATTGAA GAGGGGACTT GCTGCAACAT TAGAAGAGTG GTTAAGTGCT
GCTGAGTATA TTTTAAGTGA AGGCAACAAA GACGTTATCC TCTGTGAAAG AGGAATAAGG
ACTTTTGAAA CTTATACGCG AAATACTTTA GATTTAAGTG CAGTTCCAGC TATAAAAAAA
TTGAGTCATT TGCCTATAGT TGTTGACCCC AGTCACGGCA CTGGGAAATG GCACCTTGTA
GCTCCTATGG CTAAAGCTGC TATAGCAGCA GGAGCTGATG GACTCATTAT TGAGGTACAT
CCAGACCCCA AAAATGCATT ATCCGATGGG GCTCAATCCC TTACTCCTGA AAATTTTGAG
ACTTTGTGTC AAGACATAAA GGTTATCGCT AAAGCAGTAG GGCGTGATTT TGTATGA
 
Protein sequence
MVIVMNIDAT DKQISDITNL LTSLGLGYHI SKGEEKIVIG VIGDKKKLEG KAIEMMEGVE 
KVIPIVEPYK LASRIFKPEP TIVEVEDVKI GGNNIVIMAG PCAVESREQL FESAMAVKKA
GAQFLRGGAF KPRTSPYSFQ GLEEEGLKML KEAKELIGLK IVTEVMDVHS VELVAQYADV
LQIGARNMQN FPLLKAVGRI NKPVLLKRGL AATLEEWLSA AEYILSEGNK DVILCERGIR
TFETYTRNTL DLSAVPAIKK LSHLPIVVDP SHGTGKWHLV APMAKAAIAA GADGLIIEVH
PDPKNALSDG AQSLTPENFE TLCQDIKVIA KAVGRDFV