Gene Apre_1062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1062 
Symbol 
ID8397849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1134811 
End bp1136019 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content37% 
IMG OID644995409 
Productthreonine dehydratase 
Protein accessionYP_003152810 
Protein GI257066554 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR00260] threonine synthase
[TIGR01124] threonine ammonia-lyase, biosynthetic, long form
[TIGR01127] threonine dehydratase, medium form 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0280706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGCAA ATTTAGAAAT GATTAAAGAA GCAAGAGAAA TTCTTGAAGG TAATATTGAA 
AAGACTCCAA TATATACAGC ATCTAGAATG GGTGAGAATC TCTATATCAA GATGGAAAAC
TTACAAAAAA CAGGTTCCTT TAAACTAAGA GGAGCCTTCA ACAAGATTGC CCACCTTACA
GATGAACAAA AGAAAAAAGG TGTAATATCT TGTTCAGCAG GAAACCACGC CCAAGGTGTG
GCCCTATCAG CAACTAGACA AGGGATCAAA TCATATATAT GTATCCCATC AATTGCTCCT
CTTTCTAAGA TCGAAGCTAC TAGGGGCTAT GGTGGTGAAG TAATCATAGT AGATGGAACC
TTTGATGATG CTCAAGCTAA GGCTTATGAG CTTCAAAAAG AAAGAGATCT AACTTACGTT
GCACCTTTTG ATGATGAATA TGTACTATCT GGACAAGGTA CTATAGGTCT TGAAATCTTA
GATCAACTTC CAGATGTAAA ATACATCGTA GTTCCAATAG GTGGGGGTGG ACTAATTTCA
GGAATAGCCT TGGCTGTAAA ATCCCTAAGA CCAGATGTAA AAATCATAGG TGTGGAACCA
GAAAATGCAG CATCAATGCT CGCTTCAAGA AAAGCAGGGA AAATTGTAAC ACTTGATTCT
GCAAACACTA TGGCTGATGG TATAGCTGTC AAAAAACCAG GCGAGATTAC CTTTGACCTA
TGCGAAAAAT ATGTCGATGA AATAGTAACA GTATCAGAAG ATGAAATAAC CAACGCCATC
CTAAGACTTC TAGAAGAAAG TAAGGTAAGT GCAGAAGGAG CAGGAGCTTC ATCTGTTGCT
GCAGTACTTT CAAACAAATA TGATTTCTCT GATGGAAAAG TCTGTGCGGT TCTTTCTGGT
GGTAATATTA ACGTTAACAC AATCTATCAA ATCATTAACT CCGGTTTATT TAAAACTGGA
AGACTTACAG AAATTACCAC AACAATCTCC GATAAACCAG GTGAGCTAAT CAGACTTCTC
ACTATAATCA AAGACTTGGG CGCAAATATC AAAAATATCG ACCAATTTAA ATCAGCAGAA
ACAGTTGGAT TTGACCATGC AGTAGTAAGA ATTATAGCAG AAACTTATAA CAAAGAACAT
AGAAACCAAG TTTACCAAGC TCTAGCAGAT GCTGGATATG CAGAAAGTCA TATAAGACGC
AACAAATAA
 
Protein sequence
MTANLEMIKE AREILEGNIE KTPIYTASRM GENLYIKMEN LQKTGSFKLR GAFNKIAHLT 
DEQKKKGVIS CSAGNHAQGV ALSATRQGIK SYICIPSIAP LSKIEATRGY GGEVIIVDGT
FDDAQAKAYE LQKERDLTYV APFDDEYVLS GQGTIGLEIL DQLPDVKYIV VPIGGGGLIS
GIALAVKSLR PDVKIIGVEP ENAASMLASR KAGKIVTLDS ANTMADGIAV KKPGEITFDL
CEKYVDEIVT VSEDEITNAI LRLLEESKVS AEGAGASSVA AVLSNKYDFS DGKVCAVLSG
GNINVNTIYQ IINSGLFKTG RLTEITTTIS DKPGELIRLL TIIKDLGANI KNIDQFKSAE
TVGFDHAVVR IIAETYNKEH RNQVYQALAD AGYAESHIRR NK