Gene Anae109_3556 details

Gene Information

Locus tag: Anae109_3556
Symbol:
ID: 5378193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4169158
End bp: 4170369
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 71%
IMG OID: 640845078
Product: threonine dehydratase
Protein accession: YP_001380721
Protein GI: 153006396
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01124] threonine ammonia-lyase, biosynthetic, long form
            [TIGR01127] threonine dehydratase, medium form
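
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4169158
END_BP = 4170369


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1212
print(protein_length(length_bp))  # 403
```

Both results agree with the Gene Length and Protein Length fields of the record.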


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.597428
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.136455
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCTCGC TCCAGGACGT CCAGGCCGCG CTCGGTCGGA TCCGGGATCG CATCTACGTC 
TCGCCCTGCG CCCGCACCGA GACGCTCTCC CGGCTCACCG GCACGAGCGC CCATCTCAAG
CTCGAGAACC TCCAGATGAC CGGCGCCTAC AAGGAGCGCG GGGCGCTGAA CAAGCTCCTG
CTGCTCTCGC CGGCGGAGCG CGACCTCGGC CTCATCGCGG CGAGCGCGGG GAACCACGCG
CAGGCGGTCG CCTACCACGC CGGGCGGCTC GGCGTGAAGG CGACCATCGT GATGCCGGAG
ACCACGCCCA TCACGAAGGT GGCGAACACC CGCGCCCACG GCGCGCGCAT CGTGCTGCAC
GGCGCCAGCT ACGACGAGGC GTACGCGGAG GCGCGGCGGC TGGAGCAGGC CGAGGGGCTC
ACCTTCGTCC ACCCGTTCGA CGACCCCGCC ATCATCGCGG GCCAGGGCAC CGTCGGGCTC
GAGATCCTGG AGCAGGACCC GGCGGTCGAG ACGATCGTCG TGCCCATCGG AGGCGGCGGT
CTCGTCTCCG GCGTCGCGGT GGCGGCCAAG GAGACGCGGC CCGGCGTACG CGTCGTGGGC
GTCGAGACGG AGGTGCTGCC CTGCATGGTC GCGGCGCTCG AGGCCGGCAA GCCGGTGACG
CTCGAGGCGG CGAACACCGT CGCCGACGGC ATCGCGGTGA AGCGCGCGGG CGAGCTCACG
CTCGATCACG TGAAGCGCTA CGTGGACGAC ATCGTCACGG TCTCGGAGGA GGAGATCGCG
AGCGCGATCC TGTACCTGCT CGAGAAGGAG AAGACGGTCG CGGAGGGCGC CGGCGCGGTC
GCGGTGGCGG CGCTCATGCA GCGCAAGATC CGCGGCGTCG AGGGGCGGAA CGTCGTCGCC
GTCGTCTCCG GTGGCAACAT CGACGTGAAC CTCGTCGCGC GCGTGATCGA GCGCGGCCTC
GTCAAGGACG GGCGGCTCGT CCGGATCAGC GTGGCGCTCC AGGACAAGCC GGGGCAGCTC
GCCAAGGTGT CCGCCATCGT GGCCCACCAC CGCGCGAACG TGATCGAGGT TCACCACACG
CGCGCGTTCG CCTACCGCTT CGGCGACACG ACGCTGCAGC TGACCCTCGA GACGCGCGGG
CCAGAGCACG TCGAGGAGCT CCTCACGGCC CTGCGCGAGC GGGGATACCA GGTGCAGCGG
ATGGGGATGT AG
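
The GC content field reported for this gene can in principle be recomputed from the sequence block above. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C among the unambiguous bases; `gc_content` is a hypothetical helper, and the demonstration uses only the first row of the sequence rather than the whole gene.

```python
def gc_content(sequence):
    """Fraction of G/C among A, C, G, T characters.

    Whitespace and any other characters (for example N) are ignored,
    so the 10-base blocks of the record can be pasted in directly.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, copied from the record.
first_row = "ATGGTCTCGC TCCAGGACGT CCAGGCCGCG CTCGGTCGGA TCCGGGATCG CATCTACGTC"
print(f"{gc_content(first_row):.1%}")
```

Passing the full 1212 bp sequence to the same function gives a value that can be compared with the GC content field of the record.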
 
Protein sequence
MVSLQDVQAA LGRIRDRIYV SPCARTETLS RLTGTSAHLK LENLQMTGAY KERGALNKLL 
LLSPAERDLG LIAASAGNHA QAVAYHAGRL GVKATIVMPE TTPITKVANT RAHGARIVLH
GASYDEAYAE ARRLEQAEGL TFVHPFDDPA IIAGQGTVGL EILEQDPAVE TIVVPIGGGG
LVSGVAVAAK ETRPGVRVVG VETEVLPCMV AALEAGKPVT LEAANTVADG IAVKRAGELT
LDHVKRYVDD IVTVSEEEIA SAILYLLEKE KTVAEGAGAV AVAALMQRKI RGVEGRNVVA
VVSGGNIDVN LVARVIERGL VKDGRLVRIS VALQDKPGQL AKVSAIVAHH RANVIEVHHT
RAFAYRFGDT TLQLTLETRG PEHVEELLTA LRERGYQVQR MGM
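
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table; it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are illustrative, and the handling of alternative start codons is deliberately left out, which is harmless here because the gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string until the first stop codon.

    Whitespace is stripped first; a trailing partial codon is ignored.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGGTCTCGC TCCAGGACGT"))  # MVSLQD
print(translate("ATGGGGATGT AG"))          # MGM
```

The two fragments reproduce the first six and last three residues of the protein sequence listed above.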