Gene Anae109_2078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_2078 
Symbol 
ID5376371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2358141 
End bp2359499 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content73% 
IMG OID640843591 
Productthreonine synthase 
Protein accessionYP_001379265 
Protein GI153004940 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.818493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0807622 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTC CCGCACCCTC GGCCTGGTTC CGCTGCGCGG AAGGCTGCGA CTTCCGGGCG 
GAGCTCGTCG ACGTCGTCTA CGAGTGCCCC CGCTGCGGCG GGCTGCTCGA GGTCGAGCAC
GACCGCGCCG CGCTCGCCGC GCGCTCCGGC GACGAGTGGA GGGCGCTGTT CGACGGCCGC
TTCAAGCTCG GGGCCTGGCC GTACGGCTCA GGGGTGTGGG GCAAGAAGGA GTGGGTCTAC
CCCCAGCTCG CGACCGAGAA CGTGGTCTCC ATGTACGAGG GCGGCAGCCC GCTCCTCCGG
GTGGATCGCT ACGCGCGCGA GCTCGGCCTC GAGGACGTGT GGGTGAAGGA GTGTGGCGTC
ACGCACACCG GGTCGTTCAA GGACCTGGGC ATGACGGTGC TCGTCTCCGC GGTGAAGGAG
ATGCGCGCGC GCGGCCGTGA GGTGCGGGCC GTCGCCTGCG CCTCCACGGG CGACACCTCC
GCCGCGCTCA GCGCCTACTG CGCCGCCGCC GGCATCCCGA GCGTGGTGCT CCTGCCGCGC
GGCAAGATCT CGACGGCCCA GCTCGTCCAG CCCATCTCGA ACGGTGCGCT CGTCCTCGAG
CTCGACACCG ACTTCGACGG CTGCATGCGC GTCGTGCGGG ACCTCGCGCG CACGAAGGAC
ATCTACCTCG CGAACTCGAT GAACTCCCTG CGCATCGAGG GGCAGAAGAC CGCCTCCATC
GAGATCGCGC AGCAGCTGGG CTGGACCACG CCGGACTGGA TCGTCATCCC GGGCGGGAAC
CTGGGGAACG CGAGCGCCGT GGGGAAGGGC TTCCAGCTCA TGAAGGAGCT GGGGCTGGTC
GACCGGCTGC CACGGCTCGT GGTCGCCCAG GCGGCGCAGG CGAACCCCCT CTGGAAGGCC
ACCAGCGGCG CTGGGGTCAA GCCCACCACG GCGGTGACGG TCGAGCCCGT CCCGGCGCAG
CGGACGCTCG CGTCGGCGAT CCAGATCGGC GCGCCGGTCT CGGCCCGGCG CGCCCTGCGC
GCGCTCGAGG CGCTCGACGG CGTCGTCGAG CAGGCCACCG AGCAGGAGCT CGCGGACGCC
GCCGCGCGGG CAGATCGCGC CGGCCTCTTC ACCTGTCCGC ACACGGGCGT CGCCCTCGCG
GCCCTCGAGA AGCTCGCGGC GCGCGGCGCC GTCCGCCGCG GCGAGCGGGT GGTGGTGATC
TCCACCGCGC ACGGGCTCAA GTTCTCGGAC TTCAAGGTCG GTTACCACGA CGGGACGTTG
CCGGGGATCG CGTCGCCGCT GCGGAACCCG GGGGTCCGCC TGCCGGCGAC CCTCGGCGCC
GTCCAGGACG CGATTGCCGC CCGCTTCGGG CGGGGGTAG
 
Protein sequence
MTAPAPSAWF RCAEGCDFRA ELVDVVYECP RCGGLLEVEH DRAALAARSG DEWRALFDGR 
FKLGAWPYGS GVWGKKEWVY PQLATENVVS MYEGGSPLLR VDRYARELGL EDVWVKECGV
THTGSFKDLG MTVLVSAVKE MRARGREVRA VACASTGDTS AALSAYCAAA GIPSVVLLPR
GKISTAQLVQ PISNGALVLE LDTDFDGCMR VVRDLARTKD IYLANSMNSL RIEGQKTASI
EIAQQLGWTT PDWIVIPGGN LGNASAVGKG FQLMKELGLV DRLPRLVVAQ AAQANPLWKA
TSGAGVKPTT AVTVEPVPAQ RTLASAIQIG APVSARRALR ALEALDGVVE QATEQELADA
AARADRAGLF TCPHTGVALA ALEKLAARGA VRRGERVVVI STAHGLKFSD FKVGYHDGTL
PGIASPLRNP GVRLPATLGA VQDAIAARFG RG