Gene Cagg_1836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1836 
Symbol 
ID7267748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2252194 
End bp2253519 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content56% 
IMG OID643566672 
Productthreonine synthase 
Protein accessionYP_002463167 
Protein GI219848734 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCACG CTTGGTTTGC CGCTCAAGAT GACCCCAACG AGCGTTATCC GCTCACAGAA 
GTAGTGTACT ACAGTCGATC AGGCGGGTTA TTGGAAGTGC AGCACGATAT GGCCGCACTA
GCTCAACGCA GTCCTGAAGA GTGGAAACGA TTATTCGATG AGCGCTGGAT GCGCACAACA
TGGCCGTATG GTAGTGGAGT ATGGGGCAAG AAAGAGTGGG TCTGCCCGAT TGTCGATAAC
GACAATGTCG TCAGTATGTT TGAAGGCGGG ACCAACCTCT TCTGGGCCGA ACGGTTTGGG
CGTGAGATCG GGCTAGAAGA TTTGTGGATC AAGATGTGTG GCAATTCGCA CACCGGCTCG
TTCAAAGATC TCGGAATGAC GGTGCTGGTG AGCGTTGTCA AGCAGATGAT CAGCGAGGGT
AAGCCGATAC GTGCGATTGC CTGTGCCTCA ACCGGTGATA CATCGGCGGC ATTGGCTGCA
TACGGTGCGG CTGCCGGGAT TCCCACCATT GTCTTTCTAC CCAAAGGCAA GGTGAGCATT
GCCCAATTAG TACAGCCGGT TGCCCACGGT GCGCTTGTGC TGGCACTCGA TACCGATTTC
GATGGCTGCA TGCGGATTGT CCGCGAAATC ACGGCTAATC CCGACAACGG CATCTATTTA
GCCAATTCGC TCAACTCGCT GCGGATCGAG GGGCAAAAGA CGGTTGGGAT CGAGATCGTG
CAGCAATTCG ATTGGGAAGT GCCGGACTGG ATTATTATTC CCGGCGGTAA TCTTGGCAAT
ACCTTTGCCC TCGGCAAAGG CCTGCTCATG ATGTACGAGT TAGGGTTGAT CAACCGCTTG
CCACGGATTG TGACTGCACA AGCAGCCAAC GCCAATCCAC TCTATCGCTC GTACCTGACC
GGTTTCCGCG AATATCAGCC GATCAAAGCC CAACCGACAG CGGCGAGTGC GATTCAGATC
GGCGATCCGG TCAGCATTAA TCGGGCGATC AGCATCCTGC GTCGCTTCAA CGGCATCGTC
GAACAGGCAA CCGAGCAGGA ACTCGCCGAT GCCGCCGCTC GTGCCGACCG CACCGGTGCA
TATGCCTGCC CCCATACCGG TGTCGCGTTG GCCGCACTGA TCAAATTGGT GCAACGTGGC
GAGATCAAGC GCAGTGATCG AGTCGTTGTC ATCTCGACCG CGCATGGGCT TAAGTTTAGC
CGTTTCAAAG TCGAGTACCA TGAAGGCACA CTGCGCGACG TAATCGGCCA GTACCGTAAC
CCGCCAATCG AACTGCCACC GGATATCGAT GCAGTACGCC GCGAAATCGA CCGCCGGTTC
GGATAG
 
Protein sequence
MYHAWFAAQD DPNERYPLTE VVYYSRSGGL LEVQHDMAAL AQRSPEEWKR LFDERWMRTT 
WPYGSGVWGK KEWVCPIVDN DNVVSMFEGG TNLFWAERFG REIGLEDLWI KMCGNSHTGS
FKDLGMTVLV SVVKQMISEG KPIRAIACAS TGDTSAALAA YGAAAGIPTI VFLPKGKVSI
AQLVQPVAHG ALVLALDTDF DGCMRIVREI TANPDNGIYL ANSLNSLRIE GQKTVGIEIV
QQFDWEVPDW IIIPGGNLGN TFALGKGLLM MYELGLINRL PRIVTAQAAN ANPLYRSYLT
GFREYQPIKA QPTAASAIQI GDPVSINRAI SILRRFNGIV EQATEQELAD AAARADRTGA
YACPHTGVAL AALIKLVQRG EIKRSDRVVV ISTAHGLKFS RFKVEYHEGT LRDVIGQYRN
PPIELPPDID AVRREIDRRF G