Gene Cagg_0117 details


Gene Information

Locus tag: Cagg_0117
Symbol:
ID: 7266855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 162911
End bp: 164218
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 58%
IMG OID: 643564989
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_002461505
Protein GI: 219847072
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
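
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields.
START_BP = 162911
END_BP = 164218
GENE_LENGTH_BP = 1308
PROTEIN_LENGTH_AA = 435


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1308
print(coding_codons(GENE_LENGTH_BP))   # 435
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.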


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.293801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0224314
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAGA TAACGTTAAC CGCACCCAAG CGTTTGCGCG GAGTTATTCA GGTACCGGGA 
GATAAATCGA TCTCACACCG ATCGGTGTTG CTGAACGCGA TTGCTACCGG CAGTGCTCAC
ATTACGAACT TTTTACCCGG TGCCGATTGT CTTTCCTCGG TAGCCTGTGT GCGAAGCCTA
GGCGTAACGG TCGAGCAGCC TCATGAGCGT GAATTGATTA TCCACGGTGT TGGTCTGGGT
GGATTACGTG AATCAACCGA TGTGCTCGAC TGTGGTAATT CCGGTACTAC GCTGCGTCTG
CTGGCCGGCA TACTGTCCGG TCAGCCGTTT TTTAGTGTCT TGAGCGGTGA TTCATCGTTG
CGTTCGCGTC CGCAGCGGCG GGTTGTTGGG CCACTGCGTG CAATGGGTGC GCAGATCGAT
GGGCGCGCCG ACGGCGACCG GGCACCGCTG GCAATTCGCG GTAGTACGCT ACGTGGTGGT
CAGTACGAAT TGACTATCGC GTCCGCCCAG GTGAAATCTG CTCTCTTGTT GGCTGCACTG
TATGCCGATG GCCCACTGAC GCTCGGTGGA CGGATCGATT CGCGCGATCA TACCGAGCGG
ATGCTTGCGG CAATGGGGGT GGAGATAACC GTATCGCCTG ACCGGATTAC CCTGCATCCG
CCGACAGCAG CAACTGCCCC GGTCGCTCTT TCCCTGCGGG TCCCCGGTGA TCCCTCCTCG
GCAGCGTTTT GGTGGGTAGC TGCTGCGATC CATCCCGATG CCGAACTTGT CACTCCTGGC
GTCTGTCTCA ACCCGACCCG TACCGGTGCC CTTGATGTGC TGCGGGCGAT GGGGGCTGAG
ATTGAAATAA TGAACGAGCG GTTGGAAGGG AGTGAGTTGG TCGGCGATGT CGTCGTCCGC
TCTTCGGTGT TGCGGGGGAC AACCATCGCC GGCTCTCTGA TCCCTCGTCT GATTGATGAA
ATTCCGGTGC TAGCCGTCGC TGCTGCCTGT GCCGATGGTG AAACGGTTAT TCGTGATGCG
CAAGAATTGC GCGCTAAAGA GACCGATCGG ATCACCACCG TGGCTGCCGG GCTGAGTGCG
TTGGGGGTTA CCGTCGAACC AACGATTGAT GGTATGGTGA TCACCGGTAA ACCCGATCAA
CTCACCGGTG CTACTTTGCA CAGCTATCAC GACCATCGCC TGGCAATGGC ATGGGCCGTT
GCCGCCCTTG TCGCTCGTGG TGAAACAACC ATTGTTGAAC CGGCAGCAGT GGTGATCAGC
TATCCCGATT TCTGGCAGAC TCTCGCCGCG ATCCAGGAGG ACGTATGA
 
Protein sequence
MTEITLTAPK RLRGVIQVPG DKSISHRSVL LNAIATGSAH ITNFLPGADC LSSVACVRSL 
GVTVEQPHER ELIIHGVGLG GLRESTDVLD CGNSGTTLRL LAGILSGQPF FSVLSGDSSL
RSRPQRRVVG PLRAMGAQID GRADGDRAPL AIRGSTLRGG QYELTIASAQ VKSALLLAAL
YADGPLTLGG RIDSRDHTER MLAAMGVEIT VSPDRITLHP PTAATAPVAL SLRVPGDPSS
AAFWWVAAAI HPDAELVTPG VCLNPTRTGA LDVLRAMGAE IEIMNERLEG SELVGDVVVR
SSVLRGTTIA GSLIPRLIDE IPVLAVAAAC ADGETVIRDA QELRAKETDR ITTVAAGLSA
LGVTVEPTID GMVITGKPDQ LTGATLHSYH DHRLAMAWAV AALVARGETT IVEPAAVVIS
YPDFWQTLAA IQEDV
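
The gene and protein sequences above are printed in space-separated blocks of ten. The following is a minimal sketch of how such blocks might be cleaned, translated, and checked for GC content. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names `clean`, `gc_content`, and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; table 11 uses the same
# codon-to-amino-acid mapping and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final two blocks of the gene sequence.
print(translate("ATGACGGAGA TAACGTTAAC CGCACCCAAG"))  # MTEITLTAPK
print(translate("ATCCAGGAGG ACGTATGA"))               # IQEDV
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the GC content field.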