Gene Cagg_1126 details


Gene Information

Locus tag: Cagg_1126
Symbol:
ID: 7268580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1392055
End bp: 1393470
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 62%
IMG OID: 643565969
Product: Aromatic-L-amino-acid decarboxylase
Protein accession: YP_002462472
Protein GI: 219848039
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0076] Glutamate decarboxylase and related PLP-dependent proteins
TIGRFAM ID:
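
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(1392055, 1393470)
print(length)                     # 1416
print(protein_length_aa(length))  # 471
```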


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0279279
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCATC CTGACGAATT CCGCCGTATT GGCTACCAGA TCATCGACAT GATCGCCGAT 
TACCGTGCAA CCATCGCCAA CCGTCCGGTC TGGTCACAGT TGCGTCCCGG CGAGTTTCGT
AGTCAGTTGC CGGCCACCCC ACCTGAACAG CCGGAACCAC CGGAAGCGAT CCTCGCCGAT
GTGGAACGTC TGATCATCCC CGGTTTATCG AACTGGCAAC ACCCCCGCTT TTTCGGTTAT
TTTCCGGCCA ACGCTAGCCT CGCTTCACTG TTGGGCGATT TTCTCAGTGG TGGTCTCGGT
CAATTGGGTT TGAATTGGCA GGCTAGCCCA CCGTTAACCG AACTCGAAGA GCTGACAACC
GACTGGATGC GACAGTTGCT GGGCTTGAGC GAGGCGTGGC GCGGGGTAAT TCAGGATACG
GCAAGCACCA GTACGCTGGT GGCGTTGCTC TGTGCCCGTG AACGGGCCAG CGACCATAGC
CAGGTGCGCG GCGGCTTGCA GGCGCTGCCG CAGCCGCTGG TGGTCTATAC CTCAATCCAG
AGCCACAGTT CGGTGGAGAA GGCGGCGCTG TTAGCCGGTT TTGGCCGCGA TAACCTCCGC
CTGTTGCCGG TTGACGATAC CTTCGCCCTG CGCGTGGACA CACTCGCCGA TGCTATTGCT
ACCGACCGCG CCGCCGGTCG AGTACCGTGC GCGGTGGTGG CCAGTATCGG CGCAACGGCA
ACCACCGCCT GTGATCCGCT CGAACCGATT GGCGAACTGT GCCGGCGTGA GGGGATTTGG
CTGCACGTTG ATGCGGCAAT GGCCGGCTCG GCGATGATCT TGCCCGAATG TCGCTATCTC
TGGCAGGGGA TCGAACAGGC CGATAGCCTT GTCCTCAATC CGCACAAATG GCTGGGGGCG
GCGTTCGATT GCTCGCTTTA CTACGTGCGC GATCCGCAGC ATCTTATCAG AGTGATGTCA
ACCAACCCCA GTTATTTGCA AACCAGCGCC GACGGCGCTG TCACCAACTA TCGCGACTGG
GGCATTCCGT TGGGCCGGCG CTTCCGCGCG CTGAAGCTCT ACTTCTTGCT ACGCTGCGAA
GGGGCCGAGG GGTTGCGCAC CCGCCTGCGC CGCGACATCG CTAATGCTCG CTGGCTGGCT
GAGCAGATCG ACGCGACGCC GCACTGGCGG CGATTGGCGC CGGTACCGCT CCAGACAGTC
TGCGTGCGCC ACGAACCACC CGGTCTGACC GGTGAAGACC TTGATCGCCA TACCTTACGC
TGGGTAGGCG CGATTAATGC CAGCGGTGCA GCGTACCTGA CCCCTGCGAT GCTCGATGGC
CGTTGGATGG TGCGGATCAG CATTGGCGCC GAGCCAACCG AGCACACTGA TGTGGCGGCG
CTGTGGGCAT TGATGCAAGA GGTGGTACGA GGGTAG
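
A GC content figure like the one in the Gene Information section can be recomputed from a sequence printed in blocks of ten bases. This is a minimal sketch, assuming the percentage is taken over the gene sequence itself; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above; paste the full sequence to check the whole gene.
first_line = "ATGATGCATC CTGACGAATT CCGCCGTATT GGCTACCAGA TCATCGACAT GATCGCCGAT"
print(round(gc_percent(clean_sequence(first_line)), 1))
```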
 
Protein sequence
MMHPDEFRRI GYQIIDMIAD YRATIANRPV WSQLRPGEFR SQLPATPPEQ PEPPEAILAD 
VERLIIPGLS NWQHPRFFGY FPANASLASL LGDFLSGGLG QLGLNWQASP PLTELEELTT
DWMRQLLGLS EAWRGVIQDT ASTSTLVALL CARERASDHS QVRGGLQALP QPLVVYTSIQ
SHSSVEKAAL LAGFGRDNLR LLPVDDTFAL RVDTLADAIA TDRAAGRVPC AVVASIGATA
TTACDPLEPI GELCRREGIW LHVDAAMAGS AMILPECRYL WQGIEQADSL VLNPHKWLGA
AFDCSLYYVR DPQHLIRVMS TNPSYLQTSA DGAVTNYRDW GIPLGRRFRA LKLYFLLRCE
GAEGLRTRLR RDIANARWLA EQIDATPHWR RLAPVPLQTV CVRHEPPGLT GEDLDRHTLR
WVGAINASGA AYLTPAMLDG RWMVRISIGA EPTEHTDVAA LWALMQEVVR G
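
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns amino acids to codons in the same way as the standard code and differs only in which codons may act as start codons; this gene starts with ATG, so alternative starts are not handled. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, which encodes the first 20 residues.
first_line = "ATGATGCATC CTGACGAATT CCGCCGTATT GGCTACCAGA TCATCGACAT GATCGCCGAT"
print(translate(first_line))  # MMHPDEFRRIGYQIIDMIAD
```

The printed residues match the first two blocks of the protein sequence above.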