Gene Cagg_1658 details

Gene Information

Locus tag: Cagg_1658
Symbol:
ID: 7268960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2022859
End bp: 2024439
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 56%
IMG OID: 643566500
Product: histidine ammonia-lyase
Protein accession: YP_002462995
Protein GI: 219848562
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2986] Histidine ammonia-lyase
TIGRFAM ID: [TIGR01225] histidine ammonia-lyase
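
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2022859, 2024439)
    print(length, "bp")                   # gene length from the coordinates
    print(protein_length(length), "aa")   # codons minus the stop codon
```

With the values listed above, the coordinates give 1581 bp, which is 527 codons, or 526 residues once the stop codon is excluded.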


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.180825
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGGTG ACGAAGTGCT TCTTGATGGG GCAAGCCTTA CTATCGAGCA GGTTTTGGCC 
GTAGCCTATG GTCAACCCGG TAACCCGGCG GTACGCCTGA CCCCGGTAGC GCGCGAGCGG
GTAACCCGCG CAGCCCAAGC TATTCAAGAT TTACTCGCTC GTGGCGTGGT CGCCTACGGG
ATTACTACCG GTTTTGGGGC ATTCAAAGAT CGGGTGATTG CGTCCGAACA AGTCGAACAA
TTGCAGTACA ACATTCTGGT CAGCCATGCT GTAGGCGTGG GGCCGGTCTT CGATCTGCCT
ACGACGCGGG CCATTATGCT CATCCGTGCC AATACTCTTG CCCGTGGTCA TTCGGGTGTG
CGCCTGGAAA CGCTCGAACG GCTGATCGAT ATGCTCAACT ACGGTATTCA TCCGCGCATC
CCTAGTAAAG GTTCGCTGGG GGCGAGCGGT GATCTCGCGC CACTCGCCCA TATGGCGTTA
CCGATGCTCG GCTTGGGAGA GGTCGAATGG CATGGAGAGG TGATGCCGGC AACCGTCGTA
TTGCAACGGT TAGGCTGGCA ACCGCTCCAC TTGGCGGCAA AAGAGGGTTT GGCACTCACG
AACGGAACGG CAGTCATGTG TGCGCTGGGC GTGCTCGAAA CAGCACGCGC CGAGTTGTTG
AGTGCGACCG CCGATATAGC CGGTTGTCTG AGCCTTGAGG CTCTTCACGG TACACCGGCA
GCGTTCGATC CGCGACTCCA TGAGCTACGT CCCTTTCCGC GGCAGATCGA GTGCGCCGCT
CATCTGCGCG ACTTACTGGC CGGTAGTGAG TTTGTGCGCA CGAACGATCC TCGTCACGTC
CAAGATGCGT ACACGTTACG CTGTATTCCC CAAGTCCATG GTGCTGTCCG TGACGCGATT
GCGTATGCAC GATGGGTATT CTCCATCGAA CTCAATGCCG TGACCGATAA TCCACTGATT
TTTGTCGATG ATGATGGTAG GGTTGAGGTA ATCTCCGGTG GAAACTTTCA CGGTGAACCA
CTCGCGATTG CATTAGATTA CCTCGGTTTA GCCGTTGCCG AATTGGGTAA CATCGCTGAG
CGACGTTTAA TGCGCCTAAC TGACGAAGCT TCCAACACGC ACGTCTTACC GGCGTTTCTC
ACCCATGACG GTGGTCTCAA CTCAGGATTT ATGATTGTCC AATATACCGC TGCTGCCTTA
GCCACCGAAA ATAAGGTGCT CGCCCATCCG GCCAGCGTTG ATAGTATTCC GACCTCGGCT
AACGTCGAGG ATCACGTGAG TATGGGTCTA ACCGCCGGCC TTAAATTACG TTCGATCCTC
GATAATGTCG CTCAGATCTT GGCGCTGGAG CTATTTGCCG CCGCACAAGG CATCGATTTT
CGCCGCCAAG CCTTGGGCGC AGCAGCACGA CTTGGTCGCG GCACCGGCCC GGTGTATGAG
TTGATCCGTC AACACATCCC GTTTATCGCC GAAGATACGC TACTGCATCC CTACATCATC
ACAATGAGCG AATTGGTAGC GAAGGGTAAG ATCGTCGCAG CAGCACAGAT GTATGGAATG
AGGGCTGGTG GTGGATGTTA A
 
Protein sequence
MSGDEVLLDG ASLTIEQVLA VAYGQPGNPA VRLTPVARER VTRAAQAIQD LLARGVVAYG 
ITTGFGAFKD RVIASEQVEQ LQYNILVSHA VGVGPVFDLP TTRAIMLIRA NTLARGHSGV
RLETLERLID MLNYGIHPRI PSKGSLGASG DLAPLAHMAL PMLGLGEVEW HGEVMPATVV
LQRLGWQPLH LAAKEGLALT NGTAVMCALG VLETARAELL SATADIAGCL SLEALHGTPA
AFDPRLHELR PFPRQIECAA HLRDLLAGSE FVRTNDPRHV QDAYTLRCIP QVHGAVRDAI
AYARWVFSIE LNAVTDNPLI FVDDDGRVEV ISGGNFHGEP LAIALDYLGL AVAELGNIAE
RRLMRLTDEA SNTHVLPAFL THDGGLNSGF MIVQYTAAAL ATENKVLAHP ASVDSIPTSA
NVEDHVSMGL TAGLKLRSIL DNVAQILALE LFAAAQGIDF RRQALGAAAR LGRGTGPVYE
LIRQHIPFIA EDTLLHPYII TMSELVAKGK IVAAAQMYGM RAGGGC
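
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they are used. The following is a minimal sketch, not a replacement for a dedicated library, of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the amino-acid assignments of translation table 11, which match the standard code; alternative start codons are not handled (this gene starts with ATG). All function names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the same amino-acid
# assignments as the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as printed in the record.
    print(translate("ATGTCTGGTG ACGAAGTGCT TCTTGATGGG"))
```

Applied to the first 30 bases of the gene sequence, this gives the first ten residues of the protein sequence listed above.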