Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1658 |
Symbol | |
ID | 7268960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 2022859 |
End bp | 2024439 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643566500 |
Product | histidine ammonia-lyase |
Protein accession | YP_002462995 |
Protein GI | 219848562 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.180825 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGGTG ACGAAGTGCT TCTTGATGGG GCAAGCCTTA CTATCGAGCA GGTTTTGGCC GTAGCCTATG GTCAACCCGG TAACCCGGCG GTACGCCTGA CCCCGGTAGC GCGCGAGCGG GTAACCCGCG CAGCCCAAGC TATTCAAGAT TTACTCGCTC GTGGCGTGGT CGCCTACGGG ATTACTACCG GTTTTGGGGC ATTCAAAGAT CGGGTGATTG CGTCCGAACA AGTCGAACAA TTGCAGTACA ACATTCTGGT CAGCCATGCT GTAGGCGTGG GGCCGGTCTT CGATCTGCCT ACGACGCGGG CCATTATGCT CATCCGTGCC AATACTCTTG CCCGTGGTCA TTCGGGTGTG CGCCTGGAAA CGCTCGAACG GCTGATCGAT ATGCTCAACT ACGGTATTCA TCCGCGCATC CCTAGTAAAG GTTCGCTGGG GGCGAGCGGT GATCTCGCGC CACTCGCCCA TATGGCGTTA CCGATGCTCG GCTTGGGAGA GGTCGAATGG CATGGAGAGG TGATGCCGGC AACCGTCGTA TTGCAACGGT TAGGCTGGCA ACCGCTCCAC TTGGCGGCAA AAGAGGGTTT GGCACTCACG AACGGAACGG CAGTCATGTG TGCGCTGGGC GTGCTCGAAA CAGCACGCGC CGAGTTGTTG AGTGCGACCG CCGATATAGC CGGTTGTCTG AGCCTTGAGG CTCTTCACGG TACACCGGCA GCGTTCGATC CGCGACTCCA TGAGCTACGT CCCTTTCCGC GGCAGATCGA GTGCGCCGCT CATCTGCGCG ACTTACTGGC CGGTAGTGAG TTTGTGCGCA CGAACGATCC TCGTCACGTC CAAGATGCGT ACACGTTACG CTGTATTCCC CAAGTCCATG GTGCTGTCCG TGACGCGATT GCGTATGCAC GATGGGTATT CTCCATCGAA CTCAATGCCG TGACCGATAA TCCACTGATT TTTGTCGATG ATGATGGTAG GGTTGAGGTA ATCTCCGGTG GAAACTTTCA CGGTGAACCA CTCGCGATTG CATTAGATTA CCTCGGTTTA GCCGTTGCCG AATTGGGTAA CATCGCTGAG CGACGTTTAA TGCGCCTAAC TGACGAAGCT TCCAACACGC ACGTCTTACC GGCGTTTCTC ACCCATGACG GTGGTCTCAA CTCAGGATTT ATGATTGTCC AATATACCGC TGCTGCCTTA GCCACCGAAA ATAAGGTGCT CGCCCATCCG GCCAGCGTTG ATAGTATTCC GACCTCGGCT AACGTCGAGG ATCACGTGAG TATGGGTCTA ACCGCCGGCC TTAAATTACG TTCGATCCTC GATAATGTCG CTCAGATCTT GGCGCTGGAG CTATTTGCCG CCGCACAAGG CATCGATTTT CGCCGCCAAG CCTTGGGCGC AGCAGCACGA CTTGGTCGCG GCACCGGCCC GGTGTATGAG TTGATCCGTC AACACATCCC GTTTATCGCC GAAGATACGC TACTGCATCC CTACATCATC ACAATGAGCG AATTGGTAGC GAAGGGTAAG ATCGTCGCAG CAGCACAGAT GTATGGAATG AGGGCTGGTG GTGGATGTTA A
|
Protein sequence | MSGDEVLLDG ASLTIEQVLA VAYGQPGNPA VRLTPVARER VTRAAQAIQD LLARGVVAYG ITTGFGAFKD RVIASEQVEQ LQYNILVSHA VGVGPVFDLP TTRAIMLIRA NTLARGHSGV RLETLERLID MLNYGIHPRI PSKGSLGASG DLAPLAHMAL PMLGLGEVEW HGEVMPATVV LQRLGWQPLH LAAKEGLALT NGTAVMCALG VLETARAELL SATADIAGCL SLEALHGTPA AFDPRLHELR PFPRQIECAA HLRDLLAGSE FVRTNDPRHV QDAYTLRCIP QVHGAVRDAI AYARWVFSIE LNAVTDNPLI FVDDDGRVEV ISGGNFHGEP LAIALDYLGL AVAELGNIAE RRLMRLTDEA SNTHVLPAFL THDGGLNSGF MIVQYTAAAL ATENKVLAHP ASVDSIPTSA NVEDHVSMGL TAGLKLRSIL DNVAQILALE LFAAAQGIDF RRQALGAAAR LGRGTGPVYE LIRQHIPFIA EDTLLHPYII TMSELVAKGK IVAAAQMYGM RAGGGC
|
| |