Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1297 |
Symbol | |
ID | 5898752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1369372 |
End bp | 1370907 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641561782 |
Product | histidine ammonia-lyase |
Protein accession | YP_001682925 |
Protein GI | 167645262 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0927091 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGAAG TCGTTCTCAA TCCCGGTGAA GTCAGCCTGT CGGACTGGAA AGCCGTCTAT CGCGGCGCGC CGGCGCGCCT GAACGAAAGC GCCGGCCCGG TGATCGCGCA GAGCGCCGCC GCCGTTGAAC GCATCCTGGC CAAGGGCGCG CCGGTCTATG GGATCAATAC GGGCTTTGGG AAACTGGCCA GCGTCCGCAT CGGCGACGCG GATCTCGAGA CCCTGCAGCG CAATATCGTG CTGTCCCACG CCGCGGGGAC GGGCGCGCCG TCGCCGGTCG CGGTGATCCG GCTGATGATG GCCTTGAAGC TGGCCAGCCT GGCCCAGGGC GCCTCGGGCG TGCGACCGGC GACCACCGAC CTGCTCGAAG CCATGATCGT GAAGGGCCTG ACGCCGGTCG TGCCCTGCCA GGGCTCGGTC GGCGCCTCGG GCGACCTGGC GCCGCTGGCC CACATGGCCG CGACGATGAT CGGGGTCGGC GAGATCTTCG TTGAAGGCCA GCGCCTGCCG GCCGTCCAGG CCCTGATGGA AGCGGGCCTC AAACCCCTGA CCCTGGGTCC CAAGGAGGGC CTGGCCCTGC TGAACGGCAC CCAGTTCTCG ACCGCCAACG CCCTGGCCGC CCTGTTCGAC GCCGAGCGCC TGTTCCAGTC GGCCCTGGTC ACCGGCGCCC TGGCCACCGA GGCGGCCAAG GGCTCGGACA CCCCGTTCGA CCCGCGCATC CACACCCTGC GCCGCCAGCC CGGCCAGATC GAGACCGCCG CCGCCCTGCG CGCCCTGATG GCCGGCTCGG CGATCCGCGA CTCGCACCGC GAGGGCGACA CGCGGGTTCA GGACCCCTAC TGCCTGCGCT GCCAGCCGCA GGTGATGGGC GCGGCGCTGG ACATCCTGCG CCAGGCCGCC GTCACGCTCT CCACCGAGGC CAACGGCGTC TCCGACAATC CGTTGATCTT TCCCGACACC GACGAGGCCC TGTCGGGCGG CAACTTCCAC GCCGAGCCGG TGGCCTTCGC CGCCGACATC ATCGCCCTGG CCGTCTGCGA GATCGGCTCG ATCGCCGAGC GCCGCATCGC CATGCTGGTC GACCCCGCCT GCTCGGGCCT GCCGGCCTTC CTGACGCCGA AGCCCGGCCT GAACTCGGGC TTCATGATCC CCCAGGTCAC CGCCGCCGCC CTGGTGTCGG AGAACAAGCA GAAGGCCTAT CCAGCCAGCG TCGACTCCAT CCCGACCTCG GCCAACCAGG AGGACCACGT CTCGATGGCC GCCCACGGCG CGCGGCGCCT GCTGGCCATG GTCGAGGCCG CCGAGGCGGT GATCGGCATC GAGCTGCTGG CTGCGGTGCA GGGCTGCGAC TTCCACGCGC CGCTGGCGTC GAGTCCCGCG CTGGAGAGCG TGCGCGGCCT GTTGCGGGCT CAGGTTCCGC ACCTGTCGGA CGACCGGCAC TTCCATCCCG ACATGGAGGC AGCCAACGCC TTGGTGCGGT CGGGCGCGGT CGTGGCGGCC GCGTCGAGCG TTGAATTGCC GGGGGTGGAA GGATGA
|
Protein sequence | MIEVVLNPGE VSLSDWKAVY RGAPARLNES AGPVIAQSAA AVERILAKGA PVYGINTGFG KLASVRIGDA DLETLQRNIV LSHAAGTGAP SPVAVIRLMM ALKLASLAQG ASGVRPATTD LLEAMIVKGL TPVVPCQGSV GASGDLAPLA HMAATMIGVG EIFVEGQRLP AVQALMEAGL KPLTLGPKEG LALLNGTQFS TANALAALFD AERLFQSALV TGALATEAAK GSDTPFDPRI HTLRRQPGQI ETAAALRALM AGSAIRDSHR EGDTRVQDPY CLRCQPQVMG AALDILRQAA VTLSTEANGV SDNPLIFPDT DEALSGGNFH AEPVAFAADI IALAVCEIGS IAERRIAMLV DPACSGLPAF LTPKPGLNSG FMIPQVTAAA LVSENKQKAY PASVDSIPTS ANQEDHVSMA AHGARRLLAM VEAAEAVIGI ELLAAVQGCD FHAPLASSPA LESVRGLLRA QVPHLSDDRH FHPDMEAANA LVRSGAVVAA ASSVELPGVE G
|
| |