Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_3645 |
Symbol | |
ID | 4692814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 4029505 |
End bp | 4031067 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639851400 |
Product | histidine ammonia-lyase |
Protein accession | YP_998379 |
Protein GI | 121610572 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAT CCAACACTTC CCCACTGCTG CTGCAACCCG GCCATGTCAC CCTGGCCGGG CTGCGCCGCA TCCATGCCGG CCCGGTACGA CTGGCGCTGG ACGCACCGGC CCGGGCGGCC ATGCAGGCCG CGCAGGCGGC GGTGCAGCGC ATCGTGGCGG CAGACCGGGT GGTCTATGGC ATCAACACCG GTTTTGGCAA GCTGGCCAGC ACCAGGATTG CTGCCGAGCA CCTGACCGAG TTGCAGCGCC GCCTGGTGCT GTCGCACAGC GCGGGCACCG GGCCGGCGCT GCCCGACGCG GTGGTGCGCC TGGTGCTGGC CACCAAGGCC GTGGGCCTGG CGCGCGGCCA CTCCGGCATC CGCCCCGAGA TCGTCGATGC GCTGCTGGCG CTGGCCAGTG CCGAGGTGCT GCCGGTGATT CCGGCCAAAG GCTCGGTCGG GGCCTCGGGG GATCTGGCGC CGCTGGCGCA TCTGGCCTGC GTGCTGATCG GGCAGGGGCA GGCCCAATGC AACGGCACGC TGGTGCCGGG CGCCGAGGCG ATGCGCGCCA TCGGTTGCCA GCCCTTCGTG CTCGGCCCGA AGGAGGGGCT GGCGCTGCTC AACGGCACGC AGGTGTCGAC CGCGCTGGCG CTGGCGGGCC TGTTCGGCGC CGAGAACCTG CTGGCCGCTG CGCTGGTGGC CGGCGCGCTA TCGCTGGAGG CCATCAAAGG CTCGGTCTGG CCGCTGGATG CGCGCATCCA TGAAGCCCGC GGCCAAGCCG GGCAAATCGC CGTGGCCGCC GCGTTGCGCG CGCTGCTCGA AGGCAGCGCC ATCGCCGCTT CGCACCCGCA CTGTGGCCGC GTGCAAGACC CCTACTCGAT CCGCTGTATG CCGCAGGTGC TGGGCGCTTG CCTGGACAAC CTGCACCACG CGGCGCGCGT GCTCGTCATC GAAGCCAATG CGGCATCGGA CAACCCGCTG GTCTTTGCCG CAGCGCAGGG CGGCCCGGGT GCGGACGAGG TGATCTCCGG CGGCAACTTC CACGCCGAGC CGGTGGCCTT TGCTGCCGAC ATCATGGCGC TGGCGGTGGC CGAGATCGGC GCCATGTCCG AACGGCGCCT GGCGCTGCTG CTCGACACCG GCCTGTCGGC ACTGCCGGCC TTTCTGGTGC GCGACAGCGG CATGAACTCG GGCTTCATGA TGGCGCAGGT CACCGCCGCT GCGCTGGCGA GCGAGAACAA ATCGCTGGCC CACCCGGCCA GTGTCGACAG CCTGCCCACA TCGGCGAACC AGGAAGACCA TGTGTCCATG GCCACCTTCG GCGCGCGCCG CCTGGCCGAG ATGATCGACA ACACGGCGAC GGTGGTGGGC ATCGAGGCCA TGGCGGCGGC GCAGGGCATG GAGTTCGATC GCAGCCTGCG CTCGACGCCG CTGCTCGAAG GGCAGTGGGC CGCGATTCGC GAGCGCGTGG CCTTTCTGGA GCAAGACCGC TGCCTGGCGC CCGATATAGC CGCCATGCGG CTGTGGGCGC AGCAATCCGG GTGGCCCGCG CCGCTGTTGC AGTGCCTGCC CAGCCATGCC TGA
|
Protein sequence | MTESNTSPLL LQPGHVTLAG LRRIHAGPVR LALDAPARAA MQAAQAAVQR IVAADRVVYG INTGFGKLAS TRIAAEHLTE LQRRLVLSHS AGTGPALPDA VVRLVLATKA VGLARGHSGI RPEIVDALLA LASAEVLPVI PAKGSVGASG DLAPLAHLAC VLIGQGQAQC NGTLVPGAEA MRAIGCQPFV LGPKEGLALL NGTQVSTALA LAGLFGAENL LAAALVAGAL SLEAIKGSVW PLDARIHEAR GQAGQIAVAA ALRALLEGSA IAASHPHCGR VQDPYSIRCM PQVLGACLDN LHHAARVLVI EANAASDNPL VFAAAQGGPG ADEVISGGNF HAEPVAFAAD IMALAVAEIG AMSERRLALL LDTGLSALPA FLVRDSGMNS GFMMAQVTAA ALASENKSLA HPASVDSLPT SANQEDHVSM ATFGARRLAE MIDNTATVVG IEAMAAAQGM EFDRSLRSTP LLEGQWAAIR ERVAFLEQDR CLAPDIAAMR LWAQQSGWPA PLLQCLPSHA
|
| |