Gene Information |
Locus tag | VC0395_A0822 |
Symbol | hutH |
ID | 5135766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 834761 |
End bp | 836296 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640532280 |
Product | histidine ammonia-lyase |
Protein accession | YP_001216772 |
Protein GI | 147674751 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
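The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(834761, 836296)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1536 511
```

The strand does not affect this check, since the length depends only on the span between the two coordinates.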
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000000107913 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGCACC TGATGATCAA ACCCGGCCAA CTCAGCTTAA AACAGTTGCG TCAAGTGAGC CGCTCACCCG TGGTGTTATC CCTCGATCCA GAAGCGATTC CGGCGATTGC CGAAAGCGCC CAAGTCGTCG AACAAGTGAT CAGCGAAGGA CGTACCGTGT ACGGCATCAA TACTGGTTTT GGTTTGCTCG CCAACACCAA AATTGCCCCG CAAGATCTGG AAACGCTGCA AAAGAGCATC GTGCTTTCTC ATGCTGCGGG CATCGGTGAG CTGATGTCGG ATGAAACCGT TCGTTTGATG ATGCTACTTA AAATCAACAG CTTGGCGCGT GGCTATTCTG GTATCCGTCT TGAAGTCATC CAAGCTTTGA TCGAGCTCGT CAATAACCAG ATTTATCCTT GCGTACCGAA AAAAGGCTCG GTTGGCGCAT CGGGTGATCT TGCGCCGCTG GCGCACATGA GCACAGTGTT GCTCGGCGAA GGCCAAGCCC GTTACAACGG CAAAATCATT TCCGGTCTGG AAGCGATGAA AATTGCGGGA CTAGAGCCAA TTACCCTCGC CCCTAAAGAA GGGCTCGCGC TACTCAATGG CACTCAAGCC TCGACCGCAT TTGCGCTCGA AGGATTGTTT GTGGCTGAAG ATCTGTTTGC ATCCGCCACT GTGTGCGGTG CGATGTCGGT CGAAGCCGCT CTGGGAAGCC GTCGCCCCTT CGATCCGCGT ATCCACCGCG TGCGTGGGCA CCGAACCCAA ATGGATGCTG CAACGGCGTA TCGTCACCTG CTCGATGTCA GCAGCGAAAT TGGCCAATCC CACAGCAATT GTGAAAAAGT GCAAGATCCT TACTCTCTGC GCTGCCAACC ACAAGTGATG GGCGCTTGCT TGCAGCAAAT TCGAAGTGCA GCTGAGGTGT TGGAAGTCGA AGCCAACTCT GTTTCTGATA ACCCACTCGT GTTTGCCGAG GATGGCGACA TCATCTCAGG CGGCAACTTC CATGCTGAAC CTGTCGCCAT GGCTGCGGAT AATTTGGCGC TGGCGATTGC TGAAATCGGC AGCCTCTCCG AGCGCCGCAT GGCACTGCTG ATTGACAGTG CGCTGAGCAA ACTACCGCCC TTTTTGGTCG ACAATGGTGG GGTGAACTCC GGCTTTATGA TTGCGCAAGT CACGGCAGCT GCCTTAGCCA GTGAGAACAA AACCCTCGCG CATCCTGCAT CAGTCGACAG TTTACCCACT TCAGCCAACC AAGAAGATCA CGTTTCCATG GCGACGTTTG CCGCACGCAG ACTGCGTGAC ATGGGCGAAA ATACTCGTGG TATTTTGGCG GTGGAATACC TTGCAGCAGC ACAAGGATTG GATTTTCGTG CACCATTGAA GTCCTCACCA CGCATTGAGG AAGCAAGGCA GATACTGCGT GAAAAAGTAC CGTTTTACGA TAAAGACCGC TATTTTGCGC CGGATATCGA AAAAGCCAAT GCTCTGCTGC AACTTGCCGT ACACAACCGT TTAATGCCCG ATCAGCTGCT ACCAAGCCAG CACTAA
|
Protein sequence | MLHLMIKPGQ LSLKQLRQVS RSPVVLSLDP EAIPAIAESA QVVEQVISEG RTVYGINTGF GLLANTKIAP QDLETLQKSI VLSHAAGIGE LMSDETVRLM MLLKINSLAR GYSGIRLEVI QALIELVNNQ IYPCVPKKGS VGASGDLAPL AHMSTVLLGE GQARYNGKII SGLEAMKIAG LEPITLAPKE GLALLNGTQA STAFALEGLF VAEDLFASAT VCGAMSVEAA LGSRRPFDPR IHRVRGHRTQ MDAATAYRHL LDVSSEIGQS HSNCEKVQDP YSLRCQPQVM GACLQQIRSA AEVLEVEANS VSDNPLVFAE DGDIISGGNF HAEPVAMAAD NLALAIAEIG SLSERRMALL IDSALSKLPP FLVDNGGVNS GFMIAQVTAA ALASENKTLA HPASVDSLPT SANQEDHVSM ATFAARRLRD MGENTRGILA VEYLAAAQGL DFRAPLKSSP RIEEARQILR EKVPFYDKDR YFAPDIEKAN ALLQLAVHNR LMPDQLLPSQ H