Gene Information |
Locus tag | VC0395_A0825 |
Symbol | hutI |
ID | 5137886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 839024 |
End bp | 840232 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640532283 |
Product | imidazolonepropionase |
Protein accession | YP_001216775 |
Protein GI | 147674086 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0000174389 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTGTG TGTTGACCGA AGCGCGTTTA GTAACCATGC AACCGGGCGT GCAAGGTTAT CAGATAACCG AGCCGCAAAC CCTCATCATT GAGCAGGGGC GCATTCAACA CATTGGCCAA CATATTGACC TTCCCAGCGA TGCGCATCCT ATCTCGTGTG CCGGCAAATT AGTCACACCG GGATTGATTG ATTGCCACAC CCATTTGGTT TATGCCGGCA GCCGCGCCAA CGAATTCGAG CTGCGTTTAC AAGGTGTGCC CTACCAAACC ATCGCCGCTC AAGGTGGCGG CATTCTTTCT ACCGTGAATG CCACTCGTAA GGCCAGTGAA GAAGCGTTGA TCGAACTCGC GCTACCGCGT TTGGATGGCC TGCTGCGTAG CGGTGTCACT TCCGTTGAAG TGAAATCTGG CTACGGTTTA ACGCTCAAGG ATGAGCTGAA AATGCTGCGC GCCGCCAAAG CGCTTGAGCA GCATCGCCGC GTCAAAATCA CCACCACATT ACTTGCCGCG CACGCACTAC CTCCTGAATT TCAAGGCCGT AGTGATGACT ATATCGCGCA TATTTGCCAA GAGATCATTC CTCGGGTTGC CGAAGAGCAA CTGGCCACCA GCGTAGATGT GTTTTGTGAG TCGATTGGCT TTAGCGTGGC GCAAACTGAA CGCGTGTTTC ATGCTGCGCA AGCGCATGGC TTGCAAATCA AAGGTCATAC CGAGCAGCTT TCCAACTTAG GCGGCAGCGC ACTCACCGCT CGCATGGGTG GGCTCTCCGT TGACCATATC GAATATCTTG ATGAAGCGGG CGTGAAAGCA TTGGCGCAAT CGAGCACGGT TGCCACCTTA CTTCCCGGTG CATTCTACTT TTTGCGCGAA ACCCAAAAGC CGCCGATTGA ATGGCTGCGC CAATATCGCG TACCCATGGC CATCTCTACC GATTTAAACC CCGGCACCTC ACCATTTGCC GATTTGTCCC TGATGATGAA TATGGGCTGT ACCTTGTTTG ACCTAACGCC AGAAGAGACT CTACGCGCAG TGACATGCCA TGCAGCGCAA GCCTTGGGTT ACCCCGCAAA CCGTGGACAA ATCGCAGAAG GCTACGATGC TGATTTGGCG ATTTGGAATA TCGAACATCC TGCCGAGCTG AGCTATCAAG TCGGCGTTTC TCGCCTGCAT GCTCGCATAG TTAATGGAGA GCTGAGCTAT GAATCCTAA
|
Protein sequence | MNCVLTEARL VTMQPGVQGY QITEPQTLII EQGRIQHIGQ HIDLPSDAHP ISCAGKLVTP GLIDCHTHLV YAGSRANEFE LRLQGVPYQT IAAQGGGILS TVNATRKASE EALIELALPR LDGLLRSGVT SVEVKSGYGL TLKDELKMLR AAKALEQHRR VKITTTLLAA HALPPEFQGR SDDYIAHICQ EIIPRVAEEQ LATSVDVFCE SIGFSVAQTE RVFHAAQAHG LQIKGHTEQL SNLGGSALTA RMGGLSVDHI EYLDEAGVKA LAQSSTVATL LPGAFYFLRE TQKPPIEWLR QYRVPMAIST DLNPGTSPFA DLSLMMNMGC TLFDLTPEET LRAVTCHAAQ ALGYPANRGQ IAEGYDADLA IWNIEHPAEL SYQVGVSRLH ARIVNGELSY ES
|
| |
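The length fields in this record are consistent with one another, and the check can be sketched in a few lines of Python. This is a minimal illustration, not part of the source database: it assumes that Start bp and End bp are 1-based inclusive coordinates (which the reported 1209 bp agrees with) and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names `gene_length`, `protein_length`, and `strip_groups` are hypothetical helpers introduced only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def strip_groups(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display groups."""
    return "".join(seq.split())


if __name__ == "__main__":
    # Values copied from the record above.
    length = gene_length(839024, 840232)
    print(length, protein_length(length))
```

With the coordinates listed above, `gene_length` gives 1209 and `protein_length` gives 402, matching the Gene Length and Protein Length fields. `strip_groups` can be applied to the Gene sequence or Protein sequence value before any further processing, since both are displayed in space-separated blocks of ten characters.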