Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0824 |
Symbol | hutG |
ID | 5137705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 838024 |
End bp | 839034 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640532282 |
Product | formimidoylglutamase |
Protein accession | YP_001216774 |
Protein GI | 147673474 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01227] formimidoylglutamase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.000103972 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCTA ATTTCACCAC TGAGCACACT TGGCAAGGCC GCCATGATCC CGAAGATGGT CAAGCCGGGC GTCGTGTGCA CCATATCGCT TGTCCCATTC AAGTGGGCGA GCTAGCGAAC CAAGAGCCGG GTGTGGCGTT GATCGGTTTT GAGTGTGATG CGGGTGTGGA ACGCAACAAA GGTCGCACCG GAGCCAAGCA CGCGCCCAGC CTTATTAAGC AAGCGCTAGC CAATCTCGCG TGGCACCATC CCATCCCCAT TTACGATTTG GGCAACATTC GTTGTGAGGG TGATGAATTA GAGCAAGCTC AGCAAGAATG CGCGCAAGTG ATTCAACAGG CTTTGCCTCA CGCGCGGGCC ATCGTGCTCG GCGGCGGACA CGAGATTGCT TGGGCAACCT TTCAGGGTTT AGCACAACAT TTTCTAGCAA CAGGCGTAAA GCAACCACGG ATCGGCATCA TCAATTTTGA TGCGCATTTT GACTTACGAA CATTTGAATC AGAATTAGCA CCAGTGCGCC CAAGCTCAGG CACGCCGTTT AATCAAATCC ACCATTTTTG CCAGCAGCAA GGTTGGGATT TTCATTACGC CTGCTTGGGA GTGAGCCGCG CCAGCAACAC GCCCGCACTG TTTGAACGCG CAGATAAGCT AGGGGTTTGG TATGTTGAAG ATAAAGCCTT TTCGCCTTTG TCACTCAAGG ATCACCTGAC TCAATTACAA CACTTTATTG ATGATTGTGA TTACCTCTAT CTCACCATTG ATCTGGACGT GTTTCCGGCG GCCAGTGCGC CCGGCGTCAG TGCGCCTGCC GCGCGCGGTG TGAGCCTAGA AGCGCTTGCC CCCTATTTCG ACCGAATTCT TCATTACAAA AACAAACTGA TGATTGCCGA TATCGCCGAA TACAACCCAA GTTTCGATAT TGATCAGCAC ACCGCGCGCT TAGCCGCTCG TTTGTGTTGG GACATTGCTA ACGCCATGGC CGAACAAGTG CAATCCATCC GTCACCCGTG A
|
Protein sequence | MNPNFTTEHT WQGRHDPEDG QAGRRVHHIA CPIQVGELAN QEPGVALIGF ECDAGVERNK GRTGAKHAPS LIKQALANLA WHHPIPIYDL GNIRCEGDEL EQAQQECAQV IQQALPHARA IVLGGGHEIA WATFQGLAQH FLATGVKQPR IGIINFDAHF DLRTFESELA PVRPSSGTPF NQIHHFCQQQ GWDFHYACLG VSRASNTPAL FERADKLGVW YVEDKAFSPL SLKDHLTQLQ HFIDDCDYLY LTIDLDVFPA ASAPGVSAPA ARGVSLEALA PYFDRILHYK NKLMIADIAE YNPSFDIDQH TARLAARLCW DIANAMAEQV QSIRHP
|
| |