Gene Information |
Locus tag | VC0395_A2468 |
Symbol | purK |
ID | 5137360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2619871 |
End bp | 2621004 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640533919 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001218361 |
Protein GI | 147674298 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
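The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed gene length), and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 2619871
end_bp = 2621004

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1134
print(protein_length)  # 377
```

Both results agree with the Gene Length (1134 bp) and Protein Length (377 aa) fields in the record.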
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.599362 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGTGT TGGTTCTCGG CGCTGGTCAG CTGGCGCGCA TGATGTCGCT CGCCGGAGCA CCGCTCAATA TTGAAACGAT CGCTTTTGAT GTGGGTAGCG AAAACATTGT GCACCCCTTA ACGCAAACTG TGCTTGGGCA TGGATTGGAG CAAGCGATTG AACAAGTCGA TGTGATCACC GCTGAGTTTG AACACATTCC GCATCCGATC CTCGATCTCT GTGCACGCAG TGGCAAACTT TACCCAAGCG CTGAAGCTAT CAAAGCTGGC GGCGATCGTC GTTTAGAAAA AGCCTTGCTG GATCGCGCCC AAGTGGCGAA TGCACGTTAT ACGATGATCC GCAGCCGAGA CGACCTAACC TCAGCCATCG CCGAGATTGG ATTGCCTATG GTGCTGAAAA GTGCACTCGG AGGCTACGAT GGAAAAGGCC AATGGCGCTT GAAAGAACCA ACGCAGATCG AATCGGTTTG GCAAGAACTT GCGCAATATC TGGCAGCCAA CCCCGAACAA GCAATTGTGG CAGAAGAATT TGTCGCTTTT GATCGCGAAG TGTCACTGGT CGGTGCACGT AACCTAGTCG GCGATGTTGT GGTGTATCCT TTAGCGGAGA ACGTTCATAC CCAAGGTGTG TTGAGCCTTT CTACCGCCAT TGATGCTCCT GCGCTACAAA CTCAAGCGAA AGCCATGTTT AAAGCGGTAG CCGAGCAGCT CAATTATGTC GGTGTATTAG CGCTGGAGTT TTTTGAAGTA CAAGGCCAGT TACTGGTCAA TGAAATTGCA CCACGAGTTC ATAACTCCGG TCACTGGACT CAGCAAGGTG CGGAAACCTG TCAGTTCGAA AACCACTTAC GCGCAGTGTG TGGCTTACCG CTGGGTAGCA CCAAACTGGT TCGTGAGACC GCGATGATTA ATATTCTTGG TGAAGATCAG CTGCCCGCAG AAGTATTGGC ACTGGAAGGC TGCCACGTAC ATTGGTACGG CAAGGCCAAG CGCTCAGGAC GCAAGATGGG GCATATCAAT GTGACCGCCG ATTACAGTGG TGAGTTGCAA CGCAAATTAT GCCAATTAGC GACTGTGTTA GATGAAAAGG CTTTTCCTGC CGTACACGCC GTAGCAAAGG AAATTCAGCC TTAA
|
Protein sequence | MRVLVLGAGQ LARMMSLAGA PLNIETIAFD VGSENIVHPL TQTVLGHGLE QAIEQVDVIT AEFEHIPHPI LDLCARSGKL YPSAEAIKAG GDRRLEKALL DRAQVANARY TMIRSRDDLT SAIAEIGLPM VLKSALGGYD GKGQWRLKEP TQIESVWQEL AQYLAANPEQ AIVAEEFVAF DREVSLVGAR NLVGDVVVYP LAENVHTQGV LSLSTAIDAP ALQTQAKAMF KAVAEQLNYV GVLALEFFEV QGQLLVNEIA PRVHNSGHWT QQGAETCQFE NHLRAVCGLP LGSTKLVRET AMINILGEDQ LPAEVLALEG CHVHWYGKAK RSGRKMGHIN VTADYSGELQ RKLCQLATVL DEKAFPAVHA VAKEIQP
|
| |
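The gene and protein sequences above are printed in space-separated blocks of ten, so they need light cleaning before use. The following is a minimal sketch, not the database's own tooling: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is built from the amino acid assignments of NCBI translation table 11 (the table named in the record). It ignores the special handling of alternative start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11,
# with codons enumerated in the base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_30 = "ATGCGTGTGT TGGTTCTCGG CGCTGGTCAG"
print(translate(first_30))  # MRVLVLGAGQ
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and protein sequence fields.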