Gene VC0395_A2468 details

Gene Information

Locus tag: VC0395_A2468
Symbol: purK
ID: 5137360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 2619871
End bp: 2621004
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 51%
IMG OID: 640533919
Product: phosphoribosylaminoimidazole carboxylase ATPase subunit
Protein accession: YP_001218361
Protein GI: 147674298
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
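
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2619871
END_BP = 2621004


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1134, matching "Gene Length"
print(protein_length(length_bp))  # 377, matching "Protein Length"
```

Under these assumptions both reported lengths are consistent with the coordinates.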


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.599362
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGTGT TGGTTCTCGG CGCTGGTCAG CTGGCGCGCA TGATGTCGCT CGCCGGAGCA 
CCGCTCAATA TTGAAACGAT CGCTTTTGAT GTGGGTAGCG AAAACATTGT GCACCCCTTA
ACGCAAACTG TGCTTGGGCA TGGATTGGAG CAAGCGATTG AACAAGTCGA TGTGATCACC
GCTGAGTTTG AACACATTCC GCATCCGATC CTCGATCTCT GTGCACGCAG TGGCAAACTT
TACCCAAGCG CTGAAGCTAT CAAAGCTGGC GGCGATCGTC GTTTAGAAAA AGCCTTGCTG
GATCGCGCCC AAGTGGCGAA TGCACGTTAT ACGATGATCC GCAGCCGAGA CGACCTAACC
TCAGCCATCG CCGAGATTGG ATTGCCTATG GTGCTGAAAA GTGCACTCGG AGGCTACGAT
GGAAAAGGCC AATGGCGCTT GAAAGAACCA ACGCAGATCG AATCGGTTTG GCAAGAACTT
GCGCAATATC TGGCAGCCAA CCCCGAACAA GCAATTGTGG CAGAAGAATT TGTCGCTTTT
GATCGCGAAG TGTCACTGGT CGGTGCACGT AACCTAGTCG GCGATGTTGT GGTGTATCCT
TTAGCGGAGA ACGTTCATAC CCAAGGTGTG TTGAGCCTTT CTACCGCCAT TGATGCTCCT
GCGCTACAAA CTCAAGCGAA AGCCATGTTT AAAGCGGTAG CCGAGCAGCT CAATTATGTC
GGTGTATTAG CGCTGGAGTT TTTTGAAGTA CAAGGCCAGT TACTGGTCAA TGAAATTGCA
CCACGAGTTC ATAACTCCGG TCACTGGACT CAGCAAGGTG CGGAAACCTG TCAGTTCGAA
AACCACTTAC GCGCAGTGTG TGGCTTACCG CTGGGTAGCA CCAAACTGGT TCGTGAGACC
GCGATGATTA ATATTCTTGG TGAAGATCAG CTGCCCGCAG AAGTATTGGC ACTGGAAGGC
TGCCACGTAC ATTGGTACGG CAAGGCCAAG CGCTCAGGAC GCAAGATGGG GCATATCAAT
GTGACCGCCG ATTACAGTGG TGAGTTGCAA CGCAAATTAT GCCAATTAGC GACTGTGTTA
GATGAAAAGG CTTTTCCTGC CGTACACGCC GTAGCAAAGG AAATTCAGCC TTAA
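
The gene sequence is displayed in space-separated blocks of ten bases. As a rough sketch of how one might strip that layout and recompute the GC content field, the helpers below (hypothetical names, standard library only) operate on any pasted block; only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, pasted with its original spacing.
first_line = "ATGCGTGTGT TGGTTCTCGG CGCTGGTCAG CTGGCGCGCA TGATGTCGCT CGCCGGAGCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Running the same two functions over the full sequence should give a length of 1134 and, if the record is internally consistent, a GC percentage close to the reported 51%.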
 
Protein sequence
MRVLVLGAGQ LARMMSLAGA PLNIETIAFD VGSENIVHPL TQTVLGHGLE QAIEQVDVIT 
AEFEHIPHPI LDLCARSGKL YPSAEAIKAG GDRRLEKALL DRAQVANARY TMIRSRDDLT
SAIAEIGLPM VLKSALGGYD GKGQWRLKEP TQIESVWQEL AQYLAANPEQ AIVAEEFVAF
DREVSLVGAR NLVGDVVVYP LAENVHTQGV LSLSTAIDAP ALQTQAKAMF KAVAEQLNYV
GVLALEFFEV QGQLLVNEIA PRVHNSGHWT QQGAETCQFE NHLRAVCGLP LGSTKLVRET
AMINILGEDQ LPAEVLALEG CHVHWYGKAK RSGRKMGHIN VTADYSGELQ RKLCQLATVL
DEKAFPAVHA VAKEIQP
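
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming the sequence has already been stripped of whitespace. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons it permits, so the compact standard table is used here; the function name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq):
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCGTGTGTTGGTTCTCGGCGCTGGTCAG"))  # MRVLVLGAGQ
```

The output matches the first ten residues of the protein sequence. This sketch does not handle ambiguous bases or alternative start codons, which a full table 11 implementation would translate as methionine at the first position.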