Gene VC0395_A2653 details


Gene Information

Locus tag: VC0395_A2653
Symbol: purH
ID: 5137582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 2806027
End bp: 2807619
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 53%
IMG OID: 640534101
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001218531
Protein GI: 147673909
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
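
The start, end, gene length, and protein length fields above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2806027
end_bp = 2807619

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; the last codon is assumed to be a stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1593
print(protein_length)  # 530
```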


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAACG CTCGTCCTAT TCACCGTGCG CTTCTCAGCG TATCAGATAA AACCGGCATT 
GTTGAGTTCG CAAAAGCGCT TGCAGAGCGC GGCGTTGAAC TTCTCTCCAC CGGTGGCACC
GCTCGTCTAT TGGCTGAGCA AGGCCTGACG GTAACGGAAG TCTCCGATTA CACCGGTTTC
CCAGAAATGA TGGACGGACG CGTCAAAACC CTGCATCCGA AAGTGCATGG CGGTATTTTA
GGCCGTCGCG GCCAAGATGA CGCGGTGATG AATACCCACG GCATTCAGCC AATCGATATG
GTGGTGGTGA ACCTCTATCC TTTCGCCCAA ACGGTAGCTA ACCCAAATTG CACGCTGGCT
GATGCGGTGG AAAACATCGA TATCGGCGGG CCGACTATGG TGCGTTCTGC TGCAAAAAAC
CATAAAGATG TGGCCATTGT GGTGAATGCG CACGATTACG ACCGTGTGAT CCGCGAAATG
GATGCCAACC ACAACTCCCT GACGTTAGCG ACGCGTTTTG ATCTGGCGAT TGCCGCTTTT
GAACATACCG CGGCGTACGA TGGCATGATT GCGAACTACT TCGGCACACT CGTACCTTCT
TATGGTGATA ACAAAGAAGG GGATGAAGAG AGCAAATTCC CTCGCACGTT CAATGCGCAG
TTCATCAAAA AGCAAGATAT GCGTTATGGC GAAAACAGCC ACCAAGCGGC CGCGTTTTAT
GTAGAAGCCA ATCCACAAGA AGCCTCAGTC GCCACCGCGC GCCAAATTCA AGGTAAAGCC
CTTTCTTACA ACAACATTGC CGATACCGAT GCCGCACTAG AGTGCGTCAA AGAGTTCAGC
GAGCCAGCCT GTGTGATTGT GAAACACGCG AACCCATGTG GTGTAGCGCT GGGCGATGAT
CTTCTGCAAG CTTACAATCG CGCTTACCAA ACCGACCCAA CCTCGGCATT TGGCGGCATC
ATTGCCTTTA ACCGCGAGCT GGATGGCGAA ACCGCAAGAG CGATCATCGA GCGTCAGTTT
GTGGAAGTGA TCATTGCGCC GAAAGTCTCG CAAGCCGCGA TCGACATCGT CGCAGCAAAA
CAAAACGTAC GTTTGCTGGA GTGTGGTGAA TGGCAAGGCC AAACCACAGG ATTTGATTTG
AAGCGCGTGA ATGGTGGCTT ATTGGTGCAA GATCGTGACC AAGGCATGGT GGCACAAGAT
GACCTACAAG TGGTTTCGAC TCGTCAACCT AGCGACGCTG AGCTGAAAGA TGCTCTGTTC
TGCTGGAAAG TAGCGAAATA CGTGAAATCC AACGCGATTG TGTACGCCAA AGGCGACATG
GCCATCGGCA TCGGCGCGGG TCAAATGAGC CGTGTTTACT CCGCGAAAAT CGCCGGCATC
AAAGCCGCCG ATGAAGGCTT GGAAGTCGCG GGCAGCGTGA TGGCATCGGA TGCCTTCTTC
CCCTTCCGTG ATGGGATTGA TGCTGCGGCC GAAGCGGGCA TTACCTGTGT TATCCAACCG
GGCGGCTCAA TGCGCGACCA AGAGGTGATT GATGCGGCCA ACGAACACGG TATGGCGATG
ATCTTCACCG GTATGCGTCA CTTCCGTCAT TAA
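
The GC content field in the record can be checked against the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A, C, G, and T bases. The function name `gc_percent` is hypothetical, and how the record rounds its value is not stated.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace and any character other than A, C, G, T is ignored, so the
    space-separated 10-base blocks used in this record can be pasted as-is.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: the first two 10-base blocks of the gene sequence.
print(gc_percent("ATGAACAACG CTCGTCCTAT"))
```

Passing the full gene sequence to the function gives a value to compare with the GC content listed in the record.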
 
Protein sequence
MNNARPIHRA LLSVSDKTGI VEFAKALAER GVELLSTGGT ARLLAEQGLT VTEVSDYTGF 
PEMMDGRVKT LHPKVHGGIL GRRGQDDAVM NTHGIQPIDM VVVNLYPFAQ TVANPNCTLA
DAVENIDIGG PTMVRSAAKN HKDVAIVVNA HDYDRVIREM DANHNSLTLA TRFDLAIAAF
EHTAAYDGMI ANYFGTLVPS YGDNKEGDEE SKFPRTFNAQ FIKKQDMRYG ENSHQAAAFY
VEANPQEASV ATARQIQGKA LSYNNIADTD AALECVKEFS EPACVIVKHA NPCGVALGDD
LLQAYNRAYQ TDPTSAFGGI IAFNRELDGE TARAIIERQF VEVIIAPKVS QAAIDIVAAK
QNVRLLECGE WQGQTTGFDL KRVNGGLLVQ DRDQGMVAQD DLQVVSTRQP SDAELKDALF
CWKVAKYVKS NAIVYAKGDM AIGIGAGQMS RVYSAKIAGI KAADEGLEVA GSVMASDAFF
PFRDGIDAAA EAGITCVIQP GGSMRDQEVI DAANEHGMAM IFTGMRHFRH
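
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. The names `CODON_TABLE` and `translate` are illustrative, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Amino-acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string, stopping at the first stop codon.

    Whitespace is removed first, so the space-separated blocks in this
    record can be pasted directly. Trailing bases that do not form a
    complete codon are ignored.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# Usage: the first 30 bases of the gene sequence.
print(translate("ATGAACAACG CTCGTCCTAT TCACCGTGCG"))  # MNNARPIHRA
```

Applying `translate` to the full gene sequence gives a string that can be compared with the protein sequence listed above.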