Gene HMPREF0424_0832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0832 
Symbol 
ID8709725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp943612 
End bp945282 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content44% 
IMG OID646482932 
Productputative phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_003374049 
Protein GI283783295 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAACAGA CACAGAGCAA AGAATCAAAC GCAAATGTGC TGATAACCAC AGGTGAAAAG 
CAAATTCGTA GAGTTCTTGT TTCAGTTTTT CATAAAGAAG GACTTGAAAT ATTAGCTGAA
GCATTTAGAA AGGCTGATAC TGAAGTAGTT TCAACCGGCT CTACTGCTGC TAAATTGGCG
GAACTTGGCG TTCATGTTAC TGAAGTTTCT GACATTACTG GTTTCCCAGA AAGTCTTGAT
GGAAGAGTGA AGACTTTAGA TCCGCATATT CACGCTGGAA TTCTTGCAGA TATGAGTAAT
GCTGATCATG TAAAGCAACT TGAAAAGCTA AACATAAAGC CGTTTGATGC TGTAGTAGTT
AATTTGTATC CGTTCGTAGA CACTGTAATG AGCGGTGCTG ATAGTGACGC AATTATTGAG
AAAATTGATA TTGGTGGACC TTCTATGATT CGTGCTGCAG CAAAGAATCA TAAGTCTGTT
GTCGTTATTA CAGATCCTGC AGATTACCAG CTTTTAGCTA ATAGAATCGT TTCAGGTGAA
GGCTTTAATT TGCAAGAGCG CGAATATTTG GCTGGTAAAG CATTTGCTCA TACAGCAGCT
TACGATGCTT CTATATCTGA GTGGACAAGT AAAGCATGGC AGAAGCCTGA AAGTTTGAAC
ACAAATGATG AAGAAGATAG TAAAAATGCT GAATCTGTGA AGCTTCCTGC AAACTACACG
CGCACTTGGA GCTTAGAGCA TACGCTTCGA TATGGCGAGA ACCCTCATCA GCAAGCAGGA
TTGTATTTAG ATCCTCTGCA TAAGGGCGGT TTAGCTCAAG CTGAGCTTCT TGGCGGAAAA
CCTATGAGCT ACAACAATTA TGTGGATGCG GATGCTGCTT GGCGTGCGGT TTGGGATTTT
GCTCCACAAA TTGCAGTTGC TGTTTGCAAG CATAATAATC CATGCGGTTT AGCTATTGGA
GCTAATGCAG CTGAAGCTCA CAAGAAAGCT CACGCTTGCG ATCCTGTAAG TGCTTACGGT
GGTGTGATTG CTTCAAATAC TACAGTCACT TTAGAAATGG CAGAAAGTGT GCGTCCAATT
TTCACAGAGG TTATTGTGGC TCCAGACTAT GAGCCTGAAG CTTTGGAATT GCTTCGCACT
AAAAAAAAGA ACCTCCGCAT TTTGAAGGTT GCAACTCCGC CAGTTTTAGG TTTGCAAGTT
CATCCTATTG ACGGCGGTGC TTTAGTTCAG TCTGCAGATA AGATTGATGC AGTTGGTGAT
AATCCTGAAA ACTGGACGCT TGTTTCTGGA GATCCTGCTG ATGCTGGTAC TTTGCGCGAT
TTGCAATTTG CCTGGCGTTC GCTTCGTTGT GTAAAGTCTA ACGCAATCCT ACTTGCGCAT
GATAATGCTA CTGTTGGCAT TGGTATGGGG CAGGTTAATC GCGTGGATTC GTGCCATTTG
GCTGTTGAAC GTGCGAATAC TTTGGCTGAT GGTGCTGAAC GTGCTCGTGG AGCTGTTGCG
GCTTCGGATG CTTTCTTCCC ATTCGCTGAC GGTCCGCAAA TTTTGATTGA AGCTGGCGTT
CGCGCAATCG TGCAGCCTGG TGGCTCTATT CGCGACGAAG AAGTGTTCGA AGCTGCGCGC
AAAGCTGGTG TAACGATGTA CACTACCGGT ACTCGTCACT TCTTCCACTA A
 
Protein sequence
MQQTQSKESN ANVLITTGEK QIRRVLVSVF HKEGLEILAE AFRKADTEVV STGSTAAKLA 
ELGVHVTEVS DITGFPESLD GRVKTLDPHI HAGILADMSN ADHVKQLEKL NIKPFDAVVV
NLYPFVDTVM SGADSDAIIE KIDIGGPSMI RAAAKNHKSV VVITDPADYQ LLANRIVSGE
GFNLQEREYL AGKAFAHTAA YDASISEWTS KAWQKPESLN TNDEEDSKNA ESVKLPANYT
RTWSLEHTLR YGENPHQQAG LYLDPLHKGG LAQAELLGGK PMSYNNYVDA DAAWRAVWDF
APQIAVAVCK HNNPCGLAIG ANAAEAHKKA HACDPVSAYG GVIASNTTVT LEMAESVRPI
FTEVIVAPDY EPEALELLRT KKKNLRILKV ATPPVLGLQV HPIDGGALVQ SADKIDAVGD
NPENWTLVSG DPADAGTLRD LQFAWRSLRC VKSNAILLAH DNATVGIGMG QVNRVDSCHL
AVERANTLAD GAERARGAVA ASDAFFPFAD GPQILIEAGV RAIVQPGGSI RDEEVFEAAR
KAGVTMYTTG TRHFFH