Gene Apre_1108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1108 
SymbolpurH 
ID8397895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1188990 
End bp1190495 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content39% 
IMG OID644995455 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_003152856 
Protein GI257066600 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGAGCTT TACTATCAGT TACTGACAAG ACAGGAATAG AAAAACTAGC CAAAGACCTT 
AGGGACTTGG GAGTAAGTTT GGTTTCAACA GGAGGCACCT ACAAAAAGAT CAAAGATAGC
GGAGTAGATG TATCAGAGAT TGAGGAAATA ACAAACTTTC CAGAAATACT AGAAGGAAGG
GTAAAGACCC TATCTCCTTA TGTACATGGA GGAATTCTTT ATAAGAGGGA TGAAGCTAGT
CATGTTTCAA CTGTAGAAGA GTTGGGGATA AAGGCAATTG ATATAGTAGT TGTAAATTTA
TACGAATTCC AAAAGGCCCT TGATAAGGGA AATCCAGAAG AGATAATCGA AAATATCGAC
ATCGGTGGCC CATCCATGGT TAGGTCTGCT GCCAAAAACC ATAAAGATGT CTTAATTGTA
ACAGATCCAA GTGATTATGA TGAACTAATC GAAAGACTTA AAAACGATGA TATAGACCTA
GCTTATAGGC AAAGACTTGC TATGAAGGCC TTTAGCCTTA CAGCATTCTA CGATTCAGTA
ATAGCAAGGT ACTTCACAAA ACTTACTGGA GAAGAATCTA AATATAAGAC CTACGGCTTT
GAGAAAGAAA CAGATTTACG TTATGGGGAA AATCCAGGAC AAGAGGCAAG TTTATACAAT
GATCCATTTG TCACAGGACT TATGGAAGAT ATAGAAGTAA TTCACGGCAA GGAAATGAGT
TATAACAACT ACAATGATCT AAACCCAGCC CTAGAGCTTG CCCAAGAGCT AGGAGATAAT
GCAGTAGTTG CTCTTAAACA CCAATCACCA TGCGGGGTTG CTGTAGGAAG TGATGTCTAT
GATTCATACA TTAAGGCCTT CGAGTGCGAC AGCCAATCAA TATTTGGAGG AATCCTTGCA
GTAAATGGAG TAGTTGATGA GAAAGCAGCT TCGAAAATGC ATGAAATATT CCTAGAAATA
ATAGCAGCAA AAGACTTTAC AAAAGAGGCT CTAGAAATTC TTACAAAGAA GAAAAATCTA
AGGCTCGTTA AAGTCGACTT TGCTAATGAA AGTGTAAGAG AAGAAATCAG ATATCTTAAT
GGAAAAGTCC TAATTCAAGG AAAAGACTTC GGCAAGGACG AAGTAAATAT AGTAACTGAC
AAAAAGCCTA GTGAAGAAGA AATCAAAGAC CTCTTATTTG CCCAAAAGGT GGTAAAATAT
GTCAAATCAA ATGCCATTGT AGTAGCCAAG GGAATGAAGA CCCTAGGTTG TGGGGCAGGT
CAACAATCTA GAGTTTGGGC GCTTGAATCT ATCAAAGATC ACTTTAAGGA TAGGGACTTT
GAGGGAGCAG TCCTTGGATC AGATGCCTTC TTCCCATTTT CAGATACAGT AGAGCTTGCC
CACGAGATGG GAATTAGCTC AATCATCCAA CCAGGTGGAT CAATAAGAGA CGAAGACTCA
ATCGATAAAT GTAATGAATA TGGTATGAGC ATGGTATTTA GCAAATCACG TCACTTCAAA
CATTAA
 
Protein sequence
MRALLSVTDK TGIEKLAKDL RDLGVSLVST GGTYKKIKDS GVDVSEIEEI TNFPEILEGR 
VKTLSPYVHG GILYKRDEAS HVSTVEELGI KAIDIVVVNL YEFQKALDKG NPEEIIENID
IGGPSMVRSA AKNHKDVLIV TDPSDYDELI ERLKNDDIDL AYRQRLAMKA FSLTAFYDSV
IARYFTKLTG EESKYKTYGF EKETDLRYGE NPGQEASLYN DPFVTGLMED IEVIHGKEMS
YNNYNDLNPA LELAQELGDN AVVALKHQSP CGVAVGSDVY DSYIKAFECD SQSIFGGILA
VNGVVDEKAA SKMHEIFLEI IAAKDFTKEA LEILTKKKNL RLVKVDFANE SVREEIRYLN
GKVLIQGKDF GKDEVNIVTD KKPSEEEIKD LLFAQKVVKY VKSNAIVVAK GMKTLGCGAG
QQSRVWALES IKDHFKDRDF EGAVLGSDAF FPFSDTVELA HEMGISSIIQ PGGSIRDEDS
IDKCNEYGMS MVFSKSRHFK H