Gene Strop_3806 details


Gene Information

Locus tag: Strop_3806
Symbol: purH
ID: 5060284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start (bp): 4360946
End (bp): 4362514
Gene length: 1569 bp
Protein length: 522 aa
Translation table: 11
GC content: 72%
IMG OID: 640476064
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001160615
Protein GI: 145596318
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
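
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `check_cds_record` is hypothetical and is not part of any database tool.

```python
def check_cds_record(start_bp: int, end_bp: int,
                     gene_length_bp: int, protein_length_aa: int) -> dict:
    """Cross-check the coordinates and lengths of a CDS record."""
    # Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
    span_bp = end_bp - start_bp + 1
    # A complete CDS is a whole number of codons.
    whole_codons = gene_length_bp % 3 == 0
    # Assumption: the last codon is a stop codon and encodes no residue.
    expected_protein_length = gene_length_bp // 3 - 1
    return {
        "span_bp": span_bp,
        "span_matches_gene_length": span_bp == gene_length_bp,
        "whole_codons": whole_codons,
        "expected_protein_length": expected_protein_length,
        "protein_length_matches": expected_protein_length == protein_length_aa,
    }


# Values copied from the record above.
result = check_cds_record(4360946, 4362514, 1569, 522)
for key, value in result.items():
    print(f"{key}: {value}")
```

Under these assumptions the record is internally consistent: the span is 1569 bp, which is 523 codons, and 523 codons minus the stop codon gives 522 residues.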


Plasmid Coverage information

Number of covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTTCCA GTCAGGACGA GCGCCGCCCG ATCCGGCGGG CGCTGGTCAG CGTCTACGAC 
AAGGCCGGTC TGGCCGAGCT GGCCCAGGCG TTGCACGACG CCGGCGTGGA GATCGTCTCG
ACCGGAAGCA CCGCGTCGGC CATCGCCGGT GCCGGCGTGC CGGTCACCGC GGTGGATTCC
GTGACCGGGT TCCCGGAGAT CCTCGACGGC CGGGTCAAGA CCCTGCACCC CAAGATCCAC
GGCGGTCTCC TCGCTGACCT GCGCAAGGAG TCGCACGTTG GGCAGCTCGC CGAGCACGGC
ATCGGGGCAA TCGACCTGCT GGTGTCCAAC CTGTACCCGT TTCAGGCGAC CGTCGCCTCC
GGGGCGGGGC AGGACGAGTG TGTCGAGCAG ATCGACATCG GCGGGCCGGC GATGGTGCGG
GCCGCTGCCA AGAACCACGC CTCGGTCGCC GTGGTGACCG ACCCGTCGGG CTACCCGCAG
CTGCTGACGG CGGTACGGGT GGGTGGTTTC ACCCTGGCGC AGCGTCGGGC GCTCGCGGCC
CGCGCGTTCG CGGTGATCGC CGACTACGAC GTGGCCGTCG CCGAGTGGTG CGCGCGGGAG
TTGGTCGAGG ACGCGCCGTG GCCGAGCTTC GCCGGGCTGG CGCTGCGCCG CGACGCGGTG
CTGCGGTACG GGGAGAACCC GCACCAGGCA GCCGCCCTCT ACACCGACCC GTCGAGCCCG
GCCGGGCTTG CCCAGGCCGA GCAGCTGCAC GGCAAGGAGA TGTCGTACAA CAACTACGTT
GACGCCGACG CCGCCTGGCG GGCCGCCAAC GACTTCTCCG ACCAGCCGGC GGTGGCGATC
ATCAAGCACG CCAACCCGTG TGGCATCGCG GTTGGCGTGG ACGTCGCAGA GGCGCACCGC
AAGGCGCACG CCTGCGACCC CGTGTCCGCC TTCGGTGGCG TGATCGCCGT GAACCGACCG
GTCGGCGTCG AGCTCGCCGG GCAGGTGTCG GAGGTGTTCA CCGAGGTGGT TGTCGCTCCG
GAGTTCGAAC CCGACGCGCT CGAGGTACTG CGGGGCAAGA AGAACGTGCG CCTGCTGCGC
GCCCCGGCCT ACGCCCCGGC GTCGGCGGAG TGGCGACCGG TCACCGGTGG GATGCTGGTG
CAGGTGCGGG ACAAGGTGGA CGCCGCGGGT GACGACCCGG CCACCTGGCA GCTGGCGACC
GGCGAGGCCG CCGACGAGGC GACCCTGCGG GACCTGGCCT TCGCCTGGCG GGCGGTCCGA
GCGGTGAAGA GCAACGCGAT TCTGCTCGCC CGCGACGGCG CGACCGTCGG TGTGGGTATG
GGGCAGGTCA ACCGGGTGGA CTCGGCCCGG CTGGCGGTGG ATCGGGCCGG TGCCGAACGG
GCGCGGGGCG CGGTGTGTGC CTCGGACGCG TTCTTCCCGT TCGCCGACGG GCCGAAGATC
CTCATCGACG CCGGAGTGCG GGCGATCGTC CAACCTGGCG GGTCGATCCG GGACGAGGAG
GTCATCGCCG CTGCCAAGGC GGCCGGCGTG ACCATGTACC TGACCGGCAC CCGTCACTTT
TTCCACTGA
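
The GC content reported for this gene can be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text with the space-separated 10-base blocks shown here; `gc_content` is a hypothetical helper, not the method the database itself uses.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to lay the sequence out in blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First three 10-base blocks of the gene sequence, used as a small example.
example = "GTGAGTTCCA GTCAGGACGA GCGCCGCCCG"
print(f"GC fraction of the example: {gc_content(example):.2f}")
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the reported GC content.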
 
Protein sequence
MSSSQDERRP IRRALVSVYD KAGLAELAQA LHDAGVEIVS TGSTASAIAG AGVPVTAVDS 
VTGFPEILDG RVKTLHPKIH GGLLADLRKE SHVGQLAEHG IGAIDLLVSN LYPFQATVAS
GAGQDECVEQ IDIGGPAMVR AAAKNHASVA VVTDPSGYPQ LLTAVRVGGF TLAQRRALAA
RAFAVIADYD VAVAEWCARE LVEDAPWPSF AGLALRRDAV LRYGENPHQA AALYTDPSSP
AGLAQAEQLH GKEMSYNNYV DADAAWRAAN DFSDQPAVAI IKHANPCGIA VGVDVAEAHR
KAHACDPVSA FGGVIAVNRP VGVELAGQVS EVFTEVVVAP EFEPDALEVL RGKKNVRLLR
APAYAPASAE WRPVTGGMLV QVRDKVDAAG DDPATWQLAT GEAADEATLR DLAFAWRAVR
AVKSNAILLA RDGATVGVGM GQVNRVDSAR LAVDRAGAER ARGAVCASDA FFPFADGPKI
LIDAGVRAIV QPGGSIRDEE VIAAAKAAGV TMYLTGTRHF FH
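
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below shows one way to reproduce that translation in plain Python. It is an illustration under stated assumptions, not the pipeline that produced this record: the amino-acid string and the set of alternative start codons are taken from the NCBI definition of table 11, and the helper name `translate_table11` is hypothetical. Note that the gene begins with GTG, which encodes methionine when used as a start codon.

```python
from itertools import product

# NCBI genetic code table 11, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11; each is read as M at the first position.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_table11("GTGAGTTCCA GTCAGGACGA GCGCCGCCCG"))  # MSSSQDERRP
```

The output matches the first ten residues of the protein sequence, and the same function can be applied to the full gene sequence for a complete comparison.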