Gene Shewana3_0441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_0441 
SymbolpurH 
ID4479348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp515787 
End bp517385 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content53% 
IMG OID639724977 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_868090 
Protein GI117918898 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTTG CAAATAATGC CAGACCCATT CGTCGCGCGC TGTTAAGCGT TTCAGATAAA 
ACCGGAATTC TCGAATTCGC CAAAGCATTA CACGCCCAAG GCGTTGAACT GCTGTCAACG
GGCGGCACCG CTCGCTTGTT AGCGGATAAC GGCGTGCCTG TTATCGAAGT ATCTGACTAT
ACAGGACACC CTGAGATCAT GGATGGTCGC GTTAAGACCC TGCACCCTAA AGTGCATGGT
GGCATTTTGG CGCGTCGCGG TCTTGATGAA AATGTCATGG CTGCCAACAA CATCAATGCA
ATCGATCTGG TTGCGGTTAA CCTCTACCCC TTTGCCGATA CCGTTGCTAA AGCCGGTTGC
ACCCTAGAAG ATGCGATTGA AAACATCGAC ATCGGTGGCC CGACTATGGT GCGCGCCGCG
GCGAAAAACC ATAAAGACGT GACTATCGTG GTAAATGCGG CCGACTATAA CCGCGTATTA
GCCGAAATGG CCGCCAACAA TGGCAGCACG ACTCACACGA CCCGTTTCGA TTTAGCGATT
GCTGCCTTCG AACACACTGC GGGTTACGAT GGCATGATCG CCAACTACTT CGGCACTATG
GTTCCTGCAC ACAGCACTGA CGAGTGCTTC GAAGATTCTA AGTTCCCACG CACCTTCAAC
ACCCAATTAG TGAAGAAGCA AGATCTACGT TACGGTGAAA ACAGCCACCA AACTGCGGCC
TTCTATGTCG ACACTAAGAT CGACGAAGCC TCTGTTGCGA CAGCCGTCCA GCTGCAAGGT
AAGGCACTGT CTTACAACAA CATCGCCGAT ACCGATGCCG CCCTTGAGTG TGTGAAAGAG
TTCAGCGAGC CAGCCTGCGT TATCGTTAAA CACGCTAACC CATGTGGTGT TGCACTGGGT
AAAGACTTGC TCGATGCCTA TAACCGCGCC TATCAAACTG ATCCAACTTC AGCCTTCGGC
GGCATTATCG CCTTCAACGG CGAGTTAGAT GCAGCAACGG CTAGCGCTAT CGTTGAGCGT
CAATTCGTTG AAGTGATTAT TGCGCCAGTC GTGAGCCAAG GCGCACGCGA TGTAGTGGCC
AAGAAAACCA ACGTGCGTCT GTTAGAGTGC GGTCAATGGG ATACTAAGAC CAAGACCTTA
GACTATAAGC GCGTGAACGG TGGCCTGCTG GTGCAAGACC GCGATCAAGG CATGGTTGGT
TTAGATGACA TTAAAGTCGT GACTAAACGT CAACCGACCG AGAGCGAGCT GAAGGACTTA
ATGTTCTGCT GGAAAGTGGC TAAGTTCGTT AAATCTAACG CCATTGTTTA CGCTAAAGAC
GGTATGACCA TCGGTGTCGG CGCCGGCCAA ATGAGCCGCG TCTACAGCGC TAAGATTGCC
GGTATCAAGG CGGCCGATGA AGGCCTAGAA GTGGTTAACT CTGTGATGGC GTCCGATGCC
TTCTTCCCAT TCCGCGACGG TATCGATGCC GCAGCGGCGG CGGGCATCAG CTGCATCATC
CAGCCAGGTG GCTCAATGCG CGATGCTGAA ATCATCGCTG CTGCCGACGA GCACGGCATG
GCCATGGTAA TGACGGGCAT GCGCCACTTC CGTCACTAA
 
Protein sequence
MTVANNARPI RRALLSVSDK TGILEFAKAL HAQGVELLST GGTARLLADN GVPVIEVSDY 
TGHPEIMDGR VKTLHPKVHG GILARRGLDE NVMAANNINA IDLVAVNLYP FADTVAKAGC
TLEDAIENID IGGPTMVRAA AKNHKDVTIV VNAADYNRVL AEMAANNGST THTTRFDLAI
AAFEHTAGYD GMIANYFGTM VPAHSTDECF EDSKFPRTFN TQLVKKQDLR YGENSHQTAA
FYVDTKIDEA SVATAVQLQG KALSYNNIAD TDAALECVKE FSEPACVIVK HANPCGVALG
KDLLDAYNRA YQTDPTSAFG GIIAFNGELD AATASAIVER QFVEVIIAPV VSQGARDVVA
KKTNVRLLEC GQWDTKTKTL DYKRVNGGLL VQDRDQGMVG LDDIKVVTKR QPTESELKDL
MFCWKVAKFV KSNAIVYAKD GMTIGVGAGQ MSRVYSAKIA GIKAADEGLE VVNSVMASDA
FFPFRDGIDA AAAAGISCII QPGGSMRDAE IIAAADEHGM AMVMTGMRHF RH