Gene Spea_0432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpea_0432 
SymbolpurH 
ID5660832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella pealeana ATCC 700345 
KingdomBacteria 
Replicon accessionNC_009901 
Strand
Start bp537651 
End bp539240 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content48% 
IMG OID641234969 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001500295 
Protein GI157960261 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAATG CCAGACCTAT TCGTCGCGCG CTGTTAAGCG TTTCAGATAA AACCGGAATC 
CTAGAGTTTG CACAAGCGCT GCATGCTCAA GGTGTTGAGC TACTTTCTAC TGGTGGCACC
GCACGCCTGC TGGCTGATAA TGGCGTGCCT GTCATTGAAG TATCAGATTA TACTGGTCAC
CCAGAGATCA TGGATGGCCG CGTTAAAACC CTGCACCCTA AAGTGCATGG CGGTATTTTA
GCGCGTCGCG GTATCGACGA AATCGTAATG GAACAAAACA ACATCAAGCC TATCGACTTG
GTTGCAGTTA ACTTGTATCC GTTTGCTGCT ACCGTTGCTC AAGAAGGTTG CACGCTTGCT
GACGCTATCG AGAACATCGA TATCGGCGGC CCTACTATGG TGCGATCTAC CGCTAAAAAC
CATAAAGACA CTACTATCAT CGTTAACGCG AAAGATTACG GCCGCGTTAT TGCAGAAATG
CAATCAAACG AAGCTAGCAC TACCCTTGAG ACTCGCTTTG ATCTAGCGAT TGCAGCGTTT
GAGCACACTG CCGCATACGA TGGCATGATT GCTAACTACT TCGGCACTAA AGTACCTGCA
CACAGCAATG ACGAGTGCCA TGAAGATTCT AAGTTCCCAC GCACTTACAA CACTCAGCTA
GTTAAGAAGC AAGACTTGCG CTATGGCGAA AATAGCCACC AAACAGCCGC TTTCTATGTT
GATACAAATC TTGATGAAGC CTCTGTTGCC ACCGCCGTTC AACTGCAAGG TAAGGCACTG
TCATACAATA ACATCGCCGA TACCGACTCT GCACTTGAGT GCGTAAAAGA GTTCGACGAG
CCAGCTTGTG TGATTGTTAA GCATGCTAAT CCATGTGGTG TGGCTATTGG TGAAAACCTA
CTTGAAGCTT ATAACCGTGC ATTCCAAACT GACCCAACGT CGGCTTTCGG TGGCATCATC
GCCTTTAACG GCGAGCTAGA TGCAGCAACT GCTAGCGCGA TTGTTGAGCG CCAGTTTGTT
GAGGTGATCA TAGCCCCTAA AGTTAGCCAA GCAGCCCGTG ACGTTATTGC TGCTAAAGCA
AACGTACGCT TGCTTGAGTG TGGTGAGTGG GCAGCAAAAA CCACCAGCCT AGATTACAAG
CGTGTAAACG GCGGCCTATT ACTACAAGAT CGCGACCAAG GCATGGTTGG TCTTGATGAC
GTTAAGGTTG TTTCTAAGCG CCAACCAACG GCTGAAGAGA TGAAAGATCT AATGTTCTGC
TGGAAAGTGG CTAAGTTTGT TAAATCAAAC GCCATTGTTT ACGCCAAGAA CAGCATGACA
ATCGGTGTAG GTGCAGGCCA AATGAGCCGA GTTTACAGCG CTAAGGTTGC AGGTATTAAA
GCCGCAGACG AAGGCCTAGA AGTACAAAAC TCAGTAATGG CATCAGATGC GTTCTTCCCA
TTCCGTGATG GTATCGATGC TGCAGCAGAA GCAGGCATTA GCTGTATCAT CCAGCCTGGC
GGCTCTATCC GCGATGAAGA GATCATCGCA GCAGCTGATG AGCACGGCAT GGCAATGGTA
TTTACCGGTA TGCGTCACTT CCGTCATTAA
 
Protein sequence
MNNARPIRRA LLSVSDKTGI LEFAQALHAQ GVELLSTGGT ARLLADNGVP VIEVSDYTGH 
PEIMDGRVKT LHPKVHGGIL ARRGIDEIVM EQNNIKPIDL VAVNLYPFAA TVAQEGCTLA
DAIENIDIGG PTMVRSTAKN HKDTTIIVNA KDYGRVIAEM QSNEASTTLE TRFDLAIAAF
EHTAAYDGMI ANYFGTKVPA HSNDECHEDS KFPRTYNTQL VKKQDLRYGE NSHQTAAFYV
DTNLDEASVA TAVQLQGKAL SYNNIADTDS ALECVKEFDE PACVIVKHAN PCGVAIGENL
LEAYNRAFQT DPTSAFGGII AFNGELDAAT ASAIVERQFV EVIIAPKVSQ AARDVIAAKA
NVRLLECGEW AAKTTSLDYK RVNGGLLLQD RDQGMVGLDD VKVVSKRQPT AEEMKDLMFC
WKVAKFVKSN AIVYAKNSMT IGVGAGQMSR VYSAKVAGIK AADEGLEVQN SVMASDAFFP
FRDGIDAAAE AGISCIIQPG GSIRDEEIIA AADEHGMAMV FTGMRHFRH