Gene Ssed_0444 details


Gene Information

Locus tag: Ssed_0444
Symbol: purH
ID: 5612542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sediminis HAW-EB3
Kingdom: Bacteria
Replicon accession: NC_009831
Strand:
Start bp: 553060
End bp: 554664
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 51%
IMG OID: 640931289
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001472185
Protein GI: 157373585
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
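
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 553060
END_BP = 554664


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1605 534"
```

Under these assumptions the result agrees with the listed gene length (1605 bp) and protein length (534 aa).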


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.23043
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAATG CCAGACCCAT TCGTCGCGCG CTGTTAAGCG TTTCAGATAA AACCGGAATC 
CTTGAGTTTG CAAAATCTCT ACACGCTCAA GGCGTAGAAC TGCTATCTAC TGGTGGCACC
GCCCGCCTTT TGGCTGATAA CGGTGTGCCT GTCATTGAAG TATCGGATCA TACCGGACAT
CCTGAAATTA TGGATGGTCG TGTTAAGACC CTGCACCCTA AAGTGCATGG CGGTATCTTA
GCGCGCCGCG GTATCGACGA GCTCGTCATG GAACAAAACA ACATCAAGCC TATCGACCTG
GTTGCCGTCA ACCTGTATCC ATTCGCAGAG ACTGTGGCGA AAGAGGGTTG TACTTTGGCC
GATGCAGTCG AAAATATCGA TATCGGCGGT CCAACTATGG TTCGCTCTAC GGCGAAAAAC
CATAAAGACA CCACCATCAT AGTTAACGCG AGTGATTACG ACCGCGTTAT TGTTGAGATG
AATGCTAATG AAGGCAGCAC GACGCTGGAG ACTCGCTTCG ATTTAGCTAT AGCCGCGTTC
GAGCACACCG CAGCATATGA CGGCATGATT GCTAATTACT TCGGCACTCA GGTACCGGCA
CACAGTAAAG ATGAGTGCCA TCACGACTCT AAGTTCCCGC GTACCTACAA TACTCAGCTG
GTGAAGAAAC AAGATCTGCG TTACGGCGAA AACAGCCACC AGACCGCGGC TTTCTATGTC
GATAGCCCCT CTTTTAACGG CCAGGGCGAT GAAGCTTCTG TCGCGAGCGC CATACAGCTA
CAGGGTAAGG CATTGTCTTA CAACAACATC GCCGATACCG ATTCAGCACT CGAGTGCGTG
AAAGAGTTCA GCGAACCGGC TTGTGTCATC GTTAAGCACG CTAACCCATG TGGTGTGGCT
ATAGGTAGTG ATCTTCTCGA TGCCTATAAC CGTGCTTTTA AAACCGATCC GACCTCGGCC
TTCGGTGGCA TTATCGCTTT CAATGGTGAG CTCGATGCGG CAACGGCCAG CGCGATTGTT
GAACGCCAAT TTGTTGAAGT CATTATCGCA CCGAAAGTGA GCCAAGCCGC TCGCGATATC
GTGGCTGCTA AAGCCAACCT TCGTCTTCTC GAATGTGGCG AGTGGAACAC TAAGACCACG
AGCTTAGATT ATAAGCGAGT CAACGGTGGG CTGCTGCTGC AAGACAGAGA TCAAGGTATG
GTCGGCCTGG ATGACGTGAA AGTGGTTTCT AAGCGTCAAC CCACCGCAGC CGAGATGAAA
GATCTGATGT TCTGCTGGAA AGTGGCTAAG TTCGTTAAAT CAAACGCCAT TGTTTACGCT
AAAGACAGCA TGACTATCGG CGTGGGCGCA GGCCAGATGA GTCGCGTATA CAGCGCGAAA
GTGGCTGGCA TTAAGGCTGC CGATGAAGGG CTGGAAGTTC AGGATTCAGT TATGGCGTCC
GATGCCTTCT TCCCATTCCG TGATGGTATC GATGCAGCCG CTGCTGCGGG TATCAGCTGT
ATCATCCAAC CTGGTGGTTC GATTCGTGAT GAAGAGATCA TTGCCGCGGC AGATGAGCAC
GGCATGGCGA TGGTATTCAC CGGAATGCGC CACTTCCGTC ATTAA
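
The GC content listed under Gene Information can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown above; `clean` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used here to keep the example short.

```python
def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence above (60 bases).
FIRST_ROW = "ATGAATAATG CCAGACCCAT TCGTCGCGCG CTGTTAAGCG TTTCAGATAA AACCGGAATC"

print(f"{gc_fraction(FIRST_ROW):.1%}")
```

Because the example covers only the first 60 bases, its result is not expected to match the whole-gene figure; passing the full 1605 bp sequence gives a value that can be compared with the listed GC content.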
 
Protein sequence
MNNARPIRRA LLSVSDKTGI LEFAKSLHAQ GVELLSTGGT ARLLADNGVP VIEVSDHTGH 
PEIMDGRVKT LHPKVHGGIL ARRGIDELVM EQNNIKPIDL VAVNLYPFAE TVAKEGCTLA
DAVENIDIGG PTMVRSTAKN HKDTTIIVNA SDYDRVIVEM NANEGSTTLE TRFDLAIAAF
EHTAAYDGMI ANYFGTQVPA HSKDECHHDS KFPRTYNTQL VKKQDLRYGE NSHQTAAFYV
DSPSFNGQGD EASVASAIQL QGKALSYNNI ADTDSALECV KEFSEPACVI VKHANPCGVA
IGSDLLDAYN RAFKTDPTSA FGGIIAFNGE LDAATASAIV ERQFVEVIIA PKVSQAARDI
VAAKANLRLL ECGEWNTKTT SLDYKRVNGG LLLQDRDQGM VGLDDVKVVS KRQPTAAEMK
DLMFCWKVAK FVKSNAIVYA KDSMTIGVGA GQMSRVYSAK VAGIKAADEG LEVQDSVMAS
DAFFPFRDGI DAAAAAGISC IIQPGGSIRD EEIIAAADEH GMAMVFTGMR HFRH
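
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own procedure. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. Since this gene begins with ATG, no special start-codon handling is needed. `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above (20 codons).
FIRST_ROW = "ATGAATAATG CCAGACCCAT TCGTCGCGCG CTGTTAAGCG TTTCAGATAA AACCGGAATC"
print(translate(FIRST_ROW))  # prints "MNNARPIRRALLSVSDKTGI"
```

The output matches the first 20 residues of the protein sequence above. Translating the full gene sequence the same way yields a protein that can be compared against the listed one.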