Gene Sama_0395 details


Gene Information

Locus tag: Sama_0395
Symbol: purH
ID: 4602650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 491215
End bp: 492804
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 60%
IMG OID: 639779731
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_926275
Protein GI: 119773535
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
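
The start, end, gene length, and protein length listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon (neither is stated in the record); the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(491215, 492804)
print(length)                  # 1590
print(protein_length(length))  # 529
```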


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.93923
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAATG CCAGACCCAT TCGCCGCGCG CTGCTGAGCG TGTCCGATAA AACAGGTATC 
CTCGAGTTTG CCCAGGCGCT GCACGCACAG GGCGTAGAAC TGCTGTCCAC CGGTGGCACC
GCCAAGCTGC TGGCCGATAA CGGCGTACCT GTGATCGAAG TTTCTGACTA CACAGGTCAC
CCTGAGATCA TGGACGGTCG GGTCAAGACC CTGCATCCCA AGGTGCATGG TGGCATTCTG
GCGCGTCGCG GCCAGGACGA AGACGTGATG GCTGCCAACA ACATTGGCCC TATCGATCTG
GTTGCCGTTA ACCTGTATCC CTTTGCTGCC ACCGTAGCCA AGCCTGGCTG TACCCTGGAA
GACGCCATCG AGAACATCGA TATCGGCGGC CCTACCATGG TGCGCGCTGC CGCCAAGAAC
CACAAAGACG TGGTGATTGT GGTGAACGCC AAAGACTACG ACCGCGTACT GGCCGAAATG
AGCGCCAATG GCGGCTCTAC CAGCCACGCT ACCCGTTTCG ATTTGGCCAT TGCCGCCTTC
GAGCACACCG CCGCTTACGA TGGCATGATT GCCAACTACT TCGGCACCAT GGTTCCGGCT
CACAGCAGCG ACGAGTGCCA CGACGACTCC AAATTCCCAC GCACCTTCAA CACCCAGCTG
GTGAAGAAGC AGGACCTGCG CTACGGCGAA AACAGCCACC AGAGCGCCGC CTTCTACGTG
GATTTGAACA GCGACGAGGC CTCTGTGGCC ACCGCCACTC AGCTGCAGGG TAAGGCTCTG
TCTTACAACA ACATCGCCGA CACCGATGCC GCCCTTGAGT GCGTAAAAGA ATTCGACGCC
CCAGCCTGCG TTATCGTCAA GCACGCCAAC CCCTGTGGTG TGGCCCTGGG CGACAACCTG
CTGGACGCGT ACAACCGCGC CTACAAGACC GACCCCACTT CTGCTTTCGG TGGCATCATC
GCCTTTAACC GCGAGCTGGA CGGCGAAACC GCCGCCGCCA TCGTTGAGCG TCAGTTTGTG
GAAGTGATTA TCGCCCCTGT GGTGAGCCAA GCCGCCCGTG ACGTGGTTGC CAAGAAGACC
AACGTGCGCC TGCTGGAATG TGGTCAATGG ACTGCGCAGA CCAAGGGTCT GGACTACAAG
CGCGTAAACG GCGGCCTGCT TATTCAGGAC CGCGATCAGG GTATGGTGAC CGAGGCCGAA
CTCAAGGTCG TGACCAAGCG TGTACCGACC GAAGCTGAAC TGAAAGATCT GATGTTCTGC
TGGAAAGTGG CCAAGTTCGT GAAATCCAAC GCCATCGTGT ATGCCAAAGA AGGCATGACC
ATAGGCGTGG GCGCAGGCCA GATGAGCCGC GTCTACAGTG CCAAGATTGC CGGTATCAAG
GCCGCCGACG AAGGTCTGGT GGTTGAGGGC TCTGTGATGG CGTCCGACGC CTTCTTCCCA
TTCCGCGACG GTATCGACGC CGCAGCGGCT GCCGGGATCA GCTGCATCAT CCAGCCCGGC
GGTTCTATCC GCGACGAGGA AGTGATTGCC GCCGCCGACG AGCACGGCAT GGCCATGGTG
TTCACCAACA TGCGCCACTT CCGCCACTGA
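
The gene sequence is displayed in blocks of 10 bases, so it needs to be joined before any calculation. The sketch below shows one way to strip the block formatting and compute a GC fraction that could be compared with the GC content listed in the gene information; `strip_blocks` and `gc_fraction` are hypothetical helper names, and only the first displayed line is used here as input.

```python
def strip_blocks(seq_text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, exactly as displayed.
first_line = "ATGAATAATG CCAGACCCAT TCGCCGCGCG CTGCTGAGCG TGTCCGATAA AACAGGTATC"
seq = strip_blocks(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the full 1590 bp sequence through the same two functions gives the gene-wide value.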
 
Protein sequence
MNNARPIRRA LLSVSDKTGI LEFAQALHAQ GVELLSTGGT AKLLADNGVP VIEVSDYTGH 
PEIMDGRVKT LHPKVHGGIL ARRGQDEDVM AANNIGPIDL VAVNLYPFAA TVAKPGCTLE
DAIENIDIGG PTMVRAAAKN HKDVVIVVNA KDYDRVLAEM SANGGSTSHA TRFDLAIAAF
EHTAAYDGMI ANYFGTMVPA HSSDECHDDS KFPRTFNTQL VKKQDLRYGE NSHQSAAFYV
DLNSDEASVA TATQLQGKAL SYNNIADTDA ALECVKEFDA PACVIVKHAN PCGVALGDNL
LDAYNRAYKT DPTSAFGGII AFNRELDGET AAAIVERQFV EVIIAPVVSQ AARDVVAKKT
NVRLLECGQW TAQTKGLDYK RVNGGLLIQD RDQGMVTEAE LKVVTKRVPT EAELKDLMFC
WKVAKFVKSN AIVYAKEGMT IGVGAGQMSR VYSAKIAGIK AADEGLVVEG SVMASDAFFP
FRDGIDAAAA AGISCIIQPG GSIRDEEVIA AADEHGMAMV FTNMRHFRH
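
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch rather than the database's own method: it builds the codon table from the compact 64-letter string used for the standard code, which assigns the same amino acids as translation table 11 (table 11 differs only in which codons may act as starts, and this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (display spacing allowed) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 30 bases of the gene sequence, as displayed.
print(translate("ATGAATAATG CCAGACCCAT TCGCCGCGCG"))  # MNNARPIRRA
print(translate("TTCACCAACA TGCGCCACTT CCGCCACTGA"))  # FTNMRHFRH
```

The two outputs match the first and last residues of the protein sequence listed above.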