Gene Shew_3412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew_3412 
SymbolpurH 
ID4923566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella loihica PV-4 
KingdomBacteria 
Replicon accessionNC_009092 
Strand
Start bp4063067 
End bp4064656 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content58% 
IMG OID640165024 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001095537 
Protein GI127514340 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATG CCAGACCTAT TCGTCGCGCG CTGTTAAGCG TTTCTGATAA AACCGGAATC 
CTGGAGTTTG CACAAGCCCT GCACGCTCAA GGTGTTGAAC TGCTATCTAC CGGCGGCACC
GCCCGTCTGC TAGCAGACAA CGGTGTGCCT GTCATTGAAG TCTCTGACTA CACGGGCCAT
CCTGAGATCA TGGACGGCCG CGTAAAGACC CTGCACCCTA AAGTGCACGG CGGCATCCTC
GGCCGTCGCG GTATCGATGA GATAGTGATG GAACAGAACG CCATTAAGCC TATCGATCTG
GTTGCCGTTA ACCTCTACCC CTTTGCCGAG ACTGTGGCCA AAGAAGGCTG CACCCTGGCC
GACGCCGTGG AAAACATCGA CATCGGCGGC CCAACCATGG TGCGCTCTAC CGCGAAGAAC
CACAAAGACA CCACCATTGT CGTTAACGCC AAAGACTATG ATCGCGTGAT CCAAGAGATG
CAGGCCAACC AGGGTAGCAC CACGCTGGAG ACACGTTTCG ATCTGGCTAT CGCCGCCTTC
GAACACACGG CCGCCTACGA TGGCATGATC GCCAACTACT TCGGCACCAT GGTACCGGCG
CACAGCCAGG ACGAGTGCCA CCAGGATTCT AAGTTCCCAC GCACCTACAA CACTCAGTTG
GTGAAGAAGC AAGATCTGCG TTACGGCGAG AACAGCCACC AGAGCGCCGC CTTCTATGTG
GACCTCAACA TCGACGAAGC CTCGGTTGCC AGCGCGACTC AGCTACAGGG TAAGGCACTG
TCTTACAACA ACATAGCCGA TACCGATGCG GCGCTGGAAT GCGTCAAAGA GTTCAGCGAG
CCGGCCTGTG TTATCGTCAA GCACGCCAAC CCTTGTGGCG TCGCCATCGG CAAGGATCTG
CTCGAGGCCT ACAACCGCGC CTACCAGACC GACCCAACCT CAGCCTTCGG CGGCATCATC
GCCTTCAACG GTGAGCTGGA TGCAGAGACC GCCAGCGCCA TCGTCGAGCG TCAGTTTGTC
GAGGTGATCA TCGCACCTGT CGTTAGCCAG GCGGCACGCG ACGTGGTGGC AGCCAAGGCC
AACGTCCGTC TGCTGGAGTG TGGTCAATGG GCTAGCAAGA CCCGCAGCCT GGACTACAAG
CGCGTCAACG GCGGCCTGCT GATCCAAGAC AGAGACCAAG GCATGGTCGA GATGAGCGAC
ATCAAGGTAG TGACTAAGCG TCAGCCTACC GAAGCCGAGA TGAAAGATCT CATGTTCTGC
TGGAAGGTGG CCAAGTTCGT TAAGTCTAAC GCCATCGTCT ACGCCAAAGA CAGCATGACA
ATCGGCGTGG GCGCCGGCCA GATGAGCCGC GTCTACAGCG CCAAGGTGGC CGGTATCAAG
GCTGCCGATG AGAATCTGGA AGTCGTAGGT TCTGTCATGG CATCCGATGC CTTCTTCCCG
TTCCGCGATG GCATCGACGC CGCGGCGGCC GCCGGTATCA GCTGCATCAT CCAGCCGGGC
GGTTCGATTC GCGATGAAGA GATCATCGCA GCCGCCGACG AGCATGGCAT GGCCATGGTC
TTCACCGGCA TGCGTCACTT CCGTCATTAA
 
Protein sequence
MNNARPIRRA LLSVSDKTGI LEFAQALHAQ GVELLSTGGT ARLLADNGVP VIEVSDYTGH 
PEIMDGRVKT LHPKVHGGIL GRRGIDEIVM EQNAIKPIDL VAVNLYPFAE TVAKEGCTLA
DAVENIDIGG PTMVRSTAKN HKDTTIVVNA KDYDRVIQEM QANQGSTTLE TRFDLAIAAF
EHTAAYDGMI ANYFGTMVPA HSQDECHQDS KFPRTYNTQL VKKQDLRYGE NSHQSAAFYV
DLNIDEASVA SATQLQGKAL SYNNIADTDA ALECVKEFSE PACVIVKHAN PCGVAIGKDL
LEAYNRAYQT DPTSAFGGII AFNGELDAET ASAIVERQFV EVIIAPVVSQ AARDVVAAKA
NVRLLECGQW ASKTRSLDYK RVNGGLLIQD RDQGMVEMSD IKVVTKRQPT EAEMKDLMFC
WKVAKFVKSN AIVYAKDSMT IGVGAGQMSR VYSAKVAGIK AADENLEVVG SVMASDAFFP
FRDGIDAAAA AGISCIIQPG GSIRDEEIIA AADEHGMAMV FTGMRHFRH