Gene Psyc_1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPsyc_1472 
SymbolpurH 
ID3515433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePsychrobacter arcticus 273-4 
KingdomBacteria 
Replicon accessionNC_007204 
Strand
Start bp1788784 
End bp1790364 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content47% 
IMG OID637670161 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_264754 
Protein GI71066027 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.255919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.32318e-06 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAGTAAAG CCCCACTTGC ACTACTGTCA GTCTCCGATA AATCTAATAT CGTTGAATTC 
GCCCAAGGTC TGATTCAGGC AGGGTTTGGT TTGTTATCGA CTGGCGGTAC TTTCCGTTTG
CTAACAGAGC ATAATGTCGC TGTCACCGAA GTATCAGATT ACACGGGTTT TCCTGAGATG
ATGGATGGTC GGGTTAAAAC ACTTCATCCC AAGATCCACG GTGGCATTTT GGGACGCCGC
GGTACAGATG ATATGGTGAT GAGTGAGCAT GCGATTGAAC GCATTGATTT GGTCGTTGTC
AACCTCTATC CATTTGCAGA AACGATTGCA CGTAGTGACG TTACCATGAA TGATGCCATC
GAAAATATCG ATATTGGCGG ACCTACTATG GTGCGTTCAG CGGCAAAGAA TCATGCACAC
GTTGGTATTG TGACTGATCC AGCTGATTAT ACGCGAGTAC TTGAAGCATT AGGCGACAGT
ACTGCATTGA CCGCTACCCT ACGTTACGAC CTAGCGGTTA AAGCATTTGA GCATACTGCA
CAATATGACG GGATGATTGC AAACTTTTTG GGTAGCCGTG TTAATGAGAG CCAAGAGCCT
GAGAGTTTTT CACGTACCTT TAACGTTCAG CTAGAAAAAG TGCAAGACCT TCGCTACGGT
GAAAACCCGC ATCAAAAGGC GGCGTTCTAT GTTGAAAATA ACTCTTCAAA AAGCAAGCAA
GCATCTATTG CTACTGCTAA GCAATTGCAA GGCAAAGCCT TGTCTTATAA CAATATCGCC
GATACTGATG CCGCGCTTGA ATGCGTTAAA GCCTTTAGCA CGCCTGCTTG TGTGATTGTA
AAGCATGCCA ACCCTTGTGG CGTTGCTGTA GATATCGATC AAGTAGCAGC ATATCGCACT
GCCTTCAGTA CCGATCCTGA GTCTTCTTTT GGCGGTATCA TCGCTTTTAA CCGCCCGTTA
ACCCTTGCAG CCGCTACAGC CATTATCGAC AATCAGTTTG TTGAAGTCAT TATTGCCCCA
AGTGTCGAAG ACGGTGTGTT AGAGGCGACT GCTTCGAAGA AAAACGTTCG CGTCTTGGTT
TGCGGCGATT TGCCAGCACC TGAGCTACGT GACCGTCAGC TTGATTATAA GCGTGTGAAT
GGTGGTTTGC TGGTGCAAGA GCAAGATTTG GGCTTGATTA CGGCTCACGA CTTAAAAATC
GTCACAGACG TGCAGCCAAC CGAAGCGCAG ATTGCTGATT TACTATTTAG CTGGAACGTT
GCAAAATACG TTAAATCTAA TGCCATCGTT TACGCTAAAG GTCAGCGTAC CATCGGTGTA
GGTGCAGGTC AGATGAGCCG TGTTAACTCA GCTCGTATCG CTGCTATTAA AGCGGAGCAC
GCTGGACTTG CAACCGAAGG CGCGGTTATG GCATCTGATG CCTTCTTCCC GTTCCGTGAT
GGTATCGACA ATGCAGCAGA AGTGGGTATT GCTGCGATTA TCCAACCAGG TGGTTCTATG
CGCGATGATG AGACCATCGC TGCCGCAAAT GAGCACGGTA TCGCCATGGT CTTCACCGGT
ATGCGTCATT TCCGTCATTA A
 
Protein sequence
MSKAPLALLS VSDKSNIVEF AQGLIQAGFG LLSTGGTFRL LTEHNVAVTE VSDYTGFPEM 
MDGRVKTLHP KIHGGILGRR GTDDMVMSEH AIERIDLVVV NLYPFAETIA RSDVTMNDAI
ENIDIGGPTM VRSAAKNHAH VGIVTDPADY TRVLEALGDS TALTATLRYD LAVKAFEHTA
QYDGMIANFL GSRVNESQEP ESFSRTFNVQ LEKVQDLRYG ENPHQKAAFY VENNSSKSKQ
ASIATAKQLQ GKALSYNNIA DTDAALECVK AFSTPACVIV KHANPCGVAV DIDQVAAYRT
AFSTDPESSF GGIIAFNRPL TLAAATAIID NQFVEVIIAP SVEDGVLEAT ASKKNVRVLV
CGDLPAPELR DRQLDYKRVN GGLLVQEQDL GLITAHDLKI VTDVQPTEAQ IADLLFSWNV
AKYVKSNAIV YAKGQRTIGV GAGQMSRVNS ARIAAIKAEH AGLATEGAVM ASDAFFPFRD
GIDNAAEVGI AAIIQPGGSM RDDETIAAAN EHGIAMVFTG MRHFRH