Gene Sde_0806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0806 
Symbol 
ID3966405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1051830 
End bp1053410 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content47% 
IMG OID637919868 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase / IMP cyclohydrolase 
Protein accessionYP_526280 
Protein GI90020453 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.663158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAATG TTGCAGATTA TGTTCAAGTT AAACGCGCTC TTATTAGCGT TTCGGACAAA 
ACTGGCATTA TCGAATTTGC CCAAGCGCTA GCGCGCCAAG GTGTAGAAAT TTTTTCCACC
GGTGGAACCT TCCGCTTGCT AAGCGAAAAC AACATCGCGG CAACAGAAAT TTCAGACTAT
ACCGGCTTCC CAGAAATGAT GAGTGGACGT GTAAAAACCT TACACCCCAA AGTCCACGGT
GGCATTTTAG GGCGCAGAGG CATAGACGAC GAAGTAATGC AGGAACACGG CATTAAGCCA
ATCGACATGG TTGTTGTTAA TCTTTACCCG TTTGAAAAAA CCGTTGCCCA ACCAGACTGC
GAATTAGTAG ATGCCATCGA AAATATCGAC ATCGGCGGCC CAACCATGGT TCGCGCAGCG
GCTAAAAACC ACAATCACGT AGCGATAGTT GTAAACAGCC ACAGTTACGC TTCTGTGCTT
ACAGAAATGG AAATGAACAA CGGTGCACTA TCGCTAGCTA CACGTTTCGA CTTATGTGTA
CAAGCTTACG AACATACAGC CGCGTACGAT GGCGCAATTG CCAACTACTT AGGCGCTAAA
GTTGAAAAAG CGGAAGACAA ATTCCCGCGC ACCTTTAACA CCCAGTTTGT TAAAGCCCAA
ACTATGCGCT ACGGCGAAAA CCCGCACCAA CAAGCGGCTT TTTATGTAGA AAAGAATTCT
CGCGAAGCAA GCATTTCAAC GGCTATTCAA TTGCAAGGAA AAGAGCTTTC GTTTAACAAC
GTTGCCGATA CCGATGCCGC ATTAGAAACC GTTAAATTGT TTAGCGAGCC TGCATGTGTA
ATTGTAAAAC ACGCCAACCC TTGCGGCGTA GCGCAAGCAG ATAACTTGTT AGATGCTTAT
CAAAAAGCGT TTGAAACAGA CCCAGAATCT GCATTTGGCG GCATCATTGC TTTTAACCGC
GAGCTAGATG CAAAAACTGC AGAAGCTATT GTAGAAAAGC AATTCGTAGA AGTTATTATC
GCGCCCTCTG TTTCTCAAGC AGCTTCCGAT ATTGTTAGCG CTAAGAAAAA TGTCCGTCTA
CTTTCTTGCG GCCAATGGTC TGCAGCTAGC GAACACGCAT TTGACTACAA GCGCGTTAAC
GGTGGCCTAC TGGTACAAGA TCGCGACAAC GGCATGATTG AAACAGCCGA CTTAAAAGTT
GTTACCAAAC GCCAGCCTAC TGAAGACGAA ATACGCGATT TATTATTCGC TTGGAAAGTG
GCAAAAATGG TTAAGTCCAA CGCAATTGTT TACGGCAAAG ACAGCCGTAC CATTGGTGTA
GGCGCTGGCC AAATGAGTCG CGTTAACTCT GCCCGTATTG CCGCAATCAA AGCCGAGCAC
GCAGGCTTAG AAGTTAAAGG CTCGGTAATG GCATCAGACG CGTTCTTCCC GTTCCGCGAC
GGCATAGATA ACGCAGCAGC CGTTGGTATT GCTGCCGTTA TTCAACCTGG TGGCTCTATG
CGCGATGAAG AAACCATCGC AGCTGCCGAC GAGCACGGCA TGGCCATGGT GTTTACCGGT
ATGCGCCACT TCCGTCACTA A
 
Protein sequence
MPNVADYVQV KRALISVSDK TGIIEFAQAL ARQGVEIFST GGTFRLLSEN NIAATEISDY 
TGFPEMMSGR VKTLHPKVHG GILGRRGIDD EVMQEHGIKP IDMVVVNLYP FEKTVAQPDC
ELVDAIENID IGGPTMVRAA AKNHNHVAIV VNSHSYASVL TEMEMNNGAL SLATRFDLCV
QAYEHTAAYD GAIANYLGAK VEKAEDKFPR TFNTQFVKAQ TMRYGENPHQ QAAFYVEKNS
REASISTAIQ LQGKELSFNN VADTDAALET VKLFSEPACV IVKHANPCGV AQADNLLDAY
QKAFETDPES AFGGIIAFNR ELDAKTAEAI VEKQFVEVII APSVSQAASD IVSAKKNVRL
LSCGQWSAAS EHAFDYKRVN GGLLVQDRDN GMIETADLKV VTKRQPTEDE IRDLLFAWKV
AKMVKSNAIV YGKDSRTIGV GAGQMSRVNS ARIAAIKAEH AGLEVKGSVM ASDAFFPFRD
GIDNAAAVGI AAVIQPGGSM RDEETIAAAD EHGMAMVFTG MRHFRH