Gene Xfasm12_0998 details


Gene Information

Locus tag: Xfasm12_0998
Symbol: purH
ID: 6121482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xylella fastidiosa M12
Kingdom: Bacteria
Replicon accession: NC_010513
Strand:
Start bp: 1078309
End bp: 1079901
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 56%
IMG OID: 641649032
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001775596
Protein GI: 170730163
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
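
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the coordinates are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both assumptions agree with the values reported here. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final codon is the stop and encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
print(gene_length(1078309, 1079901))  # 1593
print(protein_length(1593))           # 530
```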


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATCCG ATTTCCTGCC CGTGCATCGG GCACTGTTAT CTGTTTCCGA CAAAACAGGG 
TTGGTCGAGT TGGCGCGCGT GCTGCTGGCC TACAACATTG AATTACTGTC TACCGGTGGG
ACCGCGACAA TCATCCGCGA GGCTGGTCTC CCAGTGCAGG ATGTGGCCGA TCTGACAGGC
TTCCCTGAAA TGATGGACGG CCGCGTAAAA ACACTGCACC CCGTGGTACA CGGTGGTTTG
CTCGGACGCG CCGGTATCGA CGACGCCGTG ATGGCCAAAC ACGGCATCGC ACCGATCGAC
CTGCTGATAC TGAATCTGTA TCCCTTCGAA CAGATCACTG CCAAAAAAGA TTGCACGCTG
GCCGATGCAG TGGACACCAT TGATATTGGT GGCCCGGCGA TGCTGCGCTC GGCGGCAAAG
AATTTCGCGC GCGTGGCTGT GGCAACGTCA CCGGATCAGT ATCCTGATCT GCTTGCTGAA
CTGCAGGAGC ACCACGGCCA ACTTTCAGCC GAAAAGCGTT TCGCATTGGC CGTGGCGGCG
TTCAACCACG TTGCCCAGTA CGATGCGGCC ATCAGCAACT ATCTTTCGAG TGTGTCCGAC
ATGCACACAA CGTTGCCGTT GCGCCATGAA TTCCCAGCTC AGTTAAACAA TACCTTCGTA
AAGATGACAG AGCTCCGCTA CGGGGAAAAT CCGCATCAAA CAGGTGCGTT CTACCGCGAT
GTACATCCGC AACCCGGGAC GCTGGCCACC TTCCAGCAAC TCCAAGGCAA GAGACTCAGC
TACAACAACC TCGTCGATGC CGACGCCGCA TGGGAATGCG TACGTCAATT CGAAGCACCG
GCCTGCGTCA TCGTCAAACA TGCCAACCCT TGTGGCGTCG CAGTTGGGAT GGCGTGCAGT
GATGCTTATG AAGCAGCGTA TGCCACTGAT CCGACGAGTG CGTTTGGCGG CATTATTGCT
TTTAACCGCA CGTTGGATGC GGTCACTATG AAAAGCATTC TAGACCGTCA ATTTGTCGAG
GTATTCATTG CGCCGTATTA CGATGCAGAC GCCCTCGCCT ATGCCGCCAA AAAAGCCAAT
GTGCGCGTGC TGCGTATCCC CAGCAGCGCA GCGATGAAAG CAACGAATCA GTACGACTTC
AAGCGTATCG GCTCTGGGCT GCTGGTCCAA AGCGCCGATA CCATGCACAT CCATTCCGAT
GTTCTTAGGA CGGTGACCAC ACTTGCCCCC ACCGATAAAC AACGACGCGA TCTGATGTTT
GCTTGGCGTG TGGTCAAGTA CGTCAAGTCC AATGCAATTG TGTATGCCAA GGATAATCGC
ACGATTGGTA TTGGCGCCGG ACAAATGAGT CGCGTGTATT CAGCGCGTAT CGCTGGTATC
AAGGCGGCCG ATGCACATTT GGCTGTCACA GGCTCGGTGA TGGCCAGCGA TGCGTTCTTT
CCATTCCGCG ATGGCATTGA TGCCGCTGCC GCGACTGGAA TCAAGGCAGT GATTCAACCG
GGCGGTTCGA TGCGCGATAA CGAGGTGATC GCGGCGGCCG ATGAACACGG CATTGCCATG
CTATTCACCG GGATACGGCA TTTCCGGCAT TGA
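
The gene sequence above is laid out in space-separated blocks of 10 bases. A minimal sketch of how such a listing could be joined into one string and its GC content computed is given below. It is not the method the database used to derive the reported GC content field. The helper names `join_blocks` and `gc_percent` are hypothetical, and only the first line of the listing is included as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence listing and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, used here as sample input.
first_line = (
    "ATGGCATCCG ATTTCCTGCC CGTGCATCGG GCACTGTTAT CTGTTTCCGA CAAAACAGGG"
)
seq = join_blocks(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the complete listing to `join_blocks` in place of `first_line` would give the figure for the whole gene.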
 
Protein sequence
MASDFLPVHR ALLSVSDKTG LVELARVLLA YNIELLSTGG TATIIREAGL PVQDVADLTG 
FPEMMDGRVK TLHPVVHGGL LGRAGIDDAV MAKHGIAPID LLILNLYPFE QITAKKDCTL
ADAVDTIDIG GPAMLRSAAK NFARVAVATS PDQYPDLLAE LQEHHGQLSA EKRFALAVAA
FNHVAQYDAA ISNYLSSVSD MHTTLPLRHE FPAQLNNTFV KMTELRYGEN PHQTGAFYRD
VHPQPGTLAT FQQLQGKRLS YNNLVDADAA WECVRQFEAP ACVIVKHANP CGVAVGMACS
DAYEAAYATD PTSAFGGIIA FNRTLDAVTM KSILDRQFVE VFIAPYYDAD ALAYAAKKAN
VRVLRIPSSA AMKATNQYDF KRIGSGLLVQ SADTMHIHSD VLRTVTTLAP TDKQRRDLMF
AWRVVKYVKS NAIVYAKDNR TIGIGAGQMS RVYSARIAGI KAADAHLAVT GSVMASDAFF
PFRDGIDAAA ATGIKAVIQP GGSMRDNEVI AAADEHGIAM LFTGIRHFRH
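
The protein sequence above corresponds to the gene sequence translated codon by codon. The sketch below illustrates this on the first 30 bases and the last 33 bases of the gene. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons). The function name `translate` is hypothetical, and this is a plain-Python illustration, not the tool used to produce the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCATCCG ATTTCCTGCC CGTGCATCGG"))     # MASDFLPVHR
# Last 33 bases of the gene sequence, ending in the TGA stop codon.
print(translate("CTATTCACCG GGATACGGCA TTTCCGGCAT TGA"))  # LFTGIRHFRH
```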