Gene Haur_2746 details


Gene Information

Locus tag: Haur_2746
Symbol:
ID: 5734627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3500668
End bp: 3502188
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 53%
IMG OID: 641279889
Product: phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001545512
Protein GI: 159899265
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
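
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3500668
END_BP = 3502188
GENE_LENGTH_BP = 1521
PROTEIN_LENGTH_AA = 506


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # compare with Gene Length
    print(protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both computed values match the lengths listed in the record.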


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.221378
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGCTA TTGTCAGTGT TTCAGACAAA TCGGGGTTGG CGGATTTTGC TCAGGGCTTG 
AGCGATTTGG GCATTGAATT ATTTTCAACT GGCGGTACCA AAGCCGCGTT GGTTGGGGCT
GGTGTGCCAG TCCGCAGTGT CAGCGATTTG ACGGGCTTTC CTGAAATTTT GGAAGGTCGG
GTCAAAACCT TGCATCCTGG TGTGCATGCT GGCATTTTGG CCCGCCGTGA TAAGCCAGCC
CATCTGGCCC AATTGGCTGA GCACCACATC GGCCAGATCG ATTTGGTAGT TGTCAATTTG
TATCCGTTTG CTCAAACCAT CGCCAAACCT GATGTTACTT TAGAAGAAGC AGTTGAACAG
ATTGATATTG GTGGCCCAAC GATGGTGCGA GCTTCGGCCA AAAATCATGC CCATGTCTTG
ATTGTGATCG ATCCTGCTGA TTATCCACGG GTTTTGGAAG CCTTGCGGAC TGAGCAAATT
ACGCCTGAAT TGCGCCGCCA ACTTGCTGCC AAAGCCTTTG CCCATACCGC AGCCTACGAT
AGCGCGATTG CCGCCTACTT AACCGACGAA ACGTTCCCGC AGCAATTGCC CTTGGCCTGG
GAATTAGCCC AATCGCTGCG TTATGGCGAG AATCCGCATC AAGCAGCGGC GTTTTATCGC
GCTCCCAATG CTGCCGCCAA CACCTTGGCA AAAGCAGTGC AACATCAAGG CAAAGAGCTT
TCCTACAACA ACTTGCTTGA TGCTGATGCT ACCTTGCAAG TAATTCAAAA TTTTGATCAG
CCAACCGTGG CGATTATCAA GCACACTAAT CCTTGTGGCC TGGCTTCGGC TGAGGATTTG
GTGGCGGCCC ACAAAGCGGC GCGGGCGGGT GATCCGCTTT CGGCGTTTGG TGGCATCGTC
GGGGTCAATC GCCCCGTCGA TCGGGCTTTA GCCAATGTGC TGAAAAAATA TTTTTACGAA
GTGATTATTG CCCCATCCTT TAGCCCTGAA GCCTTGACAA TTTTGGCCGA AAAGCCCAAT
TTACGCTTGT TGAGCGTCGA TACCAGCCGC TCAAGCAGCA ACGATTGGGA ATATCGCAGT
ATTGGCGGCG GCATTTTGGC CCAACATGTT GATCGCGTTG GTAATGATCG CTGGGATGCT
TGGCAGGTTG TTACTGAAAC TGTGCCCAGC GATGAGCAAC TGGCAGCCTT GCAATTTGCT
TGGAAAGCTT GTGCCAGCGT GAAATCGAAT GCGATTGTCT TGGTGCAAGG CGAGGAATTG
GTGGGGATGG GCGCTGGTCA GCCTTCACGG GTTGATTCGG TATTGACGGC GATTCGCAAG
GCGGGCGAAC GGGCCAAGGG TAGCGTGCTA GCTTCCGATG CCTTCTTCCC CAAAGCCGAT
GGAATTCAGG CCGCGATTGA AGCTGGCGTG AGCGCAATTG TTCAGCCTGG TGGCTCGCAA
GGTGATGATG AAGTGATTGC TGCTGCTAAC GCCGCAGGCA TCGCGATGAT CTTCACTGCT
ACTCGCCACT TCAAACACTA A
 
Protein sequence
MRAIVSVSDK SGLADFAQGL SDLGIELFST GGTKAALVGA GVPVRSVSDL TGFPEILEGR 
VKTLHPGVHA GILARRDKPA HLAQLAEHHI GQIDLVVVNL YPFAQTIAKP DVTLEEAVEQ
IDIGGPTMVR ASAKNHAHVL IVIDPADYPR VLEALRTEQI TPELRRQLAA KAFAHTAAYD
SAIAAYLTDE TFPQQLPLAW ELAQSLRYGE NPHQAAAFYR APNAAANTLA KAVQHQGKEL
SYNNLLDADA TLQVIQNFDQ PTVAIIKHTN PCGLASAEDL VAAHKAARAG DPLSAFGGIV
GVNRPVDRAL ANVLKKYFYE VIIAPSFSPE ALTILAEKPN LRLLSVDTSR SSSNDWEYRS
IGGGILAQHV DRVGNDRWDA WQVVTETVPS DEQLAALQFA WKACASVKSN AIVLVQGEEL
VGMGAGQPSR VDSVLTAIRK AGERAKGSVL ASDAFFPKAD GIQAAIEAGV SAIVQPGGSQ
GDDEVIAAAN AAGIAMIFTA TRHFKH
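
The sequences above are laid out in 10-character blocks, so they need to be joined before any analysis. The following is a minimal sketch, not part of the original record, of how the gene sequence might be cleaned and sanity-checked. It assumes the blocks are separated only by spaces and line breaks. The stop codons listed are those of NCBI translation table 11, the table named in the record. All function names are hypothetical.

```python
# Stop codons in NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(dna) >= 6 and len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Last line of the gene sequence above, used here as a short example.
    tail = clean_sequence("ACTCGCCACT TCAAACACTA A")
    print(tail[-3:], ends_like_complete_cds(tail))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field in the record.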