Gene Athe_1445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1445 
Symbol 
ID7408103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1529244 
End bp1530785 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content37% 
IMG OID643715808 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002573316 
Protein GI222529434 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA GGGCAATTAT AAGTGTTTAC GATAAAAATG GTATAGTGGA ATTTGCAAAA 
AAGCTAAAAG AGTTTGGATA TGACATTATC TCAACGGGCG GTACTATGAA GTATTTGACC
GAAAATGGGA TTGAGGTTAT AAATATCTCT GATGTCACCC GTTTTCCAGA GATTTTGGAT
GGCAGAGTAA AGACTCTTCA TCCCAATATT CACGCAGGAA TTCTTGCAAT GAAGGATAAT
AGAGAACACT TGGAAACTTT AAAGGCGTTG GATATTCTAC CTATCGACAT GGTTGTGGTT
AACCTTTATC CGTTTAAAGA GACTATTTTC AAAGAAGATG TTACACTTGA TAACGTTATA
GAAAATATAG ATATAGGCGG GCCAACCATG ATTCGAGCTG CTGCAAAAAA TTTTAAATAC
ACAACAGTTA TAGTTGACCC TGAAGATTAC GATATAGTAG CAATGGAAAT AGAAAAAAAT
GGAGAAGTTT CTTATGAGAC AAGATTTTAT CTTGCCACAA AAGTTTTTGA ATACACCTCT
TATTATGATT CAATGATTTT TAACTATTTC AAACATGTAA GAAAAGACCA ATCGTTTTCG
AAGCATTTTA CAGTCCCACT TGAACTTTTA CAGTACTTAA GATATGGAGA AAATCCTCAC
CAGAAGGCAT GTTTTTATAA GATATCATTA CCGTTCATCG AAACCTCTAA TATTGTGAAT
TGTACACAGC TTCATGGTAA AGAACTTTCG TATAACAATA TCCTTGACAG TGACAGTGCT
ATAGAACTTT TGAAGGAATT TGATGAACCC ACATGTGTTG CTATAAAGCA CAACAATCCA
TGTGCGGTGG CATCAGCAGA GAATATTAAT GAGGCTTACA AAAAGGTTTA TGAAAGTGAC
CCGATATCAA TATTTGGCGG GATTGTTGCT TTCAACAGAA AGGTTGACAA AAATGTGGCA
GAACAGCTCA AAAAGATATT TCTTGAAATT GTAATTGCTC CGGAATTTGA CGAGGATGCT
CTTTCCATTT TGTGTTCTAA AAAAGATTTG AGAGTTTTAA AATTAGCATC CTTAGAAAAG
ACTGATACTT TCTACGATAT AAAATCTGTA AACGGCGGTG CTTTAGTACA AGAAAAGGAT
AGAATGCTTC TTGCAGACCA ACTTCAGGTT GTCACGGAAA GAAAACCTTC AGAAAAGGAA
TTGGAAGATT TAATCTTTGC ATGGAAGGTT GTAAAACATG TGAAGTCAAA TGCTATAGTT
GTAGCAAAAG ATAAAATGAC CCTGGGCATT GGAACGGGTC AGACAAATAG AATATGGGCG
GTAGAACATG CTATTTCGAG GTCGCGATTT GATTTAAAGG GAGCAGTGCT TGCGTCCGAC
GCGTTCTTCC CATTTTCAGA CAGTGTCGAA GCTGCGGGCA AAGCAGGAAT TAGTGCTATT
ATTCAGCCAG GTGGTTCTAT CCGCGACAAG GATTCAATTG AGATGGCAAA CAGGTTCAAT
ATAGCTATGG TATTCACAGG AATGAGACAT TTTAGGCATT AA
 
Protein sequence
MNKRAIISVY DKNGIVEFAK KLKEFGYDII STGGTMKYLT ENGIEVINIS DVTRFPEILD 
GRVKTLHPNI HAGILAMKDN REHLETLKAL DILPIDMVVV NLYPFKETIF KEDVTLDNVI
ENIDIGGPTM IRAAAKNFKY TTVIVDPEDY DIVAMEIEKN GEVSYETRFY LATKVFEYTS
YYDSMIFNYF KHVRKDQSFS KHFTVPLELL QYLRYGENPH QKACFYKISL PFIETSNIVN
CTQLHGKELS YNNILDSDSA IELLKEFDEP TCVAIKHNNP CAVASAENIN EAYKKVYESD
PISIFGGIVA FNRKVDKNVA EQLKKIFLEI VIAPEFDEDA LSILCSKKDL RVLKLASLEK
TDTFYDIKSV NGGALVQEKD RMLLADQLQV VTERKPSEKE LEDLIFAWKV VKHVKSNAIV
VAKDKMTLGI GTGQTNRIWA VEHAISRSRF DLKGAVLASD AFFPFSDSVE AAGKAGISAI
IQPGGSIRDK DSIEMANRFN IAMVFTGMRH FRH