Gene Amuc_1206 details

Gene Information

Locus tag: Amuc_1206
Symbol:
ID: 6273852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1441659
End bp: 1443206
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 59%
IMG OID: 642613257
Product: phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001877812
Protein GI: 187735700
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
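
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the record above; variable names are illustrative only.
start_bp = 1441659
end_bp = 1443206

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1548
print(protein_length)  # 515
```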


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATTC AGCGCGCATT GATATCCGTT TCCGACAAGA CCGGCCTGGA GGAGTTTGCC 
AAGGGCCTGC ACGAGTTTGG AGTAGAGCTT ATTTCCACTG GCGGGACCGC CGCTTTCCTG
AAAGGCCTGG GCCTTCCGGT GATTGAGATT TCCGACTATA CCGGAGAACC GGAGTTATTT
GAAGGGCGCC TGAAGACGCT TCATCCCATG GTGCACGGGG GCCTGCTGCA CCGCCGCGAC
AATGAGGAGC ATGTGCGCCA GGCTAAGGAA AACGGCATCA AGCCCATCGA CCTGGTCTGC
GTGAATCTGT ATCCTTTTGA AGAAACCGTC GCCAGGCCCG GAGTCACGCT GGAGGAAGCT
ATTGAAAAAA TAGATATCGG CGGGCCGTCC ATGCTCCGTT CCGCCGCCAA GAATTACGCT
TCCGTCACGG TGGTTTCGGA TCCCGCGGAT TACGCGCGCA TTCTGGATGA AATGCAGACC
CACAAGGGGG ATACGACCCT GAAGACCCGT GAAAACCTGG CGGTGAAGGT GTTTATGCGC
ACCTCCAATT ACGATAACGC CATCACTAAT TATCTTGGCC ACCAGAGCGC GGAAAGCACC
AAGGGCAGCT TCTGCATCTG CGCTCCCCTG TATCAGGAAT TGCGTTACGG GGACAATCCC
CACCAGGAAG CCAGCCTGTA CGGCAGCTTC GGGGATATTT TCCACCAGCT TCAGGGCAAG
GAGCTTTCCT ATACGAACGT GCTGGACATT GAAGGGGCAG CCGAGCTGAT TACCCAGTTC
CGCCGCCCGA CGGTGGGCAT TTTGAAGCAC ACGAATCCCT GCGGCGTGGG CCAGGACGAC
GAAGACCTGC GCAACGCATG GCAGAAGGCT TTTGAAACGG ATACGCAGGC CCCCTTCGGC
GGCGTGATCG TGGTCAACCG CCCGATGACG GAAGGCCTGG CCCGTGTGCT GAGCGCCATT
TTCACGGATG TCATCATCGC TCCGGAGTAT GATGCGGAAG CCCGCGCCAT TCTCCAGAAG
AAGAAAAATT GCCGCATCAT CCGCATGAAC ACGGAAGCCT GGATGAAGGC GCGCCGCGAA
CCCATCATCC GTTCCGCGCC CGGCGGGTTC ATGACCATGA AGCGGGATAC GGACGTGATG
GGGCTGGACA ATCTGGAAGC CAAGGTGGTG ACCAAGCGCC CCCCGACCGA GGAAGAATTG
ACCGCCATGC GTTTCAACTG GCGTGTCGTG AAGCAGGTTC ATTCCAACGC CATCGTTTTT
GGCGGTACGG ACCGCACGCT TGGCATCGGC GCCGGGCAGA TGAGCCGTGT GGATTCCGCC
CGCATTGCCG TCTGGAAGGC CGGGCAGGCC GGCCTGGATC TGAAGGGCAG CGTTGTGGCG
TCCGACGCCA TGTTCCCGTT TGCAGACGGC CTCCAGGTGG CGATTGACGC CGGAGCCACC
GCCTGCATCC AGCCCGGAGG TTCCATCCGC GACGAGGAAG TGATTGCCGC CGCTGATGCC
GCGGGAATTG CCATGGTATT TACGGGACAC CGCCATTTCC TCCATTAA
 
Protein sequence
MAIQRALISV SDKTGLEEFA KGLHEFGVEL ISTGGTAAFL KGLGLPVIEI SDYTGEPELF 
EGRLKTLHPM VHGGLLHRRD NEEHVRQAKE NGIKPIDLVC VNLYPFEETV ARPGVTLEEA
IEKIDIGGPS MLRSAAKNYA SVTVVSDPAD YARILDEMQT HKGDTTLKTR ENLAVKVFMR
TSNYDNAITN YLGHQSAEST KGSFCICAPL YQELRYGDNP HQEASLYGSF GDIFHQLQGK
ELSYTNVLDI EGAAELITQF RRPTVGILKH TNPCGVGQDD EDLRNAWQKA FETDTQAPFG
GVIVVNRPMT EGLARVLSAI FTDVIIAPEY DAEARAILQK KKNCRIIRMN TEAWMKARRE
PIIRSAPGGF MTMKRDTDVM GLDNLEAKVV TKRPPTEEEL TAMRFNWRVV KQVHSNAIVF
GGTDRTLGIG AGQMSRVDSA RIAVWKAGQA GLDLKGSVVA SDAMFPFADG LQVAIDAGAT
ACIQPGGSIR DEEVIAAADA AGIAMVFTGH RHFLH
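
The gene and protein sequences above can be cross-checked by translating the DNA codon by codon. The following is a minimal sketch, not the database's own method: the `translate` helper and the variable names are hypothetical, only the first and last lines of the gene sequence are used to keep the example short, and it relies on translation table 11 assigning the same amino acids to sense codons as the standard code (the tables differ in which codons may act as starts).

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases such as N are not handled
    and would raise a KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene and first 20 residues of the protein, from the record.
gene_head = "ATGGCTATTC AGCGCGCATT GATATCCGTT TCCGACAAGA CCGGCCTGGA GGAGTTTGCC"
protein_head = "MAIQRALISV SDKTGLEEFA".replace(" ", "")

# Last line of the gene (starts at base 1501, so it is in frame) and the
# matching final 15 residues of the protein.
gene_tail = "GCGGGAATTG CCATGGTATT TACGGGACAC CGCCATTTCC TCCATTAA"
protein_tail = "AGIAMVFTGH RHFLH".replace(" ", "")

print(translate(gene_head) == protein_head)  # True
print(translate(gene_tail) == protein_tail)  # True
```

Passing the full gene sequence to the same helper should yield the complete 515-residue protein, since the helper strips the spaces and line breaks used in the listing.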