Gene Amuc_0457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0457 
Symbol 
ID6275857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp545157 
End bp546158 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content56% 
IMG OID642612507 
Productphosphoribosylformylglycinamidine cyclo-ligase 
Protein accessionYP_001877076 
Protein GI187734964 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGCA AACTAACCTA CAAGCAATCG GGGGTCGATA CCAAGGAAGC AGCCGCTTTT 
GTATCCGACA TCAGCTCTCA CGTTAAAAGA ACGCAAAAGC AGAGATCTCT GCACCAGGCC
TTTGGCCTTT TCGCCGCCGC CTATGATTTG AGCTCCTACA AGGAACCGGT CATCGTCACC
GGGTGCGACG GCGTAGGCAC CAAGACGGAA ATTCTTTTTG AACTGGACAT GGTGGAAACC
GCCGGCAAAG ACCTGGTGGC CATGAACGTC AACGACATTC TTACTACGGG CGGCGACCCT
CTCCTTTTCC TGGATTATCT GGGCATCTCC AATCTGGAAC GGGAACGCTC CCGTATCACC
CGCCTGGTGG CCGGCATGTG CGACTACCTG GAATCCTGCA ACTGCATCCT GGCCGGCGGA
GAAACGGCGG AAATGCCCGG CGTGGTGCCG GAATCCATCG TGGAACTCTC CGGCTTCTGC
ATCGGCTGCT GTGAAAAAAG CAAACTGATT GATCCGAAAA CCGTCCAGCC CGGAGACGTA
TTCATCGGCT ACAAATCCGA CAGCTTCCAT GCCAACGGCT GGAGCCTCAT CCGCCGCATT
CTGGAAGAAA ATCCGGATGT AGTTGATGAA GAAGAACTCC GCTCCCTCCT TCAGCCCACC
CGTTTGTATC ATGACGTAGT GGAAGATATG CGCCGCTTCA ACGTCACCCC GAAAGCATAC
GCCCACATCA CGGGGGGAGG CCTCCCGGAA AACCTGGAAC GCTTCCTGGG CGACTACGGC
GCAGACCTCT CCATTCCCTA TTGGGACAAC ACCGCCGCCC AGAAAATCCT GAAGCATGTG
GATCCTCAGG ACCGCTTCAA CACTTTTAAC ATGGGCATTG GCTGGGTAGC CATCGTGAGG
CCTGAAGACG CGGAAGCCGC ATTGAAGGCA GGTCCGGGCG GCACGGTCAT TGGCACGCTG
AGAGAAGGCC GCGGCATTCA TGTCAAGGTA CAGGGCGAAT AA
 
Protein sequence
MSGKLTYKQS GVDTKEAAAF VSDISSHVKR TQKQRSLHQA FGLFAAAYDL SSYKEPVIVT 
GCDGVGTKTE ILFELDMVET AGKDLVAMNV NDILTTGGDP LLFLDYLGIS NLERERSRIT
RLVAGMCDYL ESCNCILAGG ETAEMPGVVP ESIVELSGFC IGCCEKSKLI DPKTVQPGDV
FIGYKSDSFH ANGWSLIRRI LEENPDVVDE EELRSLLQPT RLYHDVVEDM RRFNVTPKAY
AHITGGGLPE NLERFLGDYG ADLSIPYWDN TAAQKILKHV DPQDRFNTFN MGIGWVAIVR
PEDAEAALKA GPGGTVIGTL REGRGIHVKV QGE