Gene Amuc_0454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0454 
Symbol 
ID6275841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp541815 
End bp543353 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content57% 
IMG OID642612504 
Productprotein of unknown function DUF303 acetylesterase putative 
Protein accessionYP_001877073 
Protein GI187734961 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTCC ATACTTCATG CATCCTGGCT GCCGCCATCG CCCTTAACGC AGCGTGGGCG 
GACGCCCCCC GGTTCGACCG GATTTTCGGT TCCCATATGG TACTTCCCCA CGGGAAAAAC
ATTCCCGTTT CCGGCACGGC CGCCCCCAAC AGGGAAATAA CCGTCACTTT TGGAAACACT
GCACTGAAAA CAAAAACCGA TTCCAAAGGA AAATGGAGCG TCACGCTGCC TCCCATGCCT
CCCTGCGGCA CAGGCCAGAC GCTGGCGGCC GTGCAAAACG GAGACTCCGC CAAATTGGAA
GATGTGCTCG TGGGAGAAGT GTGGCTGGCT TCCGGCCAAT CCAACATGCT GTTCCGCCTG
AACCAGACCA CGACGGCCAA AGAAGACATA GCGGCTTCCG GAGATGACCA GCTCCGCCTG
CTCAACAATA TTCCGCAAGC CCACACCAAC AACGCCCCCT ACTCCGGGAA AGATTTTGAC
GCCGTCACCA CGGACAACTT CTATAAAGGG CAATGGGCCG CCAGCTCACC CTCCACTTCA
GGCCCCATGA CTGCCGTGGG CTACTATTTC GGGAAAAAAC TCCGCGAAGG GCTGGGCATT
CCGGTAGGCA TCATCCACTC CTCCCTGGGG GGCTCGGAAA TCGCAGCGTG GCTCCCCCGG
CAGGTCATTA ATGCAAACAA CAGTTTCCGC ACCCTGCGCG GCAACCACTG GCTGGACTCC
CCCCTTATCT CCGACTGGGT AAGGGGACGG GCCAGAAAAA ATATCTCTCC GCGTCTGAAT
CAGGGATCTC CGGACCATCC CTACAAGCCG GCGTTCCTTT ACGAATCCGG CATTGCCTGG
ACAACACCAC TCCCCATCAC AGGCGTGATC TGGTATCAGG GCGAATCCGA TGCGGAGATC
ATTGACAACG CCCAAAACGG CATGTTGCTG AAAACTATGA TCTCCTCCTG GAGAAAAGCT
TTCCGTAACC CGGAAATGCC CTTCGTCATG ATCCAGCTCC CCCGCATTAA CGATCCGTCA
AAAATCCGTG CCGGATGGCC GGAATTCCGC GAAATGCAGG ACACAGTGGC CAAAACCGTC
CCGCAGGTTT ACAGCGTCAA CACCATTGAT CTTGGCTCCA CCAATGCCGA CGTTCATCCT
CCGTTCAAGC GCCCTGTTGG GGAACGTGCA GGAAATACTG CCCTGAACAA GGTTTACGGT
AAAAAGGTCC CCTGTGAAGG CCCTGCCTTC AAGGCCTTCA AGACCTCCGG TTCCAGCATC
CTGGTCCAGA TGGAACAAGC TGCCGGACTG ACCACCACGG ACGGCAAGAA ACCGGCCCAG
TTTGAAATTG CCGGGACGGA CGGAATTTAC CATCCCGCCA CGGCGGAAAT CACAAACAGG
AAAGGCGGCA CAGCCGTTAT CCGCCTCTCC TCCCCGGAAG TAAAATCCCC CAAAAACGCC
CGGTATTGCT GGAACCGTTT CGTGACCCCC AACCTGGTCA ACGGCAACCA ACTCCCCGCC
CGTCCCTTCC GGACGGACAC ACCCGTTCTG AAAAAATAA
 
Protein sequence
MRLHTSCILA AAIALNAAWA DAPRFDRIFG SHMVLPHGKN IPVSGTAAPN REITVTFGNT 
ALKTKTDSKG KWSVTLPPMP PCGTGQTLAA VQNGDSAKLE DVLVGEVWLA SGQSNMLFRL
NQTTTAKEDI AASGDDQLRL LNNIPQAHTN NAPYSGKDFD AVTTDNFYKG QWAASSPSTS
GPMTAVGYYF GKKLREGLGI PVGIIHSSLG GSEIAAWLPR QVINANNSFR TLRGNHWLDS
PLISDWVRGR ARKNISPRLN QGSPDHPYKP AFLYESGIAW TTPLPITGVI WYQGESDAEI
IDNAQNGMLL KTMISSWRKA FRNPEMPFVM IQLPRINDPS KIRAGWPEFR EMQDTVAKTV
PQVYSVNTID LGSTNADVHP PFKRPVGERA GNTALNKVYG KKVPCEGPAF KAFKTSGSSI
LVQMEQAAGL TTTDGKKPAQ FEIAGTDGIY HPATAEITNR KGGTAVIRLS SPEVKSPKNA
RYCWNRFVTP NLVNGNQLPA RPFRTDTPVL KK