Gene Amuc_1330 details


Gene Information

Locus tag: Amuc_1330
Symbol:
ID: 6275839
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1606901
End bp: 1607980
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 59%
IMG OID: 642613386
Product: DNA-directed DNA polymerase
Protein accession: YP_001877935
Protein GI: 187735823
COG category: [L] Replication, recombination and repair
COG ID: [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair
TIGRFAM ID:
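
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 1606901
END_BP = 1607980
GENE_LENGTH_BP = 1080
PROTEIN_LENGTH_AA = 359


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1080
print(expected_protein_length(GENE_LENGTH_BP))  # 359
```

Both results agree with the Gene Length and Protein Length fields listed in the record.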


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.64899
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.124466
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGA GAAAAATCAT CCATGTGGAT ATGGATGCCT TTTACGCATC CATAGAACAG 
CGGGACCATC CCGAATACCG CGGCAAGCCC ATCGCCGTAG GCAGGCCGGA AATGCGCGGC
GTGGTGGCGG CGGCCAGTTA TGAGGCGCGC CGTTTCGGAG TGCGTTCCGC CATGCCTTCC
ATGAAGGCTC TCAAGCTTTG CCCCCATCTG ATTTTCACCC GCAACCGCAT GGATGTGTAC
AAGGCCGTCT CCGCGCAGAT ACACGCCATT TTCCACCGTT ACACAGATCT GGTGGAACCC
CTTTCCCTGG ATGAAGCCTT TCTGGACGTC ACGGAAAACA AGCCGGGCAT TCCGCTGGCC
GTCGACATTG CGAGGAGGAT TAAGAAGGAA ATCCGCCGGG AACTTCACCT GACGGCCTCC
GCCGGCGTTT CCTACAATAA ATTCCTGGCA AAAATCGCTT CCGATTACCG TAAGCCGGAC
GGGCTGTTCA CGATCCATCC ATCCCGGGCG GAAAAATTCA TCGCGGCACT TCCCATTGAA
GCTTTCTGGG GAGTCGGGCA CGCCACCGCC GAACGCATGC GCGCCCTTTC CATCACCAAC
GGGGCGCAGC TCCGGGCACG GGACAAAGAC TTCCTGGTAA GGCATTTCGG CAAAACAGGA
GCCATCTTCT ACAACTTCGC CCGCGGTGTG GACGACCGCC CTGTGGAACC TTCCCGCATG
CGCAAATCCG TGGGTTGTGA AGAAACCTAC CGGGAAAACG TCACCAGGGC GGAAGCGCTG
GAACAACGCC TCCCCCTGCT GGCGGAAGAA CTCGCGGGGC GGCTGGCCCG TTCCGGCTTC
CGGGGAAACA CCCTTACCCT GAAGGTTAAG TTCCCGGACT TTGTCCAGAA GACCCGCTGC
GCGACCGTTC CGGAAATCCT GACGGAGAAA GAAGGAATTC TCCCCCTGGC CCGCACCCTG
ATGGAAGAAC TGGATTCCGG GGACCGTACA TTCCGCCTTC TGGGGCTGTC CGTCTCCCAT
CCCCAGGAAG AACAGCGGCA GGGCATCTGG GAACAGCTCT GGCTGGAGCT GGAGTATTAA
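
The GC content reported for this gene can be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text with the spaces and line breaks used in this record; `clean_sequence` and `gc_content` are hypothetical helper names, and the database's own rounding rule is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, as formatted in the record.
first_row = "ATGAACCAGA GAAAAATCAT CCATGTGGAT ATGGATGCCT TTTACGCATC CATAGAACAG"
print(len(clean_sequence(first_row)))  # 60
print(gc_content("ATGC"))              # 50.0
```

Passing the full 1080 bp sequence to `gc_content` gives a figure to compare with the GC content field.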
 
Protein sequence
MNQRKIIHVD MDAFYASIEQ RDHPEYRGKP IAVGRPEMRG VVAAASYEAR RFGVRSAMPS 
MKALKLCPHL IFTRNRMDVY KAVSAQIHAI FHRYTDLVEP LSLDEAFLDV TENKPGIPLA
VDIARRIKKE IRRELHLTAS AGVSYNKFLA KIASDYRKPD GLFTIHPSRA EKFIAALPIE
AFWGVGHATA ERMRALSITN GAQLRARDKD FLVRHFGKTG AIFYNFARGV DDRPVEPSRM
RKSVGCEETY RENVTRAEAL EQRLPLLAEE LAGRLARSGF RGNTLTLKVK FPDFVQKTRC
ATVPEILTEK EGILPLARTL MEELDSGDRT FRLLGLSVSH PQEEQRQGIW EQLWLELEY
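
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; first base varies slowest),
# paired with the standard amino acid assignments. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(raw_dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from the record.
first_row = "ATGAACCAGA GAAAAATCAT CCATGTGGAT ATGGATGCCT TTTACGCATC CATAGAACAG"
print(translate(first_row))  # MNQRKIIHVDMDAFYASIEQ
```

The output matches the first 20 residues of the protein sequence listed above. A codon containing a character other than A, C, G, or T raises a `KeyError` in this sketch.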