Gene Amuc_1938 details

Gene Information

Locus tag: Amuc_1938
Symbol:
ID: 6275203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2350820
End bp: 2353657
Gene Length: 2838 bp
Protein Length: 945 aa
Translation table: 11
GC content: 56%
IMG OID: 642613998
Product: DNA polymerase I
Protein accession: YP_001878532
Protein GI: 187736420
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
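
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is an untranslated stop codon; the function names are hypothetical and not part of the source database.

```python
# Sketch under stated assumptions: coordinates are 1-based and inclusive,
# and the last codon of the CDS is a stop codon that is not translated.


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2350820, 2353657)
print(length)                     # 2838
print(protein_length_aa(length))  # 945
```

Both results match the Gene Length (2838 bp) and Protein Length (945 aa) values listed in the record.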


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.632824
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.0549057
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGATT CCCCTTCCAA GCGCCTTTTT ATTCTGGATG GAATGGCCCT GGCTTACAGA 
GCCCACTTCG CCTTTTTCTC CAATCCTATC CGCAATTCCA AGGGAGTCAA TACTTCCGCC
GTGTACGGCT TCGCCAATAC GCTGCTGGGC ATTCTGGAGC ACGAACGCCC CACGCACATC
GCGGCCTGTT TCGACACTTC CGCTCCCACG GCGCGCCATA AGCTTTACCC TGCCTATAAA
GCCAACCGGG AATCCATGCC GGAAGAGTTG AGCGACCAAA TGCCCCTGAT TTTCAGATTG
CTGGAGGCCA TGAATATTCC CATTCTGCGC TATGAGGGCT ATGAGGCGGA CGATACGATA
GGCACGCTGG CACGCATCGC GGACGGTACG GAGGGATTCC AGACCTACAT GGTTTCCCAG
GACAAGGACC TGGGCCAGCT TATTTCCTCC ACCTGCTTTC TGTGGAAACC CGGCAAAAGG
GGCAATGACC ACGAAGTGAT TGACCTGGCA AAGCTCAAGG AGCAATGGGG CATTGAACGT
GCGGACCAGG TAGTTGATAT TCTGGCCCTG ATGGGCGACA GCTCCGACAA TATTCCGGGG
CTTCCCGGCG TGGGGGAAAA GACGGCTAAG CTGCTGATCG GAGAGTTCGG CTCCGTGGAA
AACCTGCTGT CTAATACGGA TAAACTGAAA GGGAAGCGCA AACAGATTGT GGAGGAAAAC
GGGGCCATGG CAACCCTTTC CAAGCAACTG GCCACCATTG ACCGGAACGT TCCCCTGACG
GTGACCCTGC CTGAATTGGT TAAAAGAGAA CCCAGTCCGG AAGAACTGCA GGCCCTTCTC
CAGGAGTTGG AATTCCGGTC CATGCAGGCC AAGCTGTTCG GGAAAAAAGC GCCGGAGCCC
AGAAAAGCCC CCCTCCCGGC GGACGATTTG TTTGCTCCCG CCCCCCGGAC GGAACAACCT
CTGTCGGCGG AACCCTCCGC CCCTGTGTCG GGAGCACGGC AGAACGGTTC GGGACAAATG
GATTTATTTG AGGAACGCCA TTTGAAAACG GTAGATGATT TCAGGCACGA ATATATTATT
GCAGATACGC CGGAAGCCCG TTCCTCCATG GCAGCCGAGC TGGAAAAGTA TGATTCCTGG
TGCTTCGACA CGGAAACGAC GGGCCTGAAC CCCCTCATGG ACAACCTGCT GGGCGTCTCC
TTTTGCGCTG AACCGCACAA GGCATGGTAC ATGCCTGTCT CCGGTCCGGC GGATCTGGAA
GCGGTCAGGC CGCTCCTGGA AGGCCCCGCA GAGAAGATAG GGCATCACCT GAAATTTGAC
CTGGAGGTTT TGCGAGCCAA CGGCATTCAT GTCAAAGGCC CCTTTTTCGA TACGTTGCTG
GCTCATGCCC TGATCGCTCC AGGCATGAAG CACGGAATGG ACGTTCTGGC GGAAAATTTG
CTGCAATATT CCACGATTAA ACTGAAGGAC ATTGCCGCTC CGGGAGCAAA AAAACGGGAA
CTGGACACCA GCGGCGTTCC CGTGGAAGTA ATGGGCAAAT ATTCGGCGGA GGATGCGGAT
ATCACCCTCC AGCTTTCCGC CGTCCTGAAA AGGCAGGTCA AGGAGAGCGG CATGGAAAAA
CTGTTCCGTA CCGTGGAATT GCCCCTGCTT CCCGTGCTGG CGGACATGGA GTTTTCCGGT
ATCCGCGTGC TTCCGGAATC CCTGGAAAAG GCTTCCGTCA AGGTAGGAGC CATCATTGAC
GGCCTGCGGG AAAGAATTGA AGAAGCCGCA GGCCATCCCC TGAATCTGAA TTCCCCCAAG
CAGCTCGGAG ATTTCCTGTT CGGAGAACTG GAGCTGGTGA AGAAGCCCAA GAAGACGAAG
ACGGGCCAGT TCGTGACGGA TGAAGACACC CTTTCCGCTC TGGCCCCCCA GCATCCCATT
ATAGCGGATA TCCTGGCCTA CCGGGAGAAT ATGAAGCTGA AGAGCACATA TCTGGATGCG
CTGCCCAAGT ATATCTGCCC GCGGGACGGA CGCATCCATA CCCAATTCCA CCAGATGCTG
ACCGCAACCG GGAGGCTTGC CTCCCAGGAT CCCAATCTTC AAAATATTCC GGTAAGGACG
GAACAGGGGC GCCTGATCCG CACCGCCTTT GTCCCCGCCT CAGAGAAATA CACCATGCTG
TCTGCGGATT ATTCCCAGAT TGAACTGCGC ATCATGGCGG CTCTTTCAGG AGATCCCGCC
ATGTGCGGAG CATTCAGGGA AGGACGGGAC ATCCATACGG AAACGGCCGC CCGTGTGTAC
GGCATTCCCC GCGACCAGGT GGACGCCGTT ATGCGCCGTG CGGCCAAAAC GGTGAATTTC
GGCATTATTT ACGGCATTTC CGCCTTCGGT CTCTCCCAGA GGCTGGGCTG CCCCCGCGGA
GAAGCCGCCA CTTTGATTGA AAACTATTTT ACCCAGTTCC CCGTAGTTAA ATCCTTCATG
GAAGACCTGG TTCACAAAGC GGAACAAGCC GGTTACGCGG AAACGCTATT GGGGCGACGG
AGGATGATTC CGGAAATCAA TTCCGCCAAC AAGACCATTA AATCCGCTGC GGAACGTACT
GCCATCAACA CCCCCATCCA GGGCACGGCG GCGGATATGA TTAAAATAGC CATGATTCAT
GTGGATAAAT TGCTGAAAGG CACCAGATCC CGGCTTATCC TCCAGATTCA TGATGAATTG
CTGGTGGACC TGCACAGGGA TGAGTTGGAT CTCATTCCCA AGATAGAGGA AGCCATGGTC
AGCGCGCTGC CCCTGCCCAA CGGCGTTCCC ATTCTGGTGG AAGCCAGGAC GGGAGGCAAT
TGGCTGGAAG CCCATTAA
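
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be cleaned before any computation. The sketch below shows one way to strip the layout and compute a GC percentage that can be compared with the GC content field of the record; the helper names are hypothetical, and the record does not say how its own figure was rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACTGATT CCCCTTCCAA GCGCCTTTTT ATTCTGGATG GAATGGCCCT GGCTTACAGA"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence as a single multi-line string works the same way, since all whitespace is discarded.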
 
Protein sequence
MTDSPSKRLF ILDGMALAYR AHFAFFSNPI RNSKGVNTSA VYGFANTLLG ILEHERPTHI 
AACFDTSAPT ARHKLYPAYK ANRESMPEEL SDQMPLIFRL LEAMNIPILR YEGYEADDTI
GTLARIADGT EGFQTYMVSQ DKDLGQLISS TCFLWKPGKR GNDHEVIDLA KLKEQWGIER
ADQVVDILAL MGDSSDNIPG LPGVGEKTAK LLIGEFGSVE NLLSNTDKLK GKRKQIVEEN
GAMATLSKQL ATIDRNVPLT VTLPELVKRE PSPEELQALL QELEFRSMQA KLFGKKAPEP
RKAPLPADDL FAPAPRTEQP LSAEPSAPVS GARQNGSGQM DLFEERHLKT VDDFRHEYII
ADTPEARSSM AAELEKYDSW CFDTETTGLN PLMDNLLGVS FCAEPHKAWY MPVSGPADLE
AVRPLLEGPA EKIGHHLKFD LEVLRANGIH VKGPFFDTLL AHALIAPGMK HGMDVLAENL
LQYSTIKLKD IAAPGAKKRE LDTSGVPVEV MGKYSAEDAD ITLQLSAVLK RQVKESGMEK
LFRTVELPLL PVLADMEFSG IRVLPESLEK ASVKVGAIID GLRERIEEAA GHPLNLNSPK
QLGDFLFGEL ELVKKPKKTK TGQFVTDEDT LSALAPQHPI IADILAYREN MKLKSTYLDA
LPKYICPRDG RIHTQFHQML TATGRLASQD PNLQNIPVRT EQGRLIRTAF VPASEKYTML
SADYSQIELR IMAALSGDPA MCGAFREGRD IHTETAARVY GIPRDQVDAV MRRAAKTVNF
GIIYGISAFG LSQRLGCPRG EAATLIENYF TQFPVVKSFM EDLVHKAEQA GYAETLLGRR
RMIPEINSAN KTIKSAAERT AINTPIQGTA ADMIKIAMIH VDKLLKGTRS RLILQIHDEL
LVDLHRDELD LIPKIEEAMV SALPLPNGVP ILVEARTGGN WLEAH
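
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method: it builds the codon table by hand, relying on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The function name `translate` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping shared by the standard code and table 11,
# written in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
print(translate("ATGACTGATT CCCCTTCCAA GCGCCTTTTT ATTCTGGATG GAATGGCCCT GGCTTACAGA"))
# MTDSPSKRLFILDGMALAYR
print(translate("TGGCTGGAAG CCCATTAA"))
# WLEAH
```

The two outputs correspond to the first twenty and last five residues of the protein sequence listed above.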