Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1938 |
Symbol | |
ID | 6275203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2350820 |
End bp | 2353657 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642613998 |
Product | DNA polymerase I |
Protein accession | YP_001878532 |
Protein GI | 187736420 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.632824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.0549057 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATT CCCCTTCCAA GCGCCTTTTT ATTCTGGATG GAATGGCCCT GGCTTACAGA GCCCACTTCG CCTTTTTCTC CAATCCTATC CGCAATTCCA AGGGAGTCAA TACTTCCGCC GTGTACGGCT TCGCCAATAC GCTGCTGGGC ATTCTGGAGC ACGAACGCCC CACGCACATC GCGGCCTGTT TCGACACTTC CGCTCCCACG GCGCGCCATA AGCTTTACCC TGCCTATAAA GCCAACCGGG AATCCATGCC GGAAGAGTTG AGCGACCAAA TGCCCCTGAT TTTCAGATTG CTGGAGGCCA TGAATATTCC CATTCTGCGC TATGAGGGCT ATGAGGCGGA CGATACGATA GGCACGCTGG CACGCATCGC GGACGGTACG GAGGGATTCC AGACCTACAT GGTTTCCCAG GACAAGGACC TGGGCCAGCT TATTTCCTCC ACCTGCTTTC TGTGGAAACC CGGCAAAAGG GGCAATGACC ACGAAGTGAT TGACCTGGCA AAGCTCAAGG AGCAATGGGG CATTGAACGT GCGGACCAGG TAGTTGATAT TCTGGCCCTG ATGGGCGACA GCTCCGACAA TATTCCGGGG CTTCCCGGCG TGGGGGAAAA GACGGCTAAG CTGCTGATCG GAGAGTTCGG CTCCGTGGAA AACCTGCTGT CTAATACGGA TAAACTGAAA GGGAAGCGCA AACAGATTGT GGAGGAAAAC GGGGCCATGG CAACCCTTTC CAAGCAACTG GCCACCATTG ACCGGAACGT TCCCCTGACG GTGACCCTGC CTGAATTGGT TAAAAGAGAA CCCAGTCCGG AAGAACTGCA GGCCCTTCTC CAGGAGTTGG AATTCCGGTC CATGCAGGCC AAGCTGTTCG GGAAAAAAGC GCCGGAGCCC AGAAAAGCCC CCCTCCCGGC GGACGATTTG TTTGCTCCCG CCCCCCGGAC GGAACAACCT CTGTCGGCGG AACCCTCCGC CCCTGTGTCG GGAGCACGGC AGAACGGTTC GGGACAAATG GATTTATTTG AGGAACGCCA TTTGAAAACG GTAGATGATT TCAGGCACGA ATATATTATT GCAGATACGC CGGAAGCCCG TTCCTCCATG GCAGCCGAGC TGGAAAAGTA TGATTCCTGG TGCTTCGACA CGGAAACGAC GGGCCTGAAC CCCCTCATGG ACAACCTGCT GGGCGTCTCC TTTTGCGCTG AACCGCACAA GGCATGGTAC ATGCCTGTCT CCGGTCCGGC GGATCTGGAA GCGGTCAGGC CGCTCCTGGA AGGCCCCGCA GAGAAGATAG GGCATCACCT GAAATTTGAC CTGGAGGTTT TGCGAGCCAA CGGCATTCAT GTCAAAGGCC CCTTTTTCGA TACGTTGCTG GCTCATGCCC TGATCGCTCC AGGCATGAAG CACGGAATGG ACGTTCTGGC GGAAAATTTG CTGCAATATT CCACGATTAA ACTGAAGGAC ATTGCCGCTC CGGGAGCAAA AAAACGGGAA CTGGACACCA GCGGCGTTCC CGTGGAAGTA ATGGGCAAAT ATTCGGCGGA GGATGCGGAT ATCACCCTCC AGCTTTCCGC CGTCCTGAAA AGGCAGGTCA AGGAGAGCGG CATGGAAAAA CTGTTCCGTA CCGTGGAATT GCCCCTGCTT CCCGTGCTGG CGGACATGGA GTTTTCCGGT ATCCGCGTGC TTCCGGAATC CCTGGAAAAG GCTTCCGTCA AGGTAGGAGC CATCATTGAC GGCCTGCGGG AAAGAATTGA AGAAGCCGCA GGCCATCCCC TGAATCTGAA TTCCCCCAAG CAGCTCGGAG ATTTCCTGTT CGGAGAACTG GAGCTGGTGA AGAAGCCCAA GAAGACGAAG ACGGGCCAGT TCGTGACGGA TGAAGACACC CTTTCCGCTC TGGCCCCCCA GCATCCCATT ATAGCGGATA TCCTGGCCTA CCGGGAGAAT ATGAAGCTGA AGAGCACATA TCTGGATGCG CTGCCCAAGT ATATCTGCCC GCGGGACGGA CGCATCCATA CCCAATTCCA CCAGATGCTG ACCGCAACCG GGAGGCTTGC CTCCCAGGAT CCCAATCTTC AAAATATTCC GGTAAGGACG GAACAGGGGC GCCTGATCCG CACCGCCTTT GTCCCCGCCT CAGAGAAATA CACCATGCTG TCTGCGGATT ATTCCCAGAT TGAACTGCGC ATCATGGCGG CTCTTTCAGG AGATCCCGCC ATGTGCGGAG CATTCAGGGA AGGACGGGAC ATCCATACGG AAACGGCCGC CCGTGTGTAC GGCATTCCCC GCGACCAGGT GGACGCCGTT ATGCGCCGTG CGGCCAAAAC GGTGAATTTC GGCATTATTT ACGGCATTTC CGCCTTCGGT CTCTCCCAGA GGCTGGGCTG CCCCCGCGGA GAAGCCGCCA CTTTGATTGA AAACTATTTT ACCCAGTTCC CCGTAGTTAA ATCCTTCATG GAAGACCTGG TTCACAAAGC GGAACAAGCC GGTTACGCGG AAACGCTATT GGGGCGACGG AGGATGATTC CGGAAATCAA TTCCGCCAAC AAGACCATTA AATCCGCTGC GGAACGTACT GCCATCAACA CCCCCATCCA GGGCACGGCG GCGGATATGA TTAAAATAGC CATGATTCAT GTGGATAAAT TGCTGAAAGG CACCAGATCC CGGCTTATCC TCCAGATTCA TGATGAATTG CTGGTGGACC TGCACAGGGA TGAGTTGGAT CTCATTCCCA AGATAGAGGA AGCCATGGTC AGCGCGCTGC CCCTGCCCAA CGGCGTTCCC ATTCTGGTGG AAGCCAGGAC GGGAGGCAAT TGGCTGGAAG CCCATTAA
|
Protein sequence | MTDSPSKRLF ILDGMALAYR AHFAFFSNPI RNSKGVNTSA VYGFANTLLG ILEHERPTHI AACFDTSAPT ARHKLYPAYK ANRESMPEEL SDQMPLIFRL LEAMNIPILR YEGYEADDTI GTLARIADGT EGFQTYMVSQ DKDLGQLISS TCFLWKPGKR GNDHEVIDLA KLKEQWGIER ADQVVDILAL MGDSSDNIPG LPGVGEKTAK LLIGEFGSVE NLLSNTDKLK GKRKQIVEEN GAMATLSKQL ATIDRNVPLT VTLPELVKRE PSPEELQALL QELEFRSMQA KLFGKKAPEP RKAPLPADDL FAPAPRTEQP LSAEPSAPVS GARQNGSGQM DLFEERHLKT VDDFRHEYII ADTPEARSSM AAELEKYDSW CFDTETTGLN PLMDNLLGVS FCAEPHKAWY MPVSGPADLE AVRPLLEGPA EKIGHHLKFD LEVLRANGIH VKGPFFDTLL AHALIAPGMK HGMDVLAENL LQYSTIKLKD IAAPGAKKRE LDTSGVPVEV MGKYSAEDAD ITLQLSAVLK RQVKESGMEK LFRTVELPLL PVLADMEFSG IRVLPESLEK ASVKVGAIID GLRERIEEAA GHPLNLNSPK QLGDFLFGEL ELVKKPKKTK TGQFVTDEDT LSALAPQHPI IADILAYREN MKLKSTYLDA LPKYICPRDG RIHTQFHQML TATGRLASQD PNLQNIPVRT EQGRLIRTAF VPASEKYTML SADYSQIELR IMAALSGDPA MCGAFREGRD IHTETAARVY GIPRDQVDAV MRRAAKTVNF GIIYGISAFG LSQRLGCPRG EAATLIENYF TQFPVVKSFM EDLVHKAEQA GYAETLLGRR RMIPEINSAN KTIKSAAERT AINTPIQGTA ADMIKIAMIH VDKLLKGTRS RLILQIHDEL LVDLHRDELD LIPKIEEAMV SALPLPNGVP ILVEARTGGN WLEAH
|
| |