Gene Amuc_1228 details

Gene Information

Locus tag: Amuc_1228
Symbol:
ID: 6275772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1475565
End bp: 1477763
Gene Length: 2199 bp
Protein Length: 732 aa
Translation table: 11
GC content: 50%
IMG OID: 642613284
Product: DNA polymerase III, subunits gamma and tau
Protein accession: YP_001877834
Protein GI: 187735722
COG category: [L] Replication, recombination and repair
COG ID: [COG2812] DNA polymerase III, gamma/tau subunits
TIGRFAM ID: [TIGR02397] DNA polymerase III, subunit gamma and tau
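
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information record.
START_BP = 1475565
END_BP = 1477763
GENE_LENGTH_BP = 2199
PROTEIN_LENGTH_AA = 732


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1477763 - 1475565 + 1 gives 2199 bp, and 2199 / 3 - 1 gives 732 residues, which agrees with the listed gene and protein lengths.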


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.0388635
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTTACC AGGTCTTTGC CAGAAAATAC CGTCCTCTGA CATTCGATGA CGTCCTTGGA 
CAGGATCATG TTGTCCGGAC GTTGAAAAAC GCCATTGAGC ACAACCGTCT CGCCCATGCC
TATCTCTTTG TAGGTCCCCG TGGAACGGGA AAGACATCAA CCGCAAGAAT TTTTGCGAAA
GCTCTAAACT GCAGCGGTGG GCCCAAAGTG GATTTTGACC CGCATGAAGA TATATGTGAA
GAAATTGCAG AAGGGCGGAG CCTGGATGTT CTTGAAATAG ACGGAGCATC CAACCGCGGC
ATTGATCACA TCCGGGATCT CAGGGACAAC GTACGGTTCG CTCCGAGCAG AGGAAATTTC
CGTATCGTCT ATATAGATGA AGTGCACATG CTCACTAAGG AATCCTTCAA CGCCCTGCTC
AAAACGCTGG AAGAACCGCC ACCTCATGTC AAATTTATTT TCGCAACCAC GGAACCCCAT
AAAATACTGC CAACCATTCT ATCGCGATGC CAGCGCTTCG ACCTGCGCCC CATCCCTTCT
GAAATCATTG CAGAACACCT GCTCCATATC GCTTCTGCAG AGGGAGTAAG TTTGAGCAGG
GAAGCAGCCT TTGCTGTCGC CAAAGTGGCG GATGGAGGTA TGCGGGATGC ACAATCCATG
CTGGACCAGC TTGTTTCCTT CTGTGGAGAT CATATTGAAG AACAGCAGGT ACTCCACATT
TTCGGCATTA CTTCCCGGGA GACCGTGGCC CATGCTCTGG CGCTTATTTT GAACAAGGAA
CTTCCCTCTC TCCTGCATCT TCTGCATGAA CAGGCGGAAG CAGGAAGAGA CATGAGCCAG
TTCCTTTCCG AAATTATCTC CGCCGTGCGT GAAATCCTGG TCTCTAAAGT AGATCCAGAA
GCCAGCTTTG ATTCTCTCCC GGAATCCTCC AAGGAGGAAC TCGCCGAATT GGTCAAACGC
ACCCATACGG ACAAAATCCT GCGTTTGGTG GAAGTTCTGG CGGAAACGGA AGATAAAATG
CGCTGGTCCA CCAATAAAAG GCTTCATCTG GAAATGGGCC TGATTAAAGC CGTTCATACT
CTGGCTGAGG CCAGCATCAG CGATATTATC ATGGCGTTGG AAGGTGCTCC GCTAGCCACC
GCGGCACCGG CCTCTTCTTC AGATCTTGCT TCACAGCAGG AACCAAGTAC ATTTATTTCC
TCCGCCACGG CACCCACTCC GGCCCCCCGG CAAAACATCC AGTCTCCTGT TGCTGCAACA
AACCCAACCC CAGCTACGGA AGAACATTTG ATGGATCCGA TTCCAGATTT TTCTTTGTCT
TCCCCTGCTT CATCTCCCGC AGTACAGAAA ATTCAGAGCA CTGACGCTCA AGCTTCTCCC
CATCCTACGC AGGAAGCCGA ACCACAACAG GAAACTCTTA CACCCGCTGC CCCCCCCACC
TCTCTTGCCG TTGCAGAGGA AGAATTTCCC TACTCCGCCA CTAAATTGCG GACAAAAACC
GTACCTGCCT CCGAAGAAAT CACCCGGCAT CCAGACTCCG ATGCCTCCGA TGTGTCGAAA
TTTCCTACGG ATTCTCCAGA GCCTACTTTT ATGGATCCAG AGGAAAATCT TCCTCTGGAA
AGACGAACTA ACAGTTTCTT TGACAATCTA TTCGATACTC CAAGCACATC ATCCCAAACC
CAGGCACCCG TAGAAAAAGA GGAACCGGCA GTTCTGTCCC AACAATCAGG AAGAACAATC
ACGGAAGAAG ACTGGAAAAC GGTGTTGGAA CAAATGTCCG CCAAATTCCC TCTTCAAGCG
GACTTTCTAG CCAACAGCGT TTTTTCCGGT TATGATGGTG TAGCAGTCGC CATTTCTTTT
CATCCTTCTG ACCGTCAGGG GATGGACAGT CTTGGAACCG GCCCTCTGCG AACAGCTTTG
GAAGCTGCTC TTTCCCAATA TATTGGAACC TCTGTTACTA TTTCTATCCG TCAGGATTTC
TCCATTCCCG AACCCGTGCA GGAAGAACTG GCTCCTCTGC CTGCACCACC TCCTTCCGCA
TCCACCATTC CTGCACCTAA ACCTCATACC CCGCAGAAAA AAACCGCAGA GCCTGTCCAG
GAAGCTTCCA AAGAGGAAAA TGAAGATAAC TCTTACTACA CGGATCCGCT GATTGATGCT
GCTATGGAGA TTTTCCGGGC TCGTATCATT TCCCAATAG
 
Protein sequence
MSYQVFARKY RPLTFDDVLG QDHVVRTLKN AIEHNRLAHA YLFVGPRGTG KTSTARIFAK 
ALNCSGGPKV DFDPHEDICE EIAEGRSLDV LEIDGASNRG IDHIRDLRDN VRFAPSRGNF
RIVYIDEVHM LTKESFNALL KTLEEPPPHV KFIFATTEPH KILPTILSRC QRFDLRPIPS
EIIAEHLLHI ASAEGVSLSR EAAFAVAKVA DGGMRDAQSM LDQLVSFCGD HIEEQQVLHI
FGITSRETVA HALALILNKE LPSLLHLLHE QAEAGRDMSQ FLSEIISAVR EILVSKVDPE
ASFDSLPESS KEELAELVKR THTDKILRLV EVLAETEDKM RWSTNKRLHL EMGLIKAVHT
LAEASISDII MALEGAPLAT AAPASSSDLA SQQEPSTFIS SATAPTPAPR QNIQSPVAAT
NPTPATEEHL MDPIPDFSLS SPASSPAVQK IQSTDAQASP HPTQEAEPQQ ETLTPAAPPT
SLAVAEEEFP YSATKLRTKT VPASEEITRH PDSDASDVSK FPTDSPEPTF MDPEENLPLE
RRTNSFFDNL FDTPSTSSQT QAPVEKEEPA VLSQQSGRTI TEEDWKTVLE QMSAKFPLQA
DFLANSVFSG YDGVAVAISF HPSDRQGMDS LGTGPLRTAL EAALSQYIGT SVTISIRQDF
SIPEPVQEEL APLPAPPPSA STIPAPKPHT PQKKTAEPVQ EASKEENEDN SYYTDPLIDA
AMEIFRARII SQ
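
The gene and protein sequences above can be related to each other with a short translation routine. The following is a minimal sketch, not the tool used to produce this record: it uses only the Python standard library, builds the codon table by hand, and assumes that the first codon is always read as methionine. That last assumption matters here because the gene starts with GTG, which translation table 11 accepts as an alternative start codon. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino acid assignments
# of translation table 11 match the standard code; the tables differ in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading the first codon as Met (assumption)."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = ["M"] + [CODON_TABLE[c] for c in codons[1:]]
    if protein[-1] == "*":  # drop a terminal stop codon if present
        protein.pop()
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 nucleotides of the gene sequence listed above.
    print(translate_cds("GTGAGTTACC AGGTCTTTGC CAGAAAATAC"))  # MSYQVFARKY
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` is one way to compare the result against the listed protein sequence and GC content.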