Gene Amuc_1452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1452 
Symbol 
ID6275676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1743347 
End bp1745848 
Gene Length2502 bp 
Protein Length833 aa 
Translation table11 
GC content62% 
IMG OID642613512 
Productexcinuclease ABC, A subunit 
Protein accessionYP_001878055 
Protein GI187735943 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.523417 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTTC CCATCTCCAT CCGCGGCGCG CGCCAGCACA ACCTCCGGAA CCTGAATCTG 
GATCTTCCGT CCAACAAGCT CATCGTATTC TGCGGCCCTT CCGGCTCCGG AAAATCTTCC
CTGGCGTTTG ATACGCTTTT TTCCGAATCC CGCAGGCGTT TTCTGGACTG CCTGTCGGCA
CGCTCCAGGC AGGGCATGGA TCAACCGGAA AAACCGGAAG TGGACAGCAT TACCGGGCTG
CCCCCGGCCC TGTGCCTGGA GCAATCCGCC AGGCAGCAGA GTTCCCGCAC CCTGCTGGGG
AGCATCACGG AAATTCTGGA CTACCTGCGC ATCCTTTACG CGGCTGCCGG CACGCCCCAT
GACCCGGAGA CAGGAAAGGA ACTGGAACGC AAGAGCCCGG ACCGGATTAC GGAAGAACTC
GTTTCCCTGC CGGAACACAC GCGCCTGATT CTGACCGCTC CGGCGGAAAA CCTGCTGGCC
CAGGATCCCG CAGCGACGCT GGCCGACTTC CAGCGGCAGG GCTTCCTCCG GGTTTACTGG
AACGGAGAAA TGCGGGATAT TGAAGAAATA AGTTCCCCCG TCCCCCCGCC TCCAGACGCG
GCCCTGGTCA TTGACCGCCT CATCGTCAGA GGGGAAAATA CGGCCTCGCG CATTGCGGAT
TCCCTGCAAA CGGCTCTCCG TATCAATCCG GACGAGGTGC GGGCCATCAT CACCATACCG
GGAGAGGAAG CCTCAATCCG GGCCTTCCAC ACCCGCTACC GCAATCCGGA AACAGGCTTC
CTTCTGCCCC AGCTTACGCC CCGCCATTTT TCCTTCAACT CCCCGCTGGG GGCATGCCCC
TCCTGCCGGG GAACCGGCCT GAATGAACAG GAAAACGGTC CGTGCCGCGC CTGCGGAGGC
CAGCGTCTTT CCCCTCTGGC CCTGGCCGTC ACCATGCCTG CGCCGGACCG GGCCTACAAT
CTCGCGGAAC TGACGGCTCT TCCTCTGGAA GATATGGCAG GAGAACTGGA ACGACTGAAA
ACGCCCCCCT CCCTGGCGGC GGCATTGACC CCGCTCATGG AGGAAATCAA CAAACGCGTG
CGCTTCCTGA ATGAGCTGGG ACTCTCCTAC CTGTCCCTGG ACCGCCAGGC AAACACCCTC
TCCGGAGGCG AACTGCAGAG GGCGCGCCTG GCTTCCCAGC TGGGAGGCGG CCTTTCCGGA
GTCCTTTACA TCCTGGACGA ACCCACGGCC GGACTGCACC CCGCCGATAC GGACCGCCTG
CTCCGCGCTC TCCGGACGCT CCGGAACCAG GGCAACACAG TACTGGTCGT AGAGCATGAT
GAACAAATTC TAACCGCGGC GGATCACCTG GTGGACATGG GCCCCGGCTC CGGAACCAAC
GGAGGCCGTA TTCTGGCGCA GGGCTCCCTT GCTGAGATAC TGGGAAATTC AGGAAGCCCC
ACCGGGGAAT GGCTTTCAGG CAAGCGAAAC ATGCCCGCCT CCGGACGCAA GACGGCTCCT
GCGGGGCGTC TGGTACTGAC CGGTGCGGAC AAGCACAACC TCAACAACGT CACTCTGAAT
ATCCCGGTCG GCACACTGAC CTGCATCTCC GGCCCTTCCG GTTCAGGGAA ATCCACCCTT
GTCCGGGACT GCCTCATCCC CGCAGTCAGG CAAGATCTCT CCGGGAAAAA GGGTATTCCG
CGCCGCGTGC AGGGAACGGA ACACTTCAAC CGCCTCGTCG TCATCGACCA GTCGCCCATC
GGCAAAACGC CGCGCTCCAC ACCGGCCACC GCTACCGGCC TGCTCCAGGT GCTGCGCCCC
CTTTACGCAC AGCTCCCCCT TTCCAAGCAG AGGGGATATA CGGCGGCGCG CTTTTCCCCC
AACATTCGCG GAGGCCGCTG TGAACGGTGC CAGGGAACGG GCATGATTGA AGTGGACATG
AACTTTCTGG GAAACGTGGC AATGCCCTGC GACGCCTGCC AGGGGCAGTG CTACAACAGG
GAAACGCTGG AAGTCACCTG GAAAGGGAAA TCCATTGCCC AGGCGCTGGC CCTGACCGTG
GACGAAGCGG CGGAATTCTT TTCCTCCCTG CCCAGAGCCG CCGCCATCCT GAAAAGCATG
CAGGACGTAG GGCTGGGATA CCTCAATCTC AACCGCAGGG CGGACACCCT TTCCGGCGGA
GAATCCCAGC GCATAAAAAT AGCTGCGGAA CTGGCCAAAG CCCCGGCCTG GAAACTGGAG
GAAGACGGGA AACGGGCCCT GTTCATTCTG GACGAACCCA CCAGCGGCCT CCACTTCAAT
GAAGTGGCCC TTCTCCTGGC AGCCCTTTTC CGCCTGAGGG ATGCCGGACA CACCATCCTC
TGCGTGGAAC ACCACAAGGA CCTGCTCAAT GCCGCGGACT ACCTGGTGGA CATGGGCCCC
GGAGCCGGCA GGCACGGCGG CAATATCGTG GCCGAGGGCT CCCCCGCAGA TGTAGCGTCC
AATCCGGAAG CGCCCACTTC TCCCTGGCTC GTCCCCCGTT AA
 
Protein sequence
MNLPISIRGA RQHNLRNLNL DLPSNKLIVF CGPSGSGKSS LAFDTLFSES RRRFLDCLSA 
RSRQGMDQPE KPEVDSITGL PPALCLEQSA RQQSSRTLLG SITEILDYLR ILYAAAGTPH
DPETGKELER KSPDRITEEL VSLPEHTRLI LTAPAENLLA QDPAATLADF QRQGFLRVYW
NGEMRDIEEI SSPVPPPPDA ALVIDRLIVR GENTASRIAD SLQTALRINP DEVRAIITIP
GEEASIRAFH TRYRNPETGF LLPQLTPRHF SFNSPLGACP SCRGTGLNEQ ENGPCRACGG
QRLSPLALAV TMPAPDRAYN LAELTALPLE DMAGELERLK TPPSLAAALT PLMEEINKRV
RFLNELGLSY LSLDRQANTL SGGELQRARL ASQLGGGLSG VLYILDEPTA GLHPADTDRL
LRALRTLRNQ GNTVLVVEHD EQILTAADHL VDMGPGSGTN GGRILAQGSL AEILGNSGSP
TGEWLSGKRN MPASGRKTAP AGRLVLTGAD KHNLNNVTLN IPVGTLTCIS GPSGSGKSTL
VRDCLIPAVR QDLSGKKGIP RRVQGTEHFN RLVVIDQSPI GKTPRSTPAT ATGLLQVLRP
LYAQLPLSKQ RGYTAARFSP NIRGGRCERC QGTGMIEVDM NFLGNVAMPC DACQGQCYNR
ETLEVTWKGK SIAQALALTV DEAAEFFSSL PRAAAILKSM QDVGLGYLNL NRRADTLSGG
ESQRIKIAAE LAKAPAWKLE EDGKRALFIL DEPTSGLHFN EVALLLAALF RLRDAGHTIL
CVEHHKDLLN AADYLVDMGP GAGRHGGNIV AEGSPADVAS NPEAPTSPWL VPR