Gene Amuc_1561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1561 
Symbol 
ID6273684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1875070 
End bp1878234 
Gene Length3165 bp 
Protein Length1054 aa 
Translation table11 
GC content59% 
IMG OID642613621 
ProductUvrD/REP helicase 
Protein accessionYP_001878163 
Protein GI187736051 
COG category[L] Replication, recombination and repair 
COG ID[COG1074] ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCC TGACCAACAT GCTGATCTCC GCTTCCGCCG GAACGGGAAA AACCTACCAG 
CTCTCCCTGC GTTTCCTGGG GTTGCTGGCG CTGAACAGCG GCAACCACCC CGAACGCCTC
ATCGCCATTA CGTTCACCCG AAAAGCGGCG GGAGAATTCA AGGACCGCAT TCTGACAGAC
CTGGCCGCGG GGGCCACGGA TGAGGCAGGC GCGGCGCGGC TGAAAGAACG GCTCTGGGCG
GTTATCAAGG GAACGGACGG GGAGCCCGGT CTCTGGCCCG GCGCTCCGGA GGCATGGAAG
GAGGAAAATC TGCATCGGGA ACGCTTCCTC CATTTACTGC ACATCCTGGT ACAGAATTTG
GCGCGACTCA ACCTCTGCAC CATTGACAGC CTGTTTGCCC AAATTGCCTC CGCCAGCACC
TTTGAGCTGG GAGTCAGCGG CTTCAGTATG ATTGACCCCA CGGCGGAGAA ACTGGCGCGC
CGTGAAGCCC TGCTCTCCCT GTACCGGGAA TGCTCCGTAA ACAAAGAGCG AAGGAAGGAT
TTTGAAGACG CTTTCCTTTC CGGGGCGGAT TCTGACGCGG AAGCCGCAGA CGCGGAGAAA
TCCATGATGC GCCGCCTGGG CACCTATCAC GAACTTTTTC TGGATGTGCC GGATGCCGGG
ATGTGGGGAA ACCCCGTCAC GCTGGGCTTC ACACCGGAGG AGCTGGCGCC TCCCGTCTCT
CTGGAACAAT TTGATTCCGT CCTTCATTCC CTGATTTTCC AGGTTCAGCA AACTCCTGCG
TCGGAAGGGA AAAACGGCGT AAAAAACAAA GAACTGTTCC TGCGCTTCCT GAACGGATTT
TCCCAATACG CCAGGCTGGG CCGCGTGCGA TTCCGTACAG AGGGTGGTTC CGCCTGGGCT
ATTACCGTGG AAGAGGCGCG GGAAAAATTC CGGGATTTCT GGACGCCTGC TCTGGAAGAA
CTCATCCAAA GCTGGCTTCG CATGGAAACC CTGCACACGC TGCGCCGAAC CCGCGCCACG
CACGGGCTGA TGCTCCTGTT TGAGAACAAA TATTCCTCCC TGGTCCGCAA CAGGGGGAGA
TTCCTGTTCC ACGACGTTAC GCGCATGCTG GGAGGGGGCA CCATGACGCC GGAACTCAAA
CGGGACCTGC AATACCGCAT GTACTGCCGT TATGACCATT GGATGCTGGA TGAATTCCAG
GATACGTCCC AGCCCCAATG GCACGTCATC AAGCCTTTTC TGGATGATCT GGCGGAATCC
AAGACTGGAA ATGAAGGCAG CATTTTCGTG GTGGGGGACA TCAAGCAGAG CGTGTACCAA
TGGCGCGGCG GAGACCCGGA GCTTTTCCGC TCCGTTTCTT CACAGCTTCA ACTGGAACAA
CGCGGCATGT CCACTTCCTA CCGTTCTGTC CAGCCAGTGC TGGACCTGGT CAACGATATT
TGCGACTATG CCCGGACAGC GCCCGGGTGC GAACCCGCCG CCCTGGAACA GTGGGGAGAA
TATCCGGCGC ACCGGTGCGC CCCTCATCTG GAAGACCGTC CCGGCACCTC CCAAATCTGG
CAGGCGCCCA AGGCGGAAAA CGTCTCCGCC AACGACCAGG TATGCCAGGC GGCGGCCGAT
ATTCTGGAGC GTACAGGGGC CCTGCGCCGC GGCCTGGAAA CAGCGATACT GGTCAGTACG
AAAAACCAGG CTCTTGTCAT CAAGGGCTGG CTGACGGACC ACGGCATCCC CGCAGAAGTA
TGTGACGACG TTCCAGTAGG TGTGGACTCT CCGCTGGGAA AAAACCTGCT GTACTTCTTC
CGCTGGCTGC TGATGCCGGG AGACCCCTTT GTCGTCGGCC TGCTCACCCA CTCCCCTCTC
CGGCCCCTCA TCACGCAGGG AGGCCCGGAA AGCATGGGGT GGAAGGAATG GCGCCTCCTG
CTGGAACGGG ACGGTTACGC CGCCGTCATG GAGGAACTGG AACAGCGCCT GCTCCGGGGA
GGGACGGAAC TGACGGACTT CCACCGCGAC CGCCTGGCCG TCTGGCAGAA TGAGGCGGAA
CAGGTGGACG AACAGGGCGT TTCCCTGGAT GAATGGATCA GGCGCATGGA AGACCTGACC
CGCCGGGAAG ACCCCGCCGC GGGAATTGTC CGAATCATGA CCATCCACAA ATCCAAAGGG
CTGGGATTTG ACATCGTCAT CCTGCCCCAG ATCGGCAGGG ATACACCTTT TGCGGACGGA
AGGCATCTGA CCCATTTTAT CAAAAAAAAC GGGGAGGGCG GTGTGGAAGG CATTGTCCTG
GCCCCCTCCC GGCATGTCTA TATGAATATC CCGCAATTCC GGGAACTTTA CGGGGAATGG
CGCGCCCGGC AGCAATTCGA CGGATTCTGC AAGCTGTATG TGGCGCTTAC CCGGGCCAAG
CGGGCTACCT ACGTTATCCT CCCGTACCGG GAAGACAAGG AAGAAACGGA GGCAGATTCC
ATGTGGAAAG TAGTCCGTTC CTCTATCCGG CCCCTCAACC GGGGAACGGA AGACATACTT
CCGGAATCCG GAGCGTCATG CCTGTACTCC CGCGGTCTTG ACGGGTGGTA TGAGGAATTC
CCGGAAAGCA TACAAAAACG CGCGGAGAAA AACGCGCTGG AATGGCCTCC GCAAAAACCG
TTGGCCAGAG AACGGATATC CCCTTCCGGC CTGTCGGAGG AAGCCTCCCC GCTCCAGGGA
GAAAAACACG CGGGCGCAGG GAAAGCCGCC GCGCTTGGTT CCGCCGTCCA CGCCGTGTTT
GAACGGATTA CCCGCTGGGA TGATGAAAAC AAACCGGCGT GGGCTCTCCA CCCTGCCACG
GAGGCGGAAC GCATCGTAGC GGAGTGCATG GAAATCCCAT CCATCCGCGA ACTTTTCACG
CCTCCGGAAA CGGCGCGCAT CATGAAGGAA CAGCGTATTG AAGCCATTGA CGGGAATAAC
TGGATTTCCG GCATCATTGA CAGGCTGATT CTTGATGGAG ACGGCGCCCG CATCGTGGAC
TTCAAGACGG ACCATGCCGA TACCCCTGAG CAACTGCGCG AGCACCATGC AAACCAATTG
AACGCTTACG CCCGCATCGT ATCCAGAATC ACCGGAATAC CGCTGGAGCG CATTACGCGC
ACTATCGTCT CCACCTGCTT GAAGGAAACC GTCCCCATTC AATAA
 
Protein sequence
MPPLTNMLIS ASAGTGKTYQ LSLRFLGLLA LNSGNHPERL IAITFTRKAA GEFKDRILTD 
LAAGATDEAG AARLKERLWA VIKGTDGEPG LWPGAPEAWK EENLHRERFL HLLHILVQNL
ARLNLCTIDS LFAQIASAST FELGVSGFSM IDPTAEKLAR REALLSLYRE CSVNKERRKD
FEDAFLSGAD SDAEAADAEK SMMRRLGTYH ELFLDVPDAG MWGNPVTLGF TPEELAPPVS
LEQFDSVLHS LIFQVQQTPA SEGKNGVKNK ELFLRFLNGF SQYARLGRVR FRTEGGSAWA
ITVEEAREKF RDFWTPALEE LIQSWLRMET LHTLRRTRAT HGLMLLFENK YSSLVRNRGR
FLFHDVTRML GGGTMTPELK RDLQYRMYCR YDHWMLDEFQ DTSQPQWHVI KPFLDDLAES
KTGNEGSIFV VGDIKQSVYQ WRGGDPELFR SVSSQLQLEQ RGMSTSYRSV QPVLDLVNDI
CDYARTAPGC EPAALEQWGE YPAHRCAPHL EDRPGTSQIW QAPKAENVSA NDQVCQAAAD
ILERTGALRR GLETAILVST KNQALVIKGW LTDHGIPAEV CDDVPVGVDS PLGKNLLYFF
RWLLMPGDPF VVGLLTHSPL RPLITQGGPE SMGWKEWRLL LERDGYAAVM EELEQRLLRG
GTELTDFHRD RLAVWQNEAE QVDEQGVSLD EWIRRMEDLT RREDPAAGIV RIMTIHKSKG
LGFDIVILPQ IGRDTPFADG RHLTHFIKKN GEGGVEGIVL APSRHVYMNI PQFRELYGEW
RARQQFDGFC KLYVALTRAK RATYVILPYR EDKEETEADS MWKVVRSSIR PLNRGTEDIL
PESGASCLYS RGLDGWYEEF PESIQKRAEK NALEWPPQKP LARERISPSG LSEEASPLQG
EKHAGAGKAA ALGSAVHAVF ERITRWDDEN KPAWALHPAT EAERIVAECM EIPSIRELFT
PPETARIMKE QRIEAIDGNN WISGIIDRLI LDGDGARIVD FKTDHADTPE QLREHHANQL
NAYARIVSRI TGIPLERITR TIVSTCLKET VPIQ