Gene Amuc_1719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1719 
Symbol 
ID6274078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2089829 
End bp2091193 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content56% 
IMG OID642613782 
Productexodeoxyribonuclease VII, large subunit 
Protein accessionYP_001878318 
Protein GI187736206 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.623668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.0000296907 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAATTCC CGGAGGAAAC GCCCGCCGCG CCCAAACCCA TCACAGTCAA ACAGCTCGTT 
TACCGCCTGA GGGACACGGT CAGCATCGCT ATGGGCACCC AGTGGGTGGT GGGAGAACTG
AGTAACGTCA AGCATCACAC CAGCGGCCAC GTTTATTTCA CCCTGAAGGA ACAGGGAGCG
GAAATTTTCT GCGCCTTTTT CAAGGCAGCG GCTTCCAAAT GCCCCATACG GCTCCAGGAA
GGGATGAAAG TCCATGTCCT GGGCAGCGCC ACCGTATACC CGGACCGGGG GCAACTCCAG
CTGGTCATCA GGCAGGTAAA AGCTGCAGGA CAGGGAGATT TGCAGACTCG CTTTTTGGAG
TTGAAAGCCA AACTCCAGCG CGAGGGGTTG TTTGATGCGG AACACAAAAA GAAAATTCCC
ACGTTCCCCC GCGCCATCGG CATCGTCACC TCCCCCACCG GCGCCGTCAT CCAGGACATG
CGGCACGTAT TGGAACGCCG CGCCCCGTGG GTGAAGGCTT ATCTGCTGCC CATACGCGTG
CAGGGGGCAG GAGCGGAGCA CGAAATAGCC GCAGCCGTGC GCGCTTGGTC CGGAGCCCCC
TTTAACGGTC TCCCTCCCGT GGACGTGCTC ATTGTAGGCC GCGGCGGCGG CTCCATTGAG
GATTTGTGGA ATTTCAATGA GGAAACGGTG GCGCGCGCCA TTTACGAATG TACCGTTCCC
GTGATTTCCG CAGTGGGTCA TGATACGGAC TTCACCATTG CCGATTTTGT AGCGGATCTG
CGCGCTCCCA CTCCCACTGC AGCCGCGGAA CTGGCCACAC CGGACGGCCC CGAATGGCTC
CGCAAGCTGT CCAGAATGGA ACAGGCTCTT CATGCATCCG CTCGGCATTC CCTCTTGCGT
TCCAAGCTGA AACTGGATGT TTACCTGCGA GGGAAACTGT TGGATGCGTA TTCCCTGCTC
TCTCCTTATT CTCAGCGCCT GGACGATATG GAAGAAACTC TGCAGAATGC CGCAACCACG
AGAATTTTTC ATAATGCGCT GCATATTAAC AAATTGGAAC ATCAACTGCA AATGCGCCAT
CCCGCCCACA GGAACCGGGA GCGTACACAG CTTCTGGCTT CCCTTCAGGC CGGTTTATCC
CATGCTGCCG CCGCCCGCAT GACGGATTTA TCTTCACAAC TCACTCTTAT TCAAGCACGT
CTTGAAGCAC ACAGCCCGGA ACAGACCCTG CGCCGCGGAT ATGCCCTGGT GGAAAACCAG
GATAAGCAGC TTATCCGGCA GACAAACCAG GTGCATGCAG GGGAAAAATT GAAAATCCGT
GTTTCCGACG GTTGTTTTTA TGTCAGGGAT GACACGCCAC AATAA
 
Protein sequence
MEFPEETPAA PKPITVKQLV YRLRDTVSIA MGTQWVVGEL SNVKHHTSGH VYFTLKEQGA 
EIFCAFFKAA ASKCPIRLQE GMKVHVLGSA TVYPDRGQLQ LVIRQVKAAG QGDLQTRFLE
LKAKLQREGL FDAEHKKKIP TFPRAIGIVT SPTGAVIQDM RHVLERRAPW VKAYLLPIRV
QGAGAEHEIA AAVRAWSGAP FNGLPPVDVL IVGRGGGSIE DLWNFNEETV ARAIYECTVP
VISAVGHDTD FTIADFVADL RAPTPTAAAE LATPDGPEWL RKLSRMEQAL HASARHSLLR
SKLKLDVYLR GKLLDAYSLL SPYSQRLDDM EETLQNAATT RIFHNALHIN KLEHQLQMRH
PAHRNRERTQ LLASLQAGLS HAAAARMTDL SSQLTLIQAR LEAHSPEQTL RRGYALVENQ
DKQLIRQTNQ VHAGEKLKIR VSDGCFYVRD DTPQ