Gene Amuc_1659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1659 
Symbol 
ID6274570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2006208 
End bp2007308 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content55% 
IMG OID642613718 
Productintegrase family protein 
Protein accessionYP_001878259 
Protein GI187736147 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.0421076 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATGA AAAAAGCACC GAATACAAAC AAATTGAAAT ATGTCGGAGC CGCACTTCTG 
GAGGGGGAAT CAGTCACCCT GATCCAGGCT GCAAGGCTGG TATTAGAAAT CAAGGAAGCC
CTTGGAGATG AAATCTGTAC CATTACCCGG TGCCGGGAAG TCGTCTCCCT GGGATTGAAC
GCCATTAAAA ACAAACATCA AACCGTCAGT TTCGGCACGG CGGCCGTGGA ATGCCTGCGT
TCCAAAAGCC ACCGCCGTCC CCGGACGCTG ACGGACATCA GGAGCATCAT CCACAAGCTT
AAAAAGAGCA ATCCGGAACT GGAACACACC TCCCTGCGCA ACCTGAGCGT GGAGGAATGC
CAGAACATCC TGATGAATAC CTTTACCACA TCCCGGCAGA GGCACAAAGC GAGGCTCATC
TTGAGCGGAA TTTTCTCGTT CTCCGTCAAA CGCGGATGGT GTGACGAGAA TCCCATCCTC
CGTGTAGACA CGCCTTTTCT GCAGGAGCAG GAAATCCCCG CTCTGACGCT GAAGGAAATA
ACTCAGCTTC TCAAGGCGGC CATGGAAGAA TTTGACGGAA GCTGCGCAGC TGGAGCGGCG
CTGATGATCT TTGCAGGAAT CCGCCCGCAG GAAGTGGAAC GCCTGCTCTG GGAAAACATC
GCTCTCCGCG ACGGCTGTAT CATTCTGAAC TCCAAGCATA CCAAAACCGG AGGCGCCAGA
CACGTCACCA TCCTGCCCGT GCTCGCCAAA TGGCTCAAAT TCTGCCGTGA CAGGACCAAA
CCCGGCCCCG GAACTCCCAT CTGCCCGAAA GGGTGGACAA TCAAGTGGCG CAAAATCCGG
AAAAAAGCCG GCTGGGGAGG AAGAAAAAAA TCATGGGTGC CGGACTGCCT GCGGCACACC
TACGCCAGCT ACCACGCCAA GCACTTCAAG GACTACAACC TGTTGCAAAT GGAAATGGGG
CACCGCTCCT CCTCCCTGCT CCGCACACGG TACTTGAACA TGAAAGGCAT CTCTCCGCAA
ACGGCCACGC GCTTCTGGGC CCTGACGCCA GCCAAGGTCA TTGAAGAAAC GAAACCGCCG
GAAGAACCGC CGGTCTCCTG A
 
Protein sequence
MNMKKAPNTN KLKYVGAALL EGESVTLIQA ARLVLEIKEA LGDEICTITR CREVVSLGLN 
AIKNKHQTVS FGTAAVECLR SKSHRRPRTL TDIRSIIHKL KKSNPELEHT SLRNLSVEEC
QNILMNTFTT SRQRHKARLI LSGIFSFSVK RGWCDENPIL RVDTPFLQEQ EIPALTLKEI
TQLLKAAMEE FDGSCAAGAA LMIFAGIRPQ EVERLLWENI ALRDGCIILN SKHTKTGGAR
HVTILPVLAK WLKFCRDRTK PGPGTPICPK GWTIKWRKIR KKAGWGGRKK SWVPDCLRHT
YASYHAKHFK DYNLLQMEMG HRSSSLLRTR YLNMKGISPQ TATRFWALTP AKVIEETKPP
EEPPVS