Gene Amuc_1237 details


Gene Information

Locus tag: Amuc_1237
Symbol:
ID: 6275845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1484750
End bp: 1486246
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 54%
IMG OID: 642613294
Product: excinuclease ABC C subunit domain protein
Protein accession: YP_001877843
Protein GI: 187735731
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
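
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1484750
end_bp = 1486246

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # gene length in base pairs
print(protein_length, "aa")   # protein length in amino acids
```

Under these assumptions the result agrees with the listed Gene Length (1497 bp) and Protein Length (498 aa).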


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.173931
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.435946
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGACG CCCGGGGAAA TACCATCTAC GTGGGCAAAG CCAAGGACCT TCACCGCCGC 
CTGGGCAACT ATTTTTCTCC CACGGGAGCC ACACTCTCCA ATCATAAGAC AAGAGCGCTC
ATTAACGCTA TTGCGTCATT TGATTATTTT GAAACCAGAA ATGACCAGGA GGCTTTTCTG
CTGGAAAGCA AACTGATTAA ACAATATCGT CCGCATTACA ATATCCAGAT GAAGGACGAC
AAGCGCTACC CCCTGCTGAA AATTCCGAAG GGGGAAAAAC TGCCTCGTTT CCAGCTGGCG
CGAGTGCGCA AAGATGACGG AGCGCGCTAC TTCGGCCCCT TTGTCCACTC CCAGGCTCTT
TACGCCACGC AGGAATGGCT TAACCGGCAT TTCCGGCTGC GAACCTGCAA GACAAAAAAT
CCGGGAATTC ATGACTTCAG GCACTGCCAT GCGGATGTGA TTCGCAACTG CTCCGCTCCG
TGCATTGGCC GCATTTCCAT CAATGATTAC AACCGGAACT TTGACCAGGC GGTGCGTCTG
CTGGAAGGAA CGGGAAGAAA AAGCACTCTG GACGAACTTA CCGGGGAAAT GATGGAAGCC
GCCGATAAAC TGGACTTTGA ACGTGCGGCA TACCTGCGGG ACATCCGGGA CAATCTGGTC
AAGGTACTGG AACCCGCCCG GCGGTTCCGA AAAAGAACTC CCGATCTTCC CGGCACCGTT
CATCCGGAAG AAGACATGAA GGAACTGGGG GTGGCCCTGG GATTAGAATC TCCCCCAACC
ATCATGGAAT GCTTTGACAT TTCCAATGTC TCTTCCAACC ATATCGTAGC CTCCATGGTG
CGCTTTACCA ACGGCAGGCC GGATAACAAA GCCTACCGTC GGTACCGTAT ACGTACTGTG
GACGGACAAA ATGACTTCGC CTCCATGTCG GAAGTCATCC GGAGGCGTTA CTCCCGCATT
CTGGCGGAGA GCGACGCCGT AGCCTCACGG CAGGCGGACA TGACCCTGTA CCAGTGGCTT
AAGAAACTCA GCGCGGAAGG AAAAGCCCCC ATCAAAGTTC CAGATCTGGT AGTCGTGGAC
GGAGGTAAAG GACAGCTTTC CTCCGCTCTG GCCGATTTGG AAGCCATCGG CCTGGGAGAC
ATGCCCATCG TAGGCCTGGC CAAGCAGAGG GAAGAAATAT TTTTTCCCCA CCAGTCCCAG
CCCCTTTGCC TGCCTCACAG CACGGGAGCC CTCAAACTCA TGCAGCGCAT CCGCGACGAA
GCCCACCGTT TCGCCAACGG CTATAACGAA CTGCTCTACC GCAAACGCAT GCGGGAAAGC
GCCTTGGACG ACGCCCCGGG CATGAGCGCC TCAAAAAAAA GACTGTTGCT GGAAAAATTC
AAATCCGTAA CTGCCATTAA AAAGGCGGAC CCGGCTTCTA TCGCCGCCAT CCGCGGCATT
TCGGAAACTT GGGCCCGCAC CCTGCTGAAC TACCTTAACT CATCTTCCAA CTCCTGA
 
Protein sequence
MKDARGNTIY VGKAKDLHRR LGNYFSPTGA TLSNHKTRAL INAIASFDYF ETRNDQEAFL 
LESKLIKQYR PHYNIQMKDD KRYPLLKIPK GEKLPRFQLA RVRKDDGARY FGPFVHSQAL
YATQEWLNRH FRLRTCKTKN PGIHDFRHCH ADVIRNCSAP CIGRISINDY NRNFDQAVRL
LEGTGRKSTL DELTGEMMEA ADKLDFERAA YLRDIRDNLV KVLEPARRFR KRTPDLPGTV
HPEEDMKELG VALGLESPPT IMECFDISNV SSNHIVASMV RFTNGRPDNK AYRRYRIRTV
DGQNDFASMS EVIRRRYSRI LAESDAVASR QADMTLYQWL KKLSAEGKAP IKVPDLVVVD
GGKGQLSSAL ADLEAIGLGD MPIVGLAKQR EEIFFPHQSQ PLCLPHSTGA LKLMQRIRDE
AHRFANGYNE LLYRKRMRES ALDDAPGMSA SKKRLLLEKF KSVTAIKKAD PASIAAIRGI
SETWARTLLN YLNSSSNS
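
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: it builds a codon table by hand, on the assumption that translation table 11 assigns sense codons and stop codons (TAA, TAG, TGA) the same way as the standard code, and it translates only the first and last fragments of the gene sequence shown above. The function name `translate` and the variable names are hypothetical.

```python
# Build a codon table in the conventional T, C, A, G base order.
# Assumption: for sense and stop codons, translation table 11 matches
# the standard genetic code ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in tens.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 57 nt of the gene sequence, copied from the record.
first_fragment = "ATGAAAGACG CCCGGGGAAA TACCATCTAC"
last_fragment = "TCGGAAACTT GGGCCCGCAC CCTGCTGAAC TACCTTAACT CATCTTCCAA CTCCTGA"

print(translate(first_fragment))  # MKDARGNTIY
print(translate(last_fragment))   # SETWARTLLNYLNSSSNS
```

Both fragments translate to the corresponding ends of the listed protein sequence, and the final TGA codon acts as the stop.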