Gene Amuc_0600 details


Gene Information

Locus tag: Amuc_0600
Symbol:
ID: 6274705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 709686
End bp: 710834
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 56%
IMG OID: 642612651
Product: integrase family protein
Protein accession: YP_001877218
Protein GI: 187735106
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
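
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(709686, 710834)
print(length, protein_length(length))  # prints: 1149 382
```

Under these assumptions the result agrees with the Gene Length (1149 bp) and Protein Length (382 aa) fields in the record.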


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTGC CGAACATTTT GATTAATAAA CGACTATTTT TGCTTGACTA CGGATTCCGA 
AACCGTTGCA AAATAGATGA AATGAAAGAG GAAATTTTAG CAACGGATCA TGCGGCGCTG
GCCATTTTGA AAAAGACAGG GCTGGATGTG GTGGAAGCCG CCCAGCTGGC GCATGAATTG
CTGCAGGCCA GCAAGGGGCG CGGCAACCGA TGGAAGAGGG CAAGGGAATG CATCCGGCTG
GGAGAGGAGG CCCTGGTTGC CAGAGGAAAA ACCGTGGCGT TCAGGAAAGC CGTGGATGAG
GCCATGGAGG CCCGGCAAGA TCGCAGGAAA AGAACCAGGG ATGATTTCCG TTATATCAGC
CGGCGGTTGT TGAGGTTTTG TCCGGAAATG GCCCGGCGGC CCGTGCGCTT TATCACGCCT
CGGGAATGCC GCGCCTGCCT GGAAAAATCA TTTGATACCC TCAGGCAGAG GCATAAGGCC
CGGTTGGTGC TCAGCGGTAT TTTCGGAACG GCTGTGAAAC GCGGCTGGTG CACGGAAAAC
CCGGTGGCGT ATGTGGATCT GCCGAGGCTG AAGGAGCATT CCATTCCCGT GCTGTCCCTG
GAAGAAATGC GCCGGTTGCT GGCTGCGGCG GAAGAGTATG ACGGAGGGGC GTGCCTGGCT
GCGGTTGGGC TGATGCTGTA TGCCGGCATC CGCCCGGAAG AAGTGAGAAG GCTGGATTGG
GCACAGATTA ATTTAAGGGA GGGAATGGTT TCCCTGAGGG CGCGCCACAG CAAGACGGGA
GGAGCCCGCA TCGTGACGAT CCGGCCGGTT TTGGGAAATC TGCTGGAAAG AGCGGTTGCT
ATGGGATTTG TGTCGGGAAG CGTATGCCCG CGCAACTGGC CGGCGAAGTG GAGGGAGCTG
AGACGGTTGG CCGGGTGGGA CGGGAAGGAA AAGAAGTGGC CTGCGGATAT GTTAAGGCAT
ACGTTTGCCA GTTATTTTGC CAGGCATTTC AAGAATTTGC ATGTATTGCA GATGGAAATG
GGGCATTCTT CTTCCGATTT GCTGAGGACC AGGTATTTGA ATATGGAAGG CATTACGGAA
ATGACGGCCG CCGTTTTTTG GGGGAACTGC CATGCGGGGC GTCACACCAT TAACAATGGC
CGCCCCTGA
 
Protein sequence
MELPNILINK RLFLLDYGFR NRCKIDEMKE EILATDHAAL AILKKTGLDV VEAAQLAHEL 
LQASKGRGNR WKRARECIRL GEEALVARGK TVAFRKAVDE AMEARQDRRK RTRDDFRYIS
RRLLRFCPEM ARRPVRFITP RECRACLEKS FDTLRQRHKA RLVLSGIFGT AVKRGWCTEN
PVAYVDLPRL KEHSIPVLSL EEMRRLLAAA EEYDGGACLA AVGLMLYAGI RPEEVRRLDW
AQINLREGMV SLRARHSKTG GARIVTIRPV LGNLLERAVA MGFVSGSVCP RNWPAKWREL
RRLAGWDGKE KKWPADMLRH TFASYFARHF KNLHVLQMEM GHSSSDLLRT RYLNMEGITE
MTAAVFWGNC HAGRHTINNG RP
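
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch rather than the database's own method: the helper names are hypothetical, the input is assumed to be in the space-separated 10-base blocks shown above, and the codon table is the standard codon-to-amino-acid mapping, which translation table 11 shares (table 11 differs in its set of permitted start codons, which does not matter here because this gene begins with ATG).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
first_block = clean("ATGGAATTGC CGAACATTTT GATTAATAAA")
print(translate(first_block))  # prints: MELPNILINK
```

Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` would let a reader compare the results against the GC content field and the protein sequence listed in the record.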