Gene Amuc_1115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1115 
Symbol 
ID6273958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1333195 
End bp1334292 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content57% 
IMG OID642613166 
Productintegrase family protein 
Protein accessionYP_001877722 
Protein GI187735610 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATTCG GAATGGCGAG ACATGGAAAA AACCAGAAGG AATATCTGGC TGCGGAAATG 
TTATTGAAAG GTAGCGGATT GAATATGCTG GATGCTGCCA GCCTCGCAGT AGAGCTGCTG
AGCCTGTGCG GAGGCAGCCG CAGCATGCGC CGGGCCAGAA AGGCAATTTT TCTGGGTGCT
GAGGAATTGA GGAAAGGGGA AAGGACGGTT TCCTTTTCCG CCGCCGTGGA AGAAATGCTG
AAGGCGAAGC GTAATCTTCG CCCTACGACG TTGCGCGATA TCAGGTATTT CACCGGAGCT
TTGATGCGGA GGTGTCCGGA ATTGAAAGGT TTTCCGGTCC GGAAGCTGAC GCCGGAGCAT
TGCGCCCATT TTCTGGAGGC TGCTTTTACA TCCCACAGGC AGCGTTACAA GGGCCGGGCG
GTGATGAGCG GCGTTTTGTC CTTTTCCCTG CGGCGCGGAT GGTGCGGGGA AAATCCTGTC
GCCCGTGTGG ACTCTCCTTC GTTTCGTGAA CGGACGATTG CCATTTTGGC TCCGGAAGAG
ATAGGAAGCC TTCGGACGGT GGTGGAAGCT CCGGAATTTC GTGACTGCGC TCCGGCTGTT
TGGGTAATGT TGTACGCCGG AATCAGGCCG GGGGAGGTGG TGCGCCTCCA TTGGCGTGAT
GTGGATTTGG AGGAGAGGGT CATTTCCGTA AGGTCCCGAA CCAGTAAAAC GGGGGGAATC
AGGCATGTCA CCATTCATGC TGTGCTCTGG CGGCTTCTGG CGGGATACGG AGCAGGGGGG
CCGGCCGGAC TGTTATGTCC TCCCAATTGG CCGGTGCGCT GGAGGCTGCT CCGGAAAAAA
GCCGGATGGG GGCTGCACAA CCGCTTTGGA GAATGGAGTG CGGACGCGCT GAGGCATACT
TATGCTTCCT ATCACGCCAA ATGGTTCAGG GATTTTTCCC TGCTTCAGCT GGAGATGGGG
CATCGTTCCT CCTCCCTGTT GCGGGAGCGG TACCTGAACA TGGAAGGGGT GAGCCGCGAC
AGGGCACGCC TTTTCTGGGA AGCGCCGGAA CATGGCTGGA ACAACAAAAC CGGCATTGCA
GGAACGGACT TTCTATGA
 
Protein sequence
MIFGMARHGK NQKEYLAAEM LLKGSGLNML DAASLAVELL SLCGGSRSMR RARKAIFLGA 
EELRKGERTV SFSAAVEEML KAKRNLRPTT LRDIRYFTGA LMRRCPELKG FPVRKLTPEH
CAHFLEAAFT SHRQRYKGRA VMSGVLSFSL RRGWCGENPV ARVDSPSFRE RTIAILAPEE
IGSLRTVVEA PEFRDCAPAV WVMLYAGIRP GEVVRLHWRD VDLEERVISV RSRTSKTGGI
RHVTIHAVLW RLLAGYGAGG PAGLLCPPNW PVRWRLLRKK AGWGLHNRFG EWSADALRHT
YASYHAKWFR DFSLLQLEMG HRSSSLLRER YLNMEGVSRD RARLFWEAPE HGWNNKTGIA
GTDFL