Gene Amuc_0018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0018 
Symbol 
ID6275222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp23308 
End bp24450 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content58% 
IMG OID642612058 
Producthypothetical protein 
Protein accessionYP_001876646 
Protein GI187734534 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTCCT ATTGTACCAA TATTCATCCC GCAGAATCCT GGGCGGAAAC CAGGGAGGCT 
CTTTTCACCT GCGTTCCCCG CATCAGGCAG GAACTCGCGG CGATGGACTC CCCTCTGAAG
GATCTCCCCC TGGGCATCGG CCTGCGCCTG TCCGCCAGGG CGGCAGCGGA GCTTCTGGCA
ACGCCGCACG CCGCGGAAAC CCTGAAATCA TGGCTGGAAG ACCAGGGCGC GCGCGTAGAA
ACCCTTAACG GGTTCCCTTA CGGAAATTTC CACGGGCAGC GCGTGAAAGA ACGCGTTTTC
CAGCCGGACT GGACTACACC GGAACGTTTT GAATACACCT GCAACCTGTT CCGCATTCTG
GCACTCATTG GTGACGAACA GGCTGACAGG CTGACCGTCA GCACGCTCCC TGCCTCGCAC
AGCTGGTTTC ATGCGGATGA AGAACGCATC TTCTCCCGGC TGGACGCCAT GAGCGGATTC
CTGGATGTGC TGGGCAGGCA GACCGGCTGC CTGATGCAGC TGGGGCTGGA ACCGGAACCC
TTCGGCCATT TTCACGATAC GGATGGAGCC ATCCGTTTTT TCAACGGCCT CCGCAACCGT
TCCCGCCGTC CCGAACTTAT CGAACGCCAC CTGGGGCTGA CATACGATAC CTGCCATTTC
GCCATTCTCC GGGAAGAACC GGAATTCACC CTCTCCGCCT GGGAGGAAAA CAACATCGCC
CTCTGCAAAG TGCAATTTTC CAACGCCCTG GAATGCCGCA TATGTGGGGA GGAAGACCTG
GAACGCCTCC GGCAGTTTGA CGATGGCGTT TATTTCCATC AGACCAGCAT CCTCCACCGG
GAAGGCGCCA TGCTTTTCCC GGACCTGCCC AATGCCCTGG CCTATGGGCG GGATTATGCA
GAGGAAATAC GTGATTCCCA ATGGCGCATT CATTACCACA TTCCCCTGTA CGCTTCACCG
GAACCACCCT TGAAAAGCAC GGAAGAATTC ATCCAGAAAA CGCATAATTT CCTCCGGAGC
CGCAAAGGCC CGCAACCGCA TCTGGAGGTG GAAACCTATA CCTGGAGCGT CCTGCCGGAC
CACATGAAGA TCCCCCTGGC AGCCCAGATT GCCCGTGAAC TGCATTATAT TGAAACCCTG
TAA
 
Protein sequence
MLSYCTNIHP AESWAETREA LFTCVPRIRQ ELAAMDSPLK DLPLGIGLRL SARAAAELLA 
TPHAAETLKS WLEDQGARVE TLNGFPYGNF HGQRVKERVF QPDWTTPERF EYTCNLFRIL
ALIGDEQADR LTVSTLPASH SWFHADEERI FSRLDAMSGF LDVLGRQTGC LMQLGLEPEP
FGHFHDTDGA IRFFNGLRNR SRRPELIERH LGLTYDTCHF AILREEPEFT LSAWEENNIA
LCKVQFSNAL ECRICGEEDL ERLRQFDDGV YFHQTSILHR EGAMLFPDLP NALAYGRDYA
EEIRDSQWRI HYHIPLYASP EPPLKSTEEF IQKTHNFLRS RKGPQPHLEV ETYTWSVLPD
HMKIPLAAQI ARELHYIETL