Gene Amuc_0479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0479 
Symbol 
ID6275416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp567156 
End bp568616 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content48% 
IMG OID642612529 
Producthypothetical protein 
Protein accessionYP_001877098 
Protein GI187734986 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.318642 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCCG CTATGTTGCC TTGCACTATG TGCAGGGGAC AAAAGAGGGC GGCGGAAGGC 
AATGACGGGG GCATCAAATA TTGGTGGATT TTACCTTTTT TGTCTTTCTT TTTTTCCCTG
AATAACCAAA GTTTTTGGAT TGATGAATGT TGTACTGCAT TGTGCGCCAT GCAGCAGGGA
ATGGAGGGAT GTTGGAAGAA GATTTGTGAA ATTGGCGGTA GCGATGCTCA AATGGCCTTT
TATTACTACC TTTTATTTTT GTGGCATCAT CTGACGGGTG CGGAGTCTGA ATGGATGCTG
AGGTTATTTA ATATATTCTG GGTATTTTTA TCATCCTGGT TTTTCAGAAA GGAACCAAAG
GCGCTGGTGA TTCTTTTGAT TTCACCGTTT TTTGTTTATT ATTCTAACGA GTTGCGGCCC
TATATGTTGC AGATAGCGGC CTCTTGCGCT GTCTCCATGT TGTTTTGGCA GGTTAGCCGG
GGAGAACCGG TAAAATTCCA TGTTTTTTTT GGTTCTCTGT TTTTTTTATG TCTTACAAGC
CTGACAGGGG TTGTATGGGC GCTCGGTTTT GCCGCGGCAT TCATGGTCAT GGCATTCCGG
CAGTTTGGAG GGCGGAGGTT TCGGCGAGCC CTGCTCTGGT GGATTTTTCC ATTTTCCGGG
TTGGGAGCAT ATTATCTTTA CACTCTTTTC TTGGGCGCCC GTGCGGTTTC CATATCTTCC
TCCTGGATAG TCAATGCATG CGCTTCCATG TACGAGTTAT CGGGATTGGC CGGCATGGGG
CCTTCCCGTC TAGAATTGAG GATGTGCATG ACTCCTGATG CTTTATGGAA TATGAACGGA
TTGGGGGCAG GGATGATCTC CGGCGCCATT CTTCTGGCCG GTTCTGCCTG CGGCATCATT
TTGTGGAATA AGCGGGCTGA GAGGCCCTTG GTTCCTGCTT TGCTGGTGTT GATATTGCTT
CCAGGGGCCG TATTCCTTTA TGGAACGGAA ATGATGGATT TCCGTTTTTC CGGACGGCAT
TGCGCGCCGT TATTGCCTGT ATTGTGTCTG GCCTGGTCCC TGGTGGCGTC ATGGGATTGG
TCGTTCCCCA GGATTGCCCA AACTATCCTG TTCCTATTAA TGATGATTGT TTGGGCGGTG
AGCGATATTC GTATTCGTTG CAATGATTTG TATGGCCGTG AAGATTTCCG CTCTGCGGTA
TCTTATTGCA AATTATTGCA GAGCAGTCAT GTTGACATTT TACTTTTATG CAATGGGGCC
GGAAAGGAGT TTTATGGATG GATTCCCGGC CCGTTAAAGG ACAAATGGTC TGATTATAAG
GTGATCGTAG TTTCCCGGCC ATCCGATTAT GCTGCTTTTC TCCAGCCCAT AGAGTCTTCC
GGCAAGTATG AACGTTCCGA ATTATGCCGG GGGTTTGTCA TCTATCGGCG GGGTGGAATC
TCCCAACGAG AGGGGTATTG A
 
Protein sequence
MQSAMLPCTM CRGQKRAAEG NDGGIKYWWI LPFLSFFFSL NNQSFWIDEC CTALCAMQQG 
MEGCWKKICE IGGSDAQMAF YYYLLFLWHH LTGAESEWML RLFNIFWVFL SSWFFRKEPK
ALVILLISPF FVYYSNELRP YMLQIAASCA VSMLFWQVSR GEPVKFHVFF GSLFFLCLTS
LTGVVWALGF AAAFMVMAFR QFGGRRFRRA LLWWIFPFSG LGAYYLYTLF LGARAVSISS
SWIVNACASM YELSGLAGMG PSRLELRMCM TPDALWNMNG LGAGMISGAI LLAGSACGII
LWNKRAERPL VPALLVLILL PGAVFLYGTE MMDFRFSGRH CAPLLPVLCL AWSLVASWDW
SFPRIAQTIL FLLMMIVWAV SDIRIRCNDL YGREDFRSAV SYCKLLQSSH VDILLLCNGA
GKEFYGWIPG PLKDKWSDYK VIVVSRPSDY AAFLQPIESS GKYERSELCR GFVIYRRGGI
SQREGY