Gene Amuc_1593 details


Gene Information

Locus tag: Amuc_1593
Symbol:
ID: 6274597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1916220
End bp: 1917680
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 64%
IMG OID: 642613653
Product: protein of unknown function DUF187
Protein accession: YP_001878194
Protein GI: 187736082
COG category: [S] Function unknown
COG ID: [COG1649] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
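The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1916220
END_BP = 1917680


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)
print(gene_length)                  # 1461, matching "Gene Length"
print(protein_length(gene_length))  # 486, matching "Protein Length"
```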


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGAT TTCCCCTGTT CCATGCCTTC TCCTGCTCCC TGCTGGCCCT GGCTTCCCAG 
GCGCTTGGGT GGCAGACCTC CGGAGAATCC GTTCCCGCTG TCCCCCAGGA ATTCCGCGCG
GCATGGATAT CCACCGTTCA TAATATTGAC TGGCCCTCCC GTTCCGGCCT CTCCGGAGCA
GCCCAGCGCG CGGAGCTGCT GAACATCCTG AATACCTGCG CCCAGTTGAA GCTGAACGCC
GTGTTTCTCC AAGTGCGCCC GAACGCCGAC GCCCTGTACC GCTCTTCTCT GGAACCGTGG
AGCCAATGGC TTTCCGGCCC GGGCGTCAAT CCTGGGTATG ATCCTCTGGC TTTCGCCATT
CAGGAGGCTC ACCGCCGCGG CATTGAACTG CACGCCTGGT TCAACCCATT CCGGGCGAAG
GCCAACGTCA AGCATGCCGT AGGCCGCAAC CATATTTCCC TCACGCGCCC TGATTTGATG
AAGCGAAACG GCTCCGTGCT GCTGATAAAC CCCAGCGCCT CCGCCTCCCG CGACCACGCG
CTGAAGGTTA TCATGGATGT CGTGCGCCGC TACGACATTG ACGGGGTGCA CCTGGACGAC
TATTTCTACC CCTATCCCAC TCCCGGCCGC GCCTGGTCTC CCGCCAGCTT TGGAGACGGG
AAATCCCCCT CCCAGCGCCG GGGCTACATT GACGGCTTCG TGCAAGACAT GTACAAATCC
GTCAAATCTT CCAAACCCTG GGTGCGGGTG GGCGTCAGCC CGTTCGGCAT CTGGAGGCCG
GGCGTTCCCG GCGGCATTGA AGCGGGAGTG GACGCCTACG AGCATCTGGC GTGCGACGCC
CGCAAGTGGC TTTCCCGCGG ATGGGTGGAT TACCTGGCCC CCCAGCTTTA CTGGCGCTGC
AGCCCGGCCA AGCAGAGCTT TCCCGCATTG ATGCAATGGT GGGCCGCCCA GAACTCCAGA
CGCCCGGTGT GGCCCGGCAT TGCCACGGCA CGTATCATGA GCAGCGAAGA CCCGGGCCGC
CCCGCCTCTG AAATAGCGGC GCAGGTCAAC TACTCCCGCA GCCTTGCCAG AACCGCCCCC
GGGCAATGCT TCTGGAGCAT CAAATCCATC ATGCGGAATG CCGGAGGCAT CCAGAAATAC
CTGAACAGAT TGTACCCCTC CATGGCCGTT CCGCCCGCCA TGCCCTGGTG CGGAACCGGC
ACACCCGGAC AACCGCAGAA TTTTTATGTG GCGGACAATG GTTCCACCGT AACCCTCTCC
TGGCAGCCGT CCGGCAATCC CTCCCGGAAA TGGGCCGTCC AGGCGCGCTA CGGCAGCCAG
TGGGCTACGC GCATCCTTCT GCCGGGCAGC CAGACCCGCG TAACCCTGCC CAAATCCTTC
CTGGGGGATG CGGGGTCTGT CGCCGTCAGG GGCGTCAGCG CCTATGGAGC CCAGGGCCCG
GCAGCAGCCG CCAGAAGGTA G
 
Protein sequence
MTRFPLFHAF SCSLLALASQ ALGWQTSGES VPAVPQEFRA AWISTVHNID WPSRSGLSGA 
AQRAELLNIL NTCAQLKLNA VFLQVRPNAD ALYRSSLEPW SQWLSGPGVN PGYDPLAFAI
QEAHRRGIEL HAWFNPFRAK ANVKHAVGRN HISLTRPDLM KRNGSVLLIN PSASASRDHA
LKVIMDVVRR YDIDGVHLDD YFYPYPTPGR AWSPASFGDG KSPSQRRGYI DGFVQDMYKS
VKSSKPWVRV GVSPFGIWRP GVPGGIEAGV DAYEHLACDA RKWLSRGWVD YLAPQLYWRC
SPAKQSFPAL MQWWAAQNSR RPVWPGIATA RIMSSEDPGR PASEIAAQVN YSRSLARTAP
GQCFWSIKSI MRNAGGIQKY LNRLYPSMAV PPAMPWCGTG TPGQPQNFYV ADNGSTVTLS
WQPSGNPSRK WAVQARYGSQ WATRILLPGS QTRVTLPKSF LGDAGSVAVR GVSAYGAQGP
AAAARR
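
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon table by hand and assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which codons may act as start codons). The helper name `translate` is hypothetical. Whitespace is stripped first so the 10-base blocks can be pasted as displayed.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; spaces and newlines are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACCAGAT TTCCCCTGTT CCATGCCTTC TCCTGCTCCC TGCTGGCCCT GGCTTCCCAG"
last_line = "GCAGCAGCCG CCAGAAGGTA G"

print(translate(first_line))  # MTRFPLFHAFSCSLLALASQ
print(translate(last_line))   # AAAARR*
```

Both outputs agree with the start and end of the protein sequence shown above, with the trailing `*` corresponding to the TAG stop codon.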