Gene Amuc_0381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0381 
Symbol 
ID6274861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp456376 
End bp457710 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content51% 
IMG OID642612432 
Productprotein of unknown function DUF21 
Protein accessionYP_001877001 
Protein GI187734889 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCTTT TTGTCACTTT CGTTATTCTT TTTCTGATTC TGGTCAACGC CTTTTATGTT 
GCAGCAGAAT TCGCCGCAGT GAGCGTACGC CGCAACATGA TCCGGGAAAT GGCGGAGAAC
GGCAGCAGGG TGGCTGTTCA TCTGCTTAAG ATTTTGGAAA ATACAAAAGA GCTGGACCGC
TACATTGCGG CTTGCCAGTT CGGCATTACC ATTTCCAGCC TGGTTCTGGG CGCATACGGA
CAGGTTGAAC TGGCCGCCTA TCTGTTCCCT CTGTTTGAGC GGTTTGGTGG GATGGACTCC
GTAATGGCCA ATTCTGCAGC CGCTCTTGTT GTACTGATAG GCTTGACGGT TTTTCAGGTA
ATTCTTGGAG AATTGATGCC CAAATCCCTG GCTCTTCAGT TTCCCAAGCA GGCGGCCTTG
TATACTTATT ATCCCATGCG ATGGACACTG GCTTTCTTCG CATGGTTCAT TGACTTTCTT
AATGGAAGCG GTTTGTTGCT TCTTAAACTG TTCCGGCTCC CACCTGGCGG CCATCAGCAC
ATCCACTCCC AGCAGGAGAT TAATATGCTG CTGGATGAAA GCCACCAGGG CGGGATGTTG
GAGGAAGATG AGCATGAACG GCTGCACAGC GCTTTGTCCC TGGCGGAACG AACGGTGGAA
CAAATTATGA TTCCCCGTTT TCAGCTTGTT TGCCTGGATG TGGATGCCAC ACAGGAAGAT
ATTTTGAATA TGATCGCAGA TAGGCCTCAT ACCCATATTC CCGTGTATGA AGGGAACCGT
GAATCTGTCA TTGGCATGCT GCATATTAAA GATATGGTAT CCGCTTATGC GGAAAAGGGA
ATTCTGCCTC CTCTCCGTTC CATGCTGAGG CAGGTGCCAT GCGTGATGGA AATGCAGACG
GTAGAGATGC TGATGGCCCG TTTGCGGGAA GACAGGGCCA AGGAAGCCTT TGTTCTGGAT
GAATACGGTA AGTTTGTGGG GCTGGTGACG CTGGAACGTC TGCTGGGAGA AATGGTGGGA
GATATAGATG AGGAATTCAT CCGTTCCGGG GAAAAGGTGG AAACCCTTCC GGATGGTTCC
GTGCGCATCC CCGGCATGAT GCGTGCCCAT AAGGCGGAAT GCCTGGTACC CTTTTTGATG
AATGGCGCCA CTACTGTGGG CGGCTGCGTG ATCAAGCACA TGTCCTGCAT TCCGAAGGAT
GGGGACCGCC TGATTATTGC CGGACGCGTG CTTGTGGTGG AAAAGATGGA CCATAACCGT
GTTTCCTCTA TCCTGCTGCT GCCTCCGGAG AGAAAGGAAA ATGAATTTGC CATGGAAAGC
GGGGTGGATG CATGA
 
Protein sequence
MILFVTFVIL FLILVNAFYV AAEFAAVSVR RNMIREMAEN GSRVAVHLLK ILENTKELDR 
YIAACQFGIT ISSLVLGAYG QVELAAYLFP LFERFGGMDS VMANSAAALV VLIGLTVFQV
ILGELMPKSL ALQFPKQAAL YTYYPMRWTL AFFAWFIDFL NGSGLLLLKL FRLPPGGHQH
IHSQQEINML LDESHQGGML EEDEHERLHS ALSLAERTVE QIMIPRFQLV CLDVDATQED
ILNMIADRPH THIPVYEGNR ESVIGMLHIK DMVSAYAEKG ILPPLRSMLR QVPCVMEMQT
VEMLMARLRE DRAKEAFVLD EYGKFVGLVT LERLLGEMVG DIDEEFIRSG EKVETLPDGS
VRIPGMMRAH KAECLVPFLM NGATTVGGCV IKHMSCIPKD GDRLIIAGRV LVVEKMDHNR
VSSILLLPPE RKENEFAMES GVDA