Gene Amuc_1390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1390 
Symbol 
ID6274602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1660025 
End bp1661212 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content48% 
IMG OID642613447 
Producthypothetical protein 
Protein accessionYP_001877995 
Protein GI187735883 
COG category[S] Function unknown 
COG ID[COG3274] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.67535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.125851 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTGC CGCTGGCTGA TGATGAAGAG AAAGGAAAGC GTTTTTCCTG TGGTTGTCCG 
GAATATATTT CCCCGTCCCG GAATTATGGA ATAGATGCCT TGCGCATTGT CGCCATGATG
ATGGTTTTAA TTCTTCATCT TTTGGCTTCC ATTGATGTTT TGTCTTTGGA GAACCATGGG
TCCGCTTCCT ATAATGTCGG GTGGCTGCTG GAGATTGCCG CTTATTGCGG TGTGAATTGT
TATGCGTTAA TTACAGGATA TGTGTGCTGC GACGGAACGT TCAGGTATGA ACGTGTGGTT
TCCTTATGGT TTCAGGTTGT TTTTTATACA TGGGGCAGTT TGCTGCTGGC TCTGTTGTTT
TTCCCCCAGG AGGTTCAGTT GAGTAATATT CTGCATTCCC TTTTTCCGGT TTTATCCGGC
CAATATTGGT ATGTGACGGC GTATGTGGGG CTGTTTTTCT TCATTCCGTT TCTTAACGCC
CTGGGGAACA GGCTGACCAA ATTACAGTTC CAGTATCTTC TGGTTACCGT TTTCATGCTG
TTTTCCATTA TTCCCACCCT GCTTCACACG GATGTGTTCC CGGTGGAAGA AGGGTATAGC
ATCTGGTGGC TGGGCATCCT TTACATGCTG GGGATGTATG TTAAAAAGCA TGGCTTGCTG
ACGGGAATGA AAACGCGTCC GTTATGGATG TTTTATGCCG GATGCGTATG TTTTGCCTGG
GTTTTCAAGA TGGTTCTGAA TGTGGTGTCT CCATACCTGA TTGGGCAGAT CAAGGGCGGT
GGAATGTTTA TTCGCTATAA TTCTCCTTTT ATTGTAGGGC CGGCGGTTGC CCTGTTGCTG
ATTTTTTCCC GGATGCATTT TTCATCCCGA AGGGCTGTCT CCTGTATTTC ATGGCTGGCG
GCAGCGTCGT TCAGCGTTTA TGTGCTGCAT TGTAATGCTT TGATAGGAAA ATGGTTTTTG
TGGGATGTCT TTGAGTGGAC AGCGTCTTCT TCCTCAGCCC TGATGGTTGT GAACGTGTTG
GCAATAGCCG CCGTGGTTTA TGCCGGATGC GCCTTGGTGG ACTCCGTGCG GCGTTATTTA
TTTAAGCTCA TGAACGTGGA AAGGGGCGCC CGGGCGGTGA CGGGCTTTTG CGGAAAGCTG
GGGCATGCGT TCCGGAAGAT GTGCCGCCGG ATAGATTCGC ATCCCTAA
 
Protein sequence
MSLPLADDEE KGKRFSCGCP EYISPSRNYG IDALRIVAMM MVLILHLLAS IDVLSLENHG 
SASYNVGWLL EIAAYCGVNC YALITGYVCC DGTFRYERVV SLWFQVVFYT WGSLLLALLF
FPQEVQLSNI LHSLFPVLSG QYWYVTAYVG LFFFIPFLNA LGNRLTKLQF QYLLVTVFML
FSIIPTLLHT DVFPVEEGYS IWWLGILYML GMYVKKHGLL TGMKTRPLWM FYAGCVCFAW
VFKMVLNVVS PYLIGQIKGG GMFIRYNSPF IVGPAVALLL IFSRMHFSSR RAVSCISWLA
AASFSVYVLH CNALIGKWFL WDVFEWTASS SSALMVVNVL AIAAVVYAGC ALVDSVRRYL
FKLMNVERGA RAVTGFCGKL GHAFRKMCRR IDSHP