Gene Amuc_1957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1957 
Symbol 
ID6275053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2372975 
End bp2374396 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content58% 
IMG OID642614017 
Productmembrane protein-like protein 
Protein accessionYP_001878551 
Protein GI187736439 
COG category[S] Function unknown 
COG ID[COG2364] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.217761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.00755155 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAGATT ATCGTGTCCG GAGAAGCCTT GGAGAACATG TCCTGCGCTG CTCCGTCCTG 
ATTGCTGCGC TCTTCATCAT GTCTCTGGGC ATCGCTCTGT CCACCAAGGC AGACCTGGGA
GTCTCCCCCA TCTCCTGCAC GCCCTATGTG CTCAGCCTGG CATTCCCTCT AACCATGGGG
ACCGTCACCA TCCTCATGCA CCTGAGCTTT GTGGCCGTGC AGGCGGCCCT GCTGAAAAGG
CAATTCCGTC CGGCGCATCT GCTCCAGATT CCCATAGCTT TTATCTTCGG CTTTCTGACG
GATTTTTCCA TGTGGATCAT AGCGCCTCTG GAACCGGACG GATACCTGTG GTCCGTTATT
CTCTGCCTGT TCAGCTGCGT GGTAATCGGA TTCGGCGTCT TTCTTCAGGT TAAGGCGGAT
TCCGTTCTTC TGGCAGGGGA AGGCATGAGC CTGGCCTTCG TCAAACTCTT CAAATGGGAA
TTCGGAGCCG TAAAAACCGG GATGGACTGC ACGCTCGTCT GCATCGGCCT GGCCTGTTCC
CTCATCTTCC TGCCCGGACT GACAGGCATA CGGGAAGGAA CCGTGGTGGC CGCCGTCCTG
GTGGGAATGA TCGTCCGTTT TTTCAACAGG CACGTCTTCT GGCCGGACAG GCTCCTGGAA
CGGCTGGCGC GCCCCGGAGC AGCAAGCGAG CTTCCTCCGC TGGCACAAAC GGCCGCTTAT
GCTCCGGACG CCCCTCTGGT CATTTCCATT GACCGGGAAT ACGGTTCCGG CGGCCATGCC
ATCGGGAAAA TGCTGGCGGA AAAGCTGGGC ATCCGATTCT ACGACTCGGA ACTGGTGTAC
CTCACAGCCT CCCGGAGCGG CCTCACTCCG GACTACATCC GCAAGCACGA ACAACAGCTT
TCCAGCCGCT TCCTGCACGA ACTTTACGCC CAGAACTATG CCTACACGGC GGAGGAAATG
CCCCCTGAAG ACGCCACCTT CCTGGCACAG AGCAAAGTCA TCCGGGACAT TACGGCCAGT
CAGGCATGCG TCATCGTGGG CCGCTGCGCC AACTTCATCC TGAAAGGGAG ACCCAACCTG
TTCAGTGTTT TCCTTCATGC GGACCGGGCC ACGCGCATGC AGCGCGTTAT TGAAAACTAT
GGGGTAGAAC CTGGCGGGGC AGCCCGGGCC ATGGACATCA TGGACTCCCG CCGCCGCACC
CACTGCCTGC ACTACACCGG GCAGGAACTG GGCAATGCAC GCCTCTACGA CTTGTGCGTC
AACACGTCCG ATTACGGACT GAAACGCACG GTGGAACTGA TTCTGGAAGC CATCAATACC
AGAACCGAAC AATCTTCCGC TGTAGAAACG GTTCCCGTCC GCTCCGCATC TTTTCCCGAA
CCGGAAGAGG ACAGCATTCC CGGAGAAATA TCCCTCGCCT GA
 
Protein sequence
MEDYRVRRSL GEHVLRCSVL IAALFIMSLG IALSTKADLG VSPISCTPYV LSLAFPLTMG 
TVTILMHLSF VAVQAALLKR QFRPAHLLQI PIAFIFGFLT DFSMWIIAPL EPDGYLWSVI
LCLFSCVVIG FGVFLQVKAD SVLLAGEGMS LAFVKLFKWE FGAVKTGMDC TLVCIGLACS
LIFLPGLTGI REGTVVAAVL VGMIVRFFNR HVFWPDRLLE RLARPGAASE LPPLAQTAAY
APDAPLVISI DREYGSGGHA IGKMLAEKLG IRFYDSELVY LTASRSGLTP DYIRKHEQQL
SSRFLHELYA QNYAYTAEEM PPEDATFLAQ SKVIRDITAS QACVIVGRCA NFILKGRPNL
FSVFLHADRA TRMQRVIENY GVEPGGAARA MDIMDSRRRT HCLHYTGQEL GNARLYDLCV
NTSDYGLKRT VELILEAINT RTEQSSAVET VPVRSASFPE PEEDSIPGEI SLA