Gene Amuc_1405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1405 
Symbol 
ID6275792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1678686 
End bp1680653 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content57% 
IMG OID642613461 
Productglycosyl transferase group 1 
Protein accessionYP_001878009 
Protein GI187735897 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG0438] Glycosyltransferase
[COG2908] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.873771 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATTG ATGTGGTAAC GGATACGTAT GAACCTGATG TAAATGGAGT GGCCCTGACG 
CTGGGGCGTC TGGTCGGCGG CCTCCGGATG CGCGGGCATC TGGTGCATGT GATGAGGGCT
TCTGAACGGA ACGACTCTTT CGACCCTGGG GAGACGGCTA TGCCGTCCCT TCCCCTCCCC
ATGTACCATG AGGTGAAGAT AGGGCTGCCT TCTGCAGACA GGTTCCGCGC CAGATGGATG
AAAAAAAGGC CGGATGTGGT ATACGTGGCT ACGGAAAGCC CCATGGGAGC GTCTGCCGTG
AAGGCCGCCC GTACGCTGGA GGTTCCCGTG GTGATGGGAT TCCATACCAA TTTCCACCAG
TATATGAAGG ATTACCATTT CTCCAGACTG GAAACGGCCG CCGTGAATTA CCTGCGCAAA
CTCCACAACA GGGCAGGGAT GACAGTGGTT CCCACGGAGG AAATGCGCCG TACGCTGGAG
GGGATGGGGT TTGAACGCCT TTCCGTCATG GGGCGCGGAG TGGATGCCGT CCTGTTTGAT
CCCGCCAGGA GGGATGCTTC CCTGCGCCAG TCATGGGGCG TCTGGCGGGA TGAAGTGGTA
TTTGGCGTCG TGGGGCGTCT GGCGCGGGAA AAGAACCTGG TGACGGCCCT GGGGCTGTAC
ACGCGCTTGC AGAGGGAGTT TCCGGATTGC GGCATGAAAA TGGTGGTGGT AGGGGACGGC
CCCATGATGA ACAGCCTGCG CAGTGAGTTT CCTGATGCCG TTTTTTGCGG AATGCGCCGT
GGGGAAGACC TGGCGCGCCA TTATGCCGCC ATGGACGTTT TGCTGTTCGC CAGTGAGACG
GAAACTTTCG GGAACGTTTT GCTGGAAGGC ATGGCCAGCG GTTTGGCTAC GGTCAGCTAC
CGGTATGCCG CTTCCGCGGA TGTGGTGCTG GACGGCATCA ACGGCCTTCA GGCGGAGAAA
GGGGATGAAG AGGGGTTTTA TTCCGCCATG CGCCGTCTGC TGGAAGACGG GGAGATGATT
CGGAGGCTTG GGAAACAGGC CCGTAGGACG GTGAATTCCA GAACTTGGGA TTCCATCCAT
GACAGGTTTG AAGAACTGCT GGCTTCCGTG ACTCGGGAGG AAAACGGGAC CGGCTGCGCC
CGTCCCCCGC GGGATGCCGT GCTGGAATGC CGCACGGTTT TTTTGTCGGA CCTTCATCTG
GGAACGAAAG ACTGCAAGGC GGACGAATGC CGGAAGTTTC TGAAACATGT CCGGGCTGGA
AAAATTGTGC TGGTGGGGGA TATTGTGGAT GCTTGGGCTT TGTCCCGCGG CAGCCGCTGG
CGGCGGAGGC ATACGCGCTT TGTCCGCACC TTGTTGAAAA AGATGGAACA GGAGGATGTG
GAAGTCCTGT ACCTGCGCGG AAATCACGAT GATATTCTGG AAAAATTCCT GCCTTTTCAT
CTGGGCGGTT TGAAAATTGC CAGGGAGTAC GTGCACCAGG CTGCGGATGG AAGACGCTAC
CTGTGCGTGC ATGGGGACGG TTTTGACGCC ATTTCCACCA ATCACCGGTG GCTGGCGATG
CTCGGCTCTC TGGGTTATGA CGTTTTGCTG ATGGTCAACC GGTTTTACAA CAAATACCGC
GCCTGGCGGG GCAGGGAATA TCATTCCGTT TCCCGCGCCA TCAAGGGCCG CGTGAAATCC
GCCGTCAATT TTATTGGAAA ATATGAGGAA CAGCTTCAAC GCCTGGCGGT AAAGAGGCAG
TGTGACGGCA TCATCGCCGG GCATGTGCAT CATCCGGCGG ATACCATGGT GGGAAAGGTG
AGGTATCTGA ATTGCGGGGA CTGGGTGGAA ACCATGAGCG CAGTGCTGGA ATACGGCGAC
GGCCGTATGG AAACGGTGCT TTACAAGGAT TTTATGAAGC GCCTGGCATA TGGCAAATGC
GGAACCATGG AGGACGCGGA AACTTCTCCC GGCACGGGGG ATTCATGA
 
Protein sequence
MRIDVVTDTY EPDVNGVALT LGRLVGGLRM RGHLVHVMRA SERNDSFDPG ETAMPSLPLP 
MYHEVKIGLP SADRFRARWM KKRPDVVYVA TESPMGASAV KAARTLEVPV VMGFHTNFHQ
YMKDYHFSRL ETAAVNYLRK LHNRAGMTVV PTEEMRRTLE GMGFERLSVM GRGVDAVLFD
PARRDASLRQ SWGVWRDEVV FGVVGRLARE KNLVTALGLY TRLQREFPDC GMKMVVVGDG
PMMNSLRSEF PDAVFCGMRR GEDLARHYAA MDVLLFASET ETFGNVLLEG MASGLATVSY
RYAASADVVL DGINGLQAEK GDEEGFYSAM RRLLEDGEMI RRLGKQARRT VNSRTWDSIH
DRFEELLASV TREENGTGCA RPPRDAVLEC RTVFLSDLHL GTKDCKADEC RKFLKHVRAG
KIVLVGDIVD AWALSRGSRW RRRHTRFVRT LLKKMEQEDV EVLYLRGNHD DILEKFLPFH
LGGLKIAREY VHQAADGRRY LCVHGDGFDA ISTNHRWLAM LGSLGYDVLL MVNRFYNKYR
AWRGREYHSV SRAIKGRVKS AVNFIGKYEE QLQRLAVKRQ CDGIIAGHVH HPADTMVGKV
RYLNCGDWVE TMSAVLEYGD GRMETVLYKD FMKRLAYGKC GTMEDAETSP GTGDS