Gene Amuc_0419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0419 
Symbol 
ID6274837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp499800 
End bp501761 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content57% 
IMG OID642612469 
ProductGeneral substrate transporter 
Protein accessionYP_001877038 
Protein GI187734926 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACC AAGAAAACAT GAAAGCCGCC GTTCCGGATG CGGCCAGGCG ATATATGCGC 
TATCTCCTCA TCATGGCCGG TTTGGGCGGC CTGCTCTACG GTGTGGACGT TGGCGTGATT
GCTGCCGCGC TTCCCTACAT TGAGCAGACG GCCGGTTTTA ATCCCTCCCA GCTTTCCCAG
GTGGTGGCCG CCGTGCTGTT CGGCAGCGTT CTTTCTTCTC TGTTTGCGGG CTATCTGGCC
GACAAGATGG GACGCAAGGC GTTGATTACC GTGGCCGCAG CCTTGTTCAC GGCGAGCATC
CCCGTGATCT GCCTGTCCCA GGAAGTATTC GGCATTATGT TGCTGGGCCG TATTCTTCAG
GGCGCCAGCG CCGGTATTGT GGGCGTGGTC GTTCCGCTGT ATCTGGCCGA GTGCCTGAGC
GCCGAGTCCC GCGGCAAGGG AACGGGAATG TTCCAGTTCC TGCTGACGGT GGGCCTGGTG
TTTGCCGCCG TCGTCGGTTT GCTTGCCGCC AGTTATGTGG GCGGCGTCGA GAATTCCGGC
GCGAGCGAAG AATCGCTGAC TTCCGCCAAG GTTCTGGCCT GGCAGGCTAT TTTCTGGGTT
TGCGCCATTC CCGGCCTGTT CCTGTTTTTC GGTTCCTTCC GTTTGAGCGA ATCTCCCCGC
TACCTGTTCC GCCGCGGCCG CAAGGATGAA GCCATGGCTG TGCTGGTGCG CAGTTACGGG
GATGCCCGCG CCAAGGAAGT GTTTGATGAA ATGGTCCATA TTGAAGAGGA AGAGAAGCAG
AAGGCGGAGG AACTCAAAAA GCAGAGTTCT TCCGGCGAGT CCCTTCTTCA GCGCAAGTAT
ATCTATCCCT TTGTTCTGGC GGTGCTCGTT CTGGCGTTTA CGCAGGCTAC GGGCATTAAT
TCCGTGCTGA ATTATTCCGT GAAGGTATTC CAGCAGGCAG GCTTGGAAGG CACCACCGCC
AACTGGGCAG ACTTTACCAT CAAGGTCGTG AACTGCTTGA TGACTATTGT CGCCATGGTG
CTGGTGGACC GCAAGGGCCG CAAGTTCCTG CTTAAAATCG GAACGGCCGG CATCGTGGTC
GGCCTTCTTG GCACCGGGTT CCTGTTCAAT AATGTGGAAA AAGCCCGCAA GGATGTGACT
GCGGATGTGG CTGCCCTGCT GGCCGCCCAG AGCCCTTCCG TCCAGAAGGA GTTTGAACAG
GGCAAGGATG TGGGTTCCAT CCGAACGCTT CAATTGGAAC GCACTCCGGA TTCCCCCTTC
ATCAGGAATC TTCTTGCCAA GAACGGCATG GCCGACAAGG ATATCAACAG GATGCAGCTC
ATCATCACCT ATGACCAGCC GGAAGCCAAT CCCGCCTGGT ACCAGTTCCT GATGGGGTCT
TCCACCCAGC TTTCCGTCGT AGAGTTTTCC GAACTGACCA AGGATACCAA GGATATCAAG
AAGGAAGAGG ACAGGGCTTC CCTGGCCGTG ATCAAGGCCG TTCCGGATTC CACCAATAAA
ATGGTGGTGA ACGGCAAGGA CGGCTATGCC ATGAAGCCCG TTTCCATTCT GAAGGCGGAA
TTGGGCGAAA AGCCGGATAC CTCCATGGGC TGGGGCGTGA CTGCGTTCTT CATTATCTTC
ATTGCTTTTT ATGCCACCGG CCCCGGCGTA TGCGTCTGGC TGGCACTGTC CGAGCTGATG
CCAGCCCGCA TCCGCTCCAA CGGTATGGCG ATTGCTCTGT TGATCAACCA GCTGGTTTCT
ACGGTTATCG CCGGTTCCTT CCTCCCGTGG GTGGGCAGCT GCGGTTATTC CGGCGTGTTC
TTTACGCTGG GCGGCATTAC GGTGCTGTAT TTCATTACGG TGACCTTCTT CCTGCCTGAA
ACCAAGGGAC GTTCCCTGGA GGAAATTGAA GGTTACTTCA CAACAGGCAA GATGCCGGAA
GATCCCAAGA TGATCGGCGA AGGCATAGAA GCGGAGGAAT AA
 
Protein sequence
MTDQENMKAA VPDAARRYMR YLLIMAGLGG LLYGVDVGVI AAALPYIEQT AGFNPSQLSQ 
VVAAVLFGSV LSSLFAGYLA DKMGRKALIT VAAALFTASI PVICLSQEVF GIMLLGRILQ
GASAGIVGVV VPLYLAECLS AESRGKGTGM FQFLLTVGLV FAAVVGLLAA SYVGGVENSG
ASEESLTSAK VLAWQAIFWV CAIPGLFLFF GSFRLSESPR YLFRRGRKDE AMAVLVRSYG
DARAKEVFDE MVHIEEEEKQ KAEELKKQSS SGESLLQRKY IYPFVLAVLV LAFTQATGIN
SVLNYSVKVF QQAGLEGTTA NWADFTIKVV NCLMTIVAMV LVDRKGRKFL LKIGTAGIVV
GLLGTGFLFN NVEKARKDVT ADVAALLAAQ SPSVQKEFEQ GKDVGSIRTL QLERTPDSPF
IRNLLAKNGM ADKDINRMQL IITYDQPEAN PAWYQFLMGS STQLSVVEFS ELTKDTKDIK
KEEDRASLAV IKAVPDSTNK MVVNGKDGYA MKPVSILKAE LGEKPDTSMG WGVTAFFIIF
IAFYATGPGV CVWLALSELM PARIRSNGMA IALLINQLVS TVIAGSFLPW VGSCGYSGVF
FTLGGITVLY FITVTFFLPE TKGRSLEEIE GYFTTGKMPE DPKMIGEGIE AEE