Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0419 |
Symbol | |
ID | 6274837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 499800 |
End bp | 501761 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642612469 |
Product | General substrate transporter |
Protein accession | YP_001877038 |
Protein GI | 187734926 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACC AAGAAAACAT GAAAGCCGCC GTTCCGGATG CGGCCAGGCG ATATATGCGC TATCTCCTCA TCATGGCCGG TTTGGGCGGC CTGCTCTACG GTGTGGACGT TGGCGTGATT GCTGCCGCGC TTCCCTACAT TGAGCAGACG GCCGGTTTTA ATCCCTCCCA GCTTTCCCAG GTGGTGGCCG CCGTGCTGTT CGGCAGCGTT CTTTCTTCTC TGTTTGCGGG CTATCTGGCC GACAAGATGG GACGCAAGGC GTTGATTACC GTGGCCGCAG CCTTGTTCAC GGCGAGCATC CCCGTGATCT GCCTGTCCCA GGAAGTATTC GGCATTATGT TGCTGGGCCG TATTCTTCAG GGCGCCAGCG CCGGTATTGT GGGCGTGGTC GTTCCGCTGT ATCTGGCCGA GTGCCTGAGC GCCGAGTCCC GCGGCAAGGG AACGGGAATG TTCCAGTTCC TGCTGACGGT GGGCCTGGTG TTTGCCGCCG TCGTCGGTTT GCTTGCCGCC AGTTATGTGG GCGGCGTCGA GAATTCCGGC GCGAGCGAAG AATCGCTGAC TTCCGCCAAG GTTCTGGCCT GGCAGGCTAT TTTCTGGGTT TGCGCCATTC CCGGCCTGTT CCTGTTTTTC GGTTCCTTCC GTTTGAGCGA ATCTCCCCGC TACCTGTTCC GCCGCGGCCG CAAGGATGAA GCCATGGCTG TGCTGGTGCG CAGTTACGGG GATGCCCGCG CCAAGGAAGT GTTTGATGAA ATGGTCCATA TTGAAGAGGA AGAGAAGCAG AAGGCGGAGG AACTCAAAAA GCAGAGTTCT TCCGGCGAGT CCCTTCTTCA GCGCAAGTAT ATCTATCCCT TTGTTCTGGC GGTGCTCGTT CTGGCGTTTA CGCAGGCTAC GGGCATTAAT TCCGTGCTGA ATTATTCCGT GAAGGTATTC CAGCAGGCAG GCTTGGAAGG CACCACCGCC AACTGGGCAG ACTTTACCAT CAAGGTCGTG AACTGCTTGA TGACTATTGT CGCCATGGTG CTGGTGGACC GCAAGGGCCG CAAGTTCCTG CTTAAAATCG GAACGGCCGG CATCGTGGTC GGCCTTCTTG GCACCGGGTT CCTGTTCAAT AATGTGGAAA AAGCCCGCAA GGATGTGACT GCGGATGTGG CTGCCCTGCT GGCCGCCCAG AGCCCTTCCG TCCAGAAGGA GTTTGAACAG GGCAAGGATG TGGGTTCCAT CCGAACGCTT CAATTGGAAC GCACTCCGGA TTCCCCCTTC ATCAGGAATC TTCTTGCCAA GAACGGCATG GCCGACAAGG ATATCAACAG GATGCAGCTC ATCATCACCT ATGACCAGCC GGAAGCCAAT CCCGCCTGGT ACCAGTTCCT GATGGGGTCT TCCACCCAGC TTTCCGTCGT AGAGTTTTCC GAACTGACCA AGGATACCAA GGATATCAAG AAGGAAGAGG ACAGGGCTTC CCTGGCCGTG ATCAAGGCCG TTCCGGATTC CACCAATAAA ATGGTGGTGA ACGGCAAGGA CGGCTATGCC ATGAAGCCCG TTTCCATTCT GAAGGCGGAA TTGGGCGAAA AGCCGGATAC CTCCATGGGC TGGGGCGTGA CTGCGTTCTT CATTATCTTC ATTGCTTTTT ATGCCACCGG CCCCGGCGTA TGCGTCTGGC TGGCACTGTC CGAGCTGATG CCAGCCCGCA TCCGCTCCAA CGGTATGGCG ATTGCTCTGT TGATCAACCA GCTGGTTTCT ACGGTTATCG CCGGTTCCTT CCTCCCGTGG GTGGGCAGCT GCGGTTATTC CGGCGTGTTC TTTACGCTGG GCGGCATTAC GGTGCTGTAT TTCATTACGG TGACCTTCTT CCTGCCTGAA ACCAAGGGAC GTTCCCTGGA GGAAATTGAA GGTTACTTCA CAACAGGCAA GATGCCGGAA GATCCCAAGA TGATCGGCGA AGGCATAGAA GCGGAGGAAT AA
|
Protein sequence | MTDQENMKAA VPDAARRYMR YLLIMAGLGG LLYGVDVGVI AAALPYIEQT AGFNPSQLSQ VVAAVLFGSV LSSLFAGYLA DKMGRKALIT VAAALFTASI PVICLSQEVF GIMLLGRILQ GASAGIVGVV VPLYLAECLS AESRGKGTGM FQFLLTVGLV FAAVVGLLAA SYVGGVENSG ASEESLTSAK VLAWQAIFWV CAIPGLFLFF GSFRLSESPR YLFRRGRKDE AMAVLVRSYG DARAKEVFDE MVHIEEEEKQ KAEELKKQSS SGESLLQRKY IYPFVLAVLV LAFTQATGIN SVLNYSVKVF QQAGLEGTTA NWADFTIKVV NCLMTIVAMV LVDRKGRKFL LKIGTAGIVV GLLGTGFLFN NVEKARKDVT ADVAALLAAQ SPSVQKEFEQ GKDVGSIRTL QLERTPDSPF IRNLLAKNGM ADKDINRMQL IITYDQPEAN PAWYQFLMGS STQLSVVEFS ELTKDTKDIK KEEDRASLAV IKAVPDSTNK MVVNGKDGYA MKPVSILKAE LGEKPDTSMG WGVTAFFIIF IAFYATGPGV CVWLALSELM PARIRSNGMA IALLINQLVS TVIAGSFLPW VGSCGYSGVF FTLGGITVLY FITVTFFLPE TKGRSLEEIE GYFTTGKMPE DPKMIGEGIE AEE
|
| |