Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0334 |
Symbol | |
ID | 6275008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 391336 |
End bp | 394161 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642612385 |
Product | protein of unknown function DUF214 |
Protein accession | YP_001876954 |
Protein GI | 187734842 |
COG category | [V] Defense mechanisms |
COG ID | [COG0577] ABC-type antimicrobial peptide transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.530518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0000549339 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGTTGA TGTTCAAACT GATTTGGAGG GATCTGCGCG CCAACCCCGG CCGAATGGCT GTCAGCGTGT TCGCCATCCT TGTCTCCGTC AGCCTCATTG TATGGATGAT GGGGAGCTAT GATACGCTGG TCAGGGAGTT TGACAATGAT GCGGAGGCCT ATATGGGGAA TTATGACCTC TGCCTGGTGC CGGAGTCTCC CAAAGGGCCC CTTCCTCCCG GACAGTACCC GCAGTTCCGT GACCCGGAGC TGCCCGCCCG TCTGGCCGCC TCTCCCCTTG TGGAAACGGT GAACGCCGCG TGCCAGGTGC CGCGCCTCCA AATCGGCTGC GGAAATGAGC GGGGCAGCTT TGACGAGCAG ACGCGCGACC GCATGGGAAT TCCTCCCCAG AGCCCCATCC TGGTGGGAAA CCATGCTGTG GAGTGTCCTT ATGAATTGAA AGAGGGAATT TGGCCGGATA TGGCTTCCTC CTCAGCCATG GAGGGGGTGC TGGGCAGCGG AAGCGCCAAA TATTTCAGCG CGGGAGTGGG GACTGTCATG AACGTGCGCG TGGGAACGCA TGTGTACGAC GTAAAGATTG TGGGCATCGT CAAGCAGGCC AAGGCCACCC CCGGCGTCAT TATGGGGCCG GGCGGCATGT CCGGCCCGGC TTTTTCCTCC CTTTTTGTGC CCGTGAAGGT ATGCGAGAAA ATTACGGGGC AGCCTTTTGC CCCTAATTTG ATTTACGTTC AGCTCAAGGA AGGGGTGGAC AAAAAGGAAT TTGCTGAGAA TTTCCGGCAG GAGCTGGTCC AGGCTTCCGC TGCGGTGGCG GATACGGATT CCATCATCCG GCGGCTTTCC AGCGACCGGG CCGTCCGTTC ACAGAAGGAT AGCGCGGAAA TGTCCGTCTG GCTTGTCCTG TTTTCCTGTA TTTTCATTAT TTTCACAACG TTGAGCATTG GCGTCAGCGA ACGTGCCCGC AGACTGGCCC TGATGCGTGC TCTGGGGCTG GGGCGCATGC AAATTGCACT GCTGATTGCA GGGGAGGGGA TTTTTCTGTG CATTCCGGCA TTGCTGGGAG GGTTGGCCGC AGGCTTTTTC CTGGTGTACC TGCTGGAGGA AGGCTCTGCC TCTGTACCCG TGCTGACCTG GTCCACAGTA TTAACCGCCG CCGTATGCGC TGTGGGCGGG GCCTTGCTGG CTTCCATTAT TCCTGCGTGG CGCGCTTCCC GCCAGTCTCC GCTGGAAGCG GCCGTTCCTT CCTCCGGATT CATCGGGAAG GTGAGCCGTG TTCCTGTGTG GTCTGTCGTG GCGGGCCTCG CATGCGTGTG TCTTCAGCCT GCGGCCCTGC TGTTGCCGGG CCTGGAGGTA GAAACGCGCA AATGGATATT TTTCTGGCTG GGTTACCCCG GCCTGGTAGC CGGTGCGCTG TTTCTGGCCC CCACCTTCGT GCGCGTGACG GAGTGGGCCG GGGCATGGAT TACCGGATTT CTGCTGCACG TTCCGCATTC TTTCCTGAAA ATGCAGCTCA GCCGCAACCT CAGCCGTTCC GTGGGAACTG CCGTATCCAT GTCTGTGGGC CTCTCTCTTT TTGTGGGGGT GCAGACATGG GGCTATTCCA TGCTGGTTCC TTTTTCCCCG GATACGTCAA CACCCGGAAC CCTGGTTTCC TTCCTGCATA CGGAGTTCAA ATCTGCGGAT GTTCCGGAGC TGATGGCGCG CCCCAGCCTG CGGAATTCCC GAATGTATCC CATCTACGTG GATGAACCGG ATATTGCTCC GGCACAGATG AAAAGCCCCG GTTTTTCCGG GATGCGCAAC CGTTCCATCG TACTGGCCGG CATTCCCGTG GCGGAGATGG CCGGGGGGAG CCATCCGCTG TTCAACCCCG TGTTTGTTTC CGGAAATCCG CAGGAGGCTT ATGCCATGCT GGAATCCACC CGTTCTCTCC TGATTCCGGA CACGTTTGCA CGGACGGTGG GGTTGAAGGT CGGGGATGAC TTGATGCTGG TTAATCCGTC TTCCCGGGAA CGCCGTTCCG GAAATGAGCC CTCTGCCGGT ATTCGCGGCC GGGGCAGGGG TGGCGCCGTA CGGGGAGAAC CCTGGAAGGT GGCGGGAATA GTTTCCTTCC CCGGCTGGCA TTGGTTGACC AAGACCAGCG GAATGCGCGT GCGCCGCGGG GGTTTTGTTG CCGCTTTGGC GATTGCCGAT GAACGCTGGC TCAAAGAGGA ATACGCTCAT CAGGGATTCC AGTTTATCTG GGGGGATACT GCTCCGGGGA TCAGCAACGT GGAACTTCAG AACGACCTGG GGGAATACGC CCTGATGAAG GTGCGGGAGC AGGAAAACGG AGGGGAAGGC GCCAAGCCTC TGGTGAAGGC CCTGACCCGG GAAAGCCTGG GAGGAAGCGT TACCAGCCGC GGGGACGATG TGATTTTTAC GATGAGCAAG CTTCCCATCA TCATGATGGT GATCGCCGTT CTGGCCGTAC TTAACACCGT CCTGGCTTCC GTGCAGTCCC GCCGCCGGGA GTTTGGATTG ATGCGTGCGG TGGGCGTGCC GGGCGGCATG GTGATGCGGA TGCTCTGGGC GGAAACGCTG ATGGTTTCCC TGTGCTCCGT TGTCATGAGT CTTGTGCTGG GAGTCCTGGG CGCCTGGTGC TCCATCCAGA TTTTGGAATA CGGCTATCAC TTCGGCGTCG TTACTCCTCC TGTCACGATG CCTTGGGCGC ATCTGGCCTG TGCTGTGCTT TTGGTGCTTT CCCTCTCATC CTTGGCGTGC CTTCTGCCTG CGTGGCGCAT GAAGCATGCG TCCGTAACGG ATTTGCTTTC CGTTCGCGAG GGGTAG
|
Protein sequence | MTLMFKLIWR DLRANPGRMA VSVFAILVSV SLIVWMMGSY DTLVREFDND AEAYMGNYDL CLVPESPKGP LPPGQYPQFR DPELPARLAA SPLVETVNAA CQVPRLQIGC GNERGSFDEQ TRDRMGIPPQ SPILVGNHAV ECPYELKEGI WPDMASSSAM EGVLGSGSAK YFSAGVGTVM NVRVGTHVYD VKIVGIVKQA KATPGVIMGP GGMSGPAFSS LFVPVKVCEK ITGQPFAPNL IYVQLKEGVD KKEFAENFRQ ELVQASAAVA DTDSIIRRLS SDRAVRSQKD SAEMSVWLVL FSCIFIIFTT LSIGVSERAR RLALMRALGL GRMQIALLIA GEGIFLCIPA LLGGLAAGFF LVYLLEEGSA SVPVLTWSTV LTAAVCAVGG ALLASIIPAW RASRQSPLEA AVPSSGFIGK VSRVPVWSVV AGLACVCLQP AALLLPGLEV ETRKWIFFWL GYPGLVAGAL FLAPTFVRVT EWAGAWITGF LLHVPHSFLK MQLSRNLSRS VGTAVSMSVG LSLFVGVQTW GYSMLVPFSP DTSTPGTLVS FLHTEFKSAD VPELMARPSL RNSRMYPIYV DEPDIAPAQM KSPGFSGMRN RSIVLAGIPV AEMAGGSHPL FNPVFVSGNP QEAYAMLEST RSLLIPDTFA RTVGLKVGDD LMLVNPSSRE RRSGNEPSAG IRGRGRGGAV RGEPWKVAGI VSFPGWHWLT KTSGMRVRRG GFVAALAIAD ERWLKEEYAH QGFQFIWGDT APGISNVELQ NDLGEYALMK VREQENGGEG AKPLVKALTR ESLGGSVTSR GDDVIFTMSK LPIIMMVIAV LAVLNTVLAS VQSRRREFGL MRAVGVPGGM VMRMLWAETL MVSLCSVVMS LVLGVLGAWC SIQILEYGYH FGVVTPPVTM PWAHLACAVL LVLSLSSLAC LLPAWRMKHA SVTDLLSVRE G
|
| |