Gene Amuc_0334 details

Gene Information

Locus tag: Amuc_0334
Symbol:
ID: 6275008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 391336
End bp: 394161
Gene Length: 2826 bp
Protein Length: 941 aa
Translation table: 11
GC content: 59%
IMG OID: 642612385
Product: protein of unknown function DUF214
Protein accession: YP_001876954
Protein GI: 187734842
COG category: [V] Defense mechanisms
COG ID: [COG0577] ABC-type antimicrobial peptide transport system, permease component
TIGRFAM ID:
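
The start, end, gene length, and protein length fields above are arithmetically related. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_length = span_length(391336, 394161)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints "2826 941"
```

Both values agree with the Gene Length and Protein Length fields in the record.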


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.530518
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0000549339
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGTTGA TGTTCAAACT GATTTGGAGG GATCTGCGCG CCAACCCCGG CCGAATGGCT 
GTCAGCGTGT TCGCCATCCT TGTCTCCGTC AGCCTCATTG TATGGATGAT GGGGAGCTAT
GATACGCTGG TCAGGGAGTT TGACAATGAT GCGGAGGCCT ATATGGGGAA TTATGACCTC
TGCCTGGTGC CGGAGTCTCC CAAAGGGCCC CTTCCTCCCG GACAGTACCC GCAGTTCCGT
GACCCGGAGC TGCCCGCCCG TCTGGCCGCC TCTCCCCTTG TGGAAACGGT GAACGCCGCG
TGCCAGGTGC CGCGCCTCCA AATCGGCTGC GGAAATGAGC GGGGCAGCTT TGACGAGCAG
ACGCGCGACC GCATGGGAAT TCCTCCCCAG AGCCCCATCC TGGTGGGAAA CCATGCTGTG
GAGTGTCCTT ATGAATTGAA AGAGGGAATT TGGCCGGATA TGGCTTCCTC CTCAGCCATG
GAGGGGGTGC TGGGCAGCGG AAGCGCCAAA TATTTCAGCG CGGGAGTGGG GACTGTCATG
AACGTGCGCG TGGGAACGCA TGTGTACGAC GTAAAGATTG TGGGCATCGT CAAGCAGGCC
AAGGCCACCC CCGGCGTCAT TATGGGGCCG GGCGGCATGT CCGGCCCGGC TTTTTCCTCC
CTTTTTGTGC CCGTGAAGGT ATGCGAGAAA ATTACGGGGC AGCCTTTTGC CCCTAATTTG
ATTTACGTTC AGCTCAAGGA AGGGGTGGAC AAAAAGGAAT TTGCTGAGAA TTTCCGGCAG
GAGCTGGTCC AGGCTTCCGC TGCGGTGGCG GATACGGATT CCATCATCCG GCGGCTTTCC
AGCGACCGGG CCGTCCGTTC ACAGAAGGAT AGCGCGGAAA TGTCCGTCTG GCTTGTCCTG
TTTTCCTGTA TTTTCATTAT TTTCACAACG TTGAGCATTG GCGTCAGCGA ACGTGCCCGC
AGACTGGCCC TGATGCGTGC TCTGGGGCTG GGGCGCATGC AAATTGCACT GCTGATTGCA
GGGGAGGGGA TTTTTCTGTG CATTCCGGCA TTGCTGGGAG GGTTGGCCGC AGGCTTTTTC
CTGGTGTACC TGCTGGAGGA AGGCTCTGCC TCTGTACCCG TGCTGACCTG GTCCACAGTA
TTAACCGCCG CCGTATGCGC TGTGGGCGGG GCCTTGCTGG CTTCCATTAT TCCTGCGTGG
CGCGCTTCCC GCCAGTCTCC GCTGGAAGCG GCCGTTCCTT CCTCCGGATT CATCGGGAAG
GTGAGCCGTG TTCCTGTGTG GTCTGTCGTG GCGGGCCTCG CATGCGTGTG TCTTCAGCCT
GCGGCCCTGC TGTTGCCGGG CCTGGAGGTA GAAACGCGCA AATGGATATT TTTCTGGCTG
GGTTACCCCG GCCTGGTAGC CGGTGCGCTG TTTCTGGCCC CCACCTTCGT GCGCGTGACG
GAGTGGGCCG GGGCATGGAT TACCGGATTT CTGCTGCACG TTCCGCATTC TTTCCTGAAA
ATGCAGCTCA GCCGCAACCT CAGCCGTTCC GTGGGAACTG CCGTATCCAT GTCTGTGGGC
CTCTCTCTTT TTGTGGGGGT GCAGACATGG GGCTATTCCA TGCTGGTTCC TTTTTCCCCG
GATACGTCAA CACCCGGAAC CCTGGTTTCC TTCCTGCATA CGGAGTTCAA ATCTGCGGAT
GTTCCGGAGC TGATGGCGCG CCCCAGCCTG CGGAATTCCC GAATGTATCC CATCTACGTG
GATGAACCGG ATATTGCTCC GGCACAGATG AAAAGCCCCG GTTTTTCCGG GATGCGCAAC
CGTTCCATCG TACTGGCCGG CATTCCCGTG GCGGAGATGG CCGGGGGGAG CCATCCGCTG
TTCAACCCCG TGTTTGTTTC CGGAAATCCG CAGGAGGCTT ATGCCATGCT GGAATCCACC
CGTTCTCTCC TGATTCCGGA CACGTTTGCA CGGACGGTGG GGTTGAAGGT CGGGGATGAC
TTGATGCTGG TTAATCCGTC TTCCCGGGAA CGCCGTTCCG GAAATGAGCC CTCTGCCGGT
ATTCGCGGCC GGGGCAGGGG TGGCGCCGTA CGGGGAGAAC CCTGGAAGGT GGCGGGAATA
GTTTCCTTCC CCGGCTGGCA TTGGTTGACC AAGACCAGCG GAATGCGCGT GCGCCGCGGG
GGTTTTGTTG CCGCTTTGGC GATTGCCGAT GAACGCTGGC TCAAAGAGGA ATACGCTCAT
CAGGGATTCC AGTTTATCTG GGGGGATACT GCTCCGGGGA TCAGCAACGT GGAACTTCAG
AACGACCTGG GGGAATACGC CCTGATGAAG GTGCGGGAGC AGGAAAACGG AGGGGAAGGC
GCCAAGCCTC TGGTGAAGGC CCTGACCCGG GAAAGCCTGG GAGGAAGCGT TACCAGCCGC
GGGGACGATG TGATTTTTAC GATGAGCAAG CTTCCCATCA TCATGATGGT GATCGCCGTT
CTGGCCGTAC TTAACACCGT CCTGGCTTCC GTGCAGTCCC GCCGCCGGGA GTTTGGATTG
ATGCGTGCGG TGGGCGTGCC GGGCGGCATG GTGATGCGGA TGCTCTGGGC GGAAACGCTG
ATGGTTTCCC TGTGCTCCGT TGTCATGAGT CTTGTGCTGG GAGTCCTGGG CGCCTGGTGC
TCCATCCAGA TTTTGGAATA CGGCTATCAC TTCGGCGTCG TTACTCCTCC TGTCACGATG
CCTTGGGCGC ATCTGGCCTG TGCTGTGCTT TTGGTGCTTT CCCTCTCATC CTTGGCGTGC
CTTCTGCCTG CGTGGCGCAT GAAGCATGCG TCCGTAACGG ATTTGCTTTC CGTTCGCGAG
GGGTAG
 
Protein sequence
MTLMFKLIWR DLRANPGRMA VSVFAILVSV SLIVWMMGSY DTLVREFDND AEAYMGNYDL 
CLVPESPKGP LPPGQYPQFR DPELPARLAA SPLVETVNAA CQVPRLQIGC GNERGSFDEQ
TRDRMGIPPQ SPILVGNHAV ECPYELKEGI WPDMASSSAM EGVLGSGSAK YFSAGVGTVM
NVRVGTHVYD VKIVGIVKQA KATPGVIMGP GGMSGPAFSS LFVPVKVCEK ITGQPFAPNL
IYVQLKEGVD KKEFAENFRQ ELVQASAAVA DTDSIIRRLS SDRAVRSQKD SAEMSVWLVL
FSCIFIIFTT LSIGVSERAR RLALMRALGL GRMQIALLIA GEGIFLCIPA LLGGLAAGFF
LVYLLEEGSA SVPVLTWSTV LTAAVCAVGG ALLASIIPAW RASRQSPLEA AVPSSGFIGK
VSRVPVWSVV AGLACVCLQP AALLLPGLEV ETRKWIFFWL GYPGLVAGAL FLAPTFVRVT
EWAGAWITGF LLHVPHSFLK MQLSRNLSRS VGTAVSMSVG LSLFVGVQTW GYSMLVPFSP
DTSTPGTLVS FLHTEFKSAD VPELMARPSL RNSRMYPIYV DEPDIAPAQM KSPGFSGMRN
RSIVLAGIPV AEMAGGSHPL FNPVFVSGNP QEAYAMLEST RSLLIPDTFA RTVGLKVGDD
LMLVNPSSRE RRSGNEPSAG IRGRGRGGAV RGEPWKVAGI VSFPGWHWLT KTSGMRVRRG
GFVAALAIAD ERWLKEEYAH QGFQFIWGDT APGISNVELQ NDLGEYALMK VREQENGGEG
AKPLVKALTR ESLGGSVTSR GDDVIFTMSK LPIIMMVIAV LAVLNTVLAS VQSRRREFGL
MRAVGVPGGM VMRMLWAETL MVSLCSVVMS LVLGVLGAWC SIQILEYGYH FGVVTPPVTM
PWAHLACAVL LVLSLSSLAC LLPAWRMKHA SVTDLLSVRE G
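
The gene and protein sequences above are related through the translation table given in the record (table 11), and the GC content field is derived from the gene sequence. The following is a rough sketch of both computations in plain Python, not the database's own method. It assumes the space-separated blocks of ten are purely cosmetic, and it uses the amino acid assignments that translation table 11 shares with the standard code. Table 11 also allows alternative start codons to be read as M, which this sketch does not handle; that does not matter here because the gene starts with ATG. All names are hypothetical.

```python
from itertools import product

# Codon -> amino acid map built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the gene sequence, copied from the record.
fragment = "ATGACGTTGA TGTTCAAACT GATTTGGAGG"
print(translate(fragment))  # prints "MTLMFKLIWR"
```

The printed fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.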