Gene Amuc_0104 details

Gene Information

Locus tag: Amuc_0104
Symbol:
ID: 6274955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 128919
End bp: 130283
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 59%
IMG OID: 642612149
Product: protein of unknown function DUF214
Protein accession: YP_001876730
Protein GI: 187734618
COG category: [V] Defense mechanisms
COG ID: [COG0577] ABC-type antimicrobial peptide transport system, permease component
TIGRFAM ID:
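
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Residue count for an unspliced CDS of the given length in bp."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # Assumption: the terminal stop codon is part of the gene but not the protein.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(128919, 130283)
print(length)                     # 1365
print(protein_length_aa(length))  # 454
```

Both values match the Gene Length and Protein Length fields in this record.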


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCC TTCCTCTCCT TAAAACCTGC ATCAGGGCCC TGGTGCGCAA TCCCATGCGC 
GCAGCCCTCA CCATCCTGGG CATCATCATC GGCATTGCCG CAGTAATCGC CATGGTGGAA
ATCGGCCAGG GATCTACGCT CCAGATTAAA AATACCATCG CTTCCATGGG GGCGGACACG
CTGAATATCC GCCCCGGAGC CATCTCCAAA AGCGGCGTCA ATACCGGCGC GGGCGGACGG
GCCTCCCTGA CCAACGCAGA CTGCGAAGCC ATTGCGAAGG ACTGCCCCAT GGTTCTGCGG
GCTACCCCCG TGGTGCGCGC CAGCGGCCAA GTCATTTACG GCAACAAGAA CTGGAGCCCG
GAAACTGTAG AGGGCGGTTC CGTGGAATAC CTGAAAATCA AAAGCTGGTA TGACATGGCG
CGAGGCCAGC CCTTCTCCGA GGAAGATGTG GAACAGGCCA GGCGCGTGTG CGTCATTGGC
CAGACGGTGG CCAAGGAACT TTTCGGGGAC GAAGACCCGC TGGGCAAGGA TATCCGCATC
AAGAATGTCA TGTTCAAAGT CATCGGCATC CTTCAGAAAA AAGGGGCCAA CATGATGGGA
CGCGACCAGG ACGACTCCAT CATCCTCCCG TGGACAAGCA TCCGCTACCG CCTCCAGGGC
CTGGGCGGCG GTTCCACCAC CACTTCCACC GGCAACAGCA CCACCACCTT CAACCGGGCA
GATAAATACA CCGCCAGTTC CGTGGATTAC TACACGGAAA CTACGGACCA GCCCTATACG
GACGCGCCCC ATCCGCGGCG CTTCAACAAT ATTGATTCCA TCATGGCTCA GATTTCAGAC
CCGGAACGCT CCTCCGAGGC CATTGACCAG ATTACGGAAG TGATCCGTGC CAAACACAAC
CTCAAGGACG GCCAGCTGGA CGATTTCCGG GTATGGGACA TGGCGGAAAT GTCCCGCGCC
ATGAGCAGCA CCACGGAAGT GATGACCAAT CTGCTGATGA TCGTGGCCAT GATCTCTCTG
GTCGTCGGCG GCGTCGGCAT CATGAATATC ATGCTCGTTT CCGTCACGGA ACGGACCAAG
GAAATTGGCC TGCGCATGGC GGTGGGCGCC CGTCCGCAGG ATATCATGCG CCAGTTCCTG
CTGGAAGCGG TGCTGCTCTG CGTGGTGGGC GGCGCGCTGG GCATCATGCT CGGCAAGGCG
ATCTCCATCA TCGTCAGCCG CACCATGAAC TGGGCCACGG CCTCCTCCCC GGAAGCCATG
GCTCTGGCTG TAGGCGTCTC CGTATTCATC GGCCTGGCCT TCGGATGGTA CCCCTCCTGG
AAGGCGTCCA AGATGGACCC CATTGATGCC CTTCGCCACG AATAA
 
Protein sequence
MKFLPLLKTC IRALVRNPMR AALTILGIII GIAAVIAMVE IGQGSTLQIK NTIASMGADT 
LNIRPGAISK SGVNTGAGGR ASLTNADCEA IAKDCPMVLR ATPVVRASGQ VIYGNKNWSP
ETVEGGSVEY LKIKSWYDMA RGQPFSEEDV EQARRVCVIG QTVAKELFGD EDPLGKDIRI
KNVMFKVIGI LQKKGANMMG RDQDDSIILP WTSIRYRLQG LGGGSTTTST GNSTTTFNRA
DKYTASSVDY YTETTDQPYT DAPHPRRFNN IDSIMAQISD PERSSEAIDQ ITEVIRAKHN
LKDGQLDDFR VWDMAEMSRA MSSTTEVMTN LLMIVAMISL VVGGVGIMNI MLVSVTERTK
EIGLRMAVGA RPQDIMRQFL LEAVLLCVVG GALGIMLGKA ISIIVSRTMN WATASSPEAM
ALAVGVSVFI GLAFGWYPSW KASKMDPIDA LRHE
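
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The following is a minimal sketch, not the pipeline used to generate this record. It assumes the sequence is read in frame from its first base, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 15 bases of the gene sequence above.
print(translate("ATGAAATTCC TTCCTCTCCT TAAAACCTGC"))  # MKFLPLLKTC
print(translate("CTTCGCCACG AATAA"))                  # LRHE
```

The two printed fragments correspond to the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.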