Gene Amuc_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2001 
Symbol 
ID6274550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2430986 
End bp2433025 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content55% 
IMG OID642614061 
Productputative lipoprotein 
Protein accessionYP_001878593 
Protein GI187736481 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.629311 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.117944 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTGCG AGCGTCATTA TACGGCCCCC GTCAAGGGGG ATGGACAGTT TTTTTCCCTG 
GGCCTGAGCG TTCCATGTCA CACAAAGGGG CTTTTTCCCG GGCATGGGAG GGGCCGGGCG
TTTTTCCTTG AAACAATTCC GGAAACTGAA CAGTATTATT TAACAAATAT GATTTTCTCA
CCATTGTTGA AAAAGGGGGT TCTTTTATTG GCCCTCTGCG GCGTGCTGGG CGGCGGCGTG
CTTTCCGCCC AGGAAGCAGG GGGTGCGCCG ATTACGGAGG AGGGTTTGAA AGCATTGGAT
TCCAATGTGT TCGCCAATGA ATACATGCTG GCGTTGAAGC CCAATGTCAC GCGCGACCGC
ATCGCCAGAA TCAAGGACCC CTTTGTGCGC GGGCTCGCCA CGCGGATGGC GGAGAAGAAA
TTTGACCGCA AGGCCCGCGC CCTTACGGCG GAACCGTATG AACCGGTCAA GGATCTGGCG
GAACGCCTGC GTACCAGCCA GTACAGCAAA TTTGAAAATC CCACCGGCAT CTTTTTTGAG
GAGGGGGAGG ACGTGTTGCT GGTCATGGGG GATCCCAAAG GTGAAAAGCT GAATCTGGTT
ATCCACAACT TCGGGCGGGA CGGAGGCCAC AGCTCCTATC CGCTGAAGGA GGGGGTCAAC
ATCATCAGGG CGAAGAACAA GGGGCTTGGC TACATTGAGT ATTTCACCCC CAATTATAAG
AAGGCTCCCA AGGTGCATTT ATCCATTTTG TCCGGCAAGG TGAACGGAGT TTTCGTGGGA
GGAGTCAGCA AAAACAGCGA CTGGAAGAAG ATGCTGGAAA ACTCCCCCAC GGAAGTTGTG
GACATCGTGG GCAGCCGCGT GCATCTGGTG TATCCGGTGG AGGAACTTAA ACAATTCTGT
CCGGACAAGG GGGAGGAGCT TATTGCCCTG TACGACAGGA TTATCGGCAT GGAGCAGCAG
ATCATGGGCT TGTACAAGTA CAGGATGCTG CCCAAAAACC GCATGTTCGG CCGTGTGATC
TGGAACGGGT TCATGCACGC GGACGGCACG GGGGCCGCCT TTCATAACGG CACCATGAAG
GAGGTCGGCA ATCCGGACAG GATTCCCGGC AGCGCCTGGG GGATTGCCCA TGAGTTCGGC
CACGTCAACC AGGTGCGCCC CGCCATGAAA TGGGTCTCCA CCGGAGAAGT GACCAATAAC
ATTTACTCCG CTTACGTGAA TTACATGCTG AATCCCTCCA GCATGCGCCT GGAGCATGAA
CGCATCAACG GCGGGGACGG CAACATGATT GGCGGACGCT TCAACGCCTA TCTGAACAAC
GGCATTTTGA AGGGGGAAAA CTGGCTTGTC CAGAGCGGGC CGGACAAGCG TTCCGGCGGG
GATAACCGTC CGATGGTGCA CGACCATTTT GTCAAGCTGG CCCCCCTGTG GCAGCTTGAG
CTGTATTTCA AGGTAGCCGG GAAGGGTAAT CCCGACTTTT ATCCGGATAT CTTTTATAAG
GCCATCAAGA TGGATACCAG GGGCAAGAAG GACGGCGAGC TCCAGCTTGC GTTCATGAAG
AACGCCTGCG ACGCCGCCAG ACAGGACCTG ACGGATTTTT TCCGGAAAAC GGGCATGCTC
AAGCCCATTG ACCAGGAATT GGACGACTAC ACCTGCGCCC GCATGACCAT CACGGAGGCC
GACTGCAAGA ACCTGATTGC CTACGCCAGG AAGTACAAAA AGCCGGAAAG CCCCGTCATT
TACTATATTT CCGTCAACAG CGCGGAAGCA TATAAGAACC GACTGCCTGT CCGCGGCGTT
TACAACCAGG GCGTTACGGA GCAGGGCAAC AGGCGCGTTG TTTCCCATGA CGTGTGGAAA
AACGCCGTCG TCTTTGAAAC ATACAAAGAC AGGGAAATGG TCCGCATCAC CATGGTGGGG
ACAGATTCCA GGGATAATTC CTCCACCACG GTTCCCTATC CGGAAGGTTC TACGCGGATT
GAAGCCGTTT CCTGGGACGG CAGGCGGACG CTGGTGTACG GCAAACGCCC TGCCAAATGA
 
Protein sequence
MFCERHYTAP VKGDGQFFSL GLSVPCHTKG LFPGHGRGRA FFLETIPETE QYYLTNMIFS 
PLLKKGVLLL ALCGVLGGGV LSAQEAGGAP ITEEGLKALD SNVFANEYML ALKPNVTRDR
IARIKDPFVR GLATRMAEKK FDRKARALTA EPYEPVKDLA ERLRTSQYSK FENPTGIFFE
EGEDVLLVMG DPKGEKLNLV IHNFGRDGGH SSYPLKEGVN IIRAKNKGLG YIEYFTPNYK
KAPKVHLSIL SGKVNGVFVG GVSKNSDWKK MLENSPTEVV DIVGSRVHLV YPVEELKQFC
PDKGEELIAL YDRIIGMEQQ IMGLYKYRML PKNRMFGRVI WNGFMHADGT GAAFHNGTMK
EVGNPDRIPG SAWGIAHEFG HVNQVRPAMK WVSTGEVTNN IYSAYVNYML NPSSMRLEHE
RINGGDGNMI GGRFNAYLNN GILKGENWLV QSGPDKRSGG DNRPMVHDHF VKLAPLWQLE
LYFKVAGKGN PDFYPDIFYK AIKMDTRGKK DGELQLAFMK NACDAARQDL TDFFRKTGML
KPIDQELDDY TCARMTITEA DCKNLIAYAR KYKKPESPVI YYISVNSAEA YKNRLPVRGV
YNQGVTEQGN RRVVSHDVWK NAVVFETYKD REMVRITMVG TDSRDNSSTT VPYPEGSTRI
EAVSWDGRRT LVYGKRPAK