Gene Amuc_0719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0719 
Symbol 
ID6273858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp847992 
End bp849104 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content55% 
IMG OID642612771 
Productprotein of unknown function UPF0118 
Protein accessionYP_001877337 
Protein GI187735225 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000137163 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTTC AGAAAGAATC CGGAAAAGAG GGCCTGAAAA TTCTGCTCAT GCTCGCCAGC 
GTCATTATCA TCACGGCCGG GCTGCAGGCG GGAAAACCGG TGCTCCTGCC CATCGTTCTA
TCCGGCTTTC TGGCAATCGT CAGTTATCCG CTGACGACTT TTTTCAAGAG CCGTCTTTGC
TTCCCGCACT GGCTGGCAGT GACTTTCACG GTCATCATGG ACTTTGGCAT TCTGGTGGGC
CTGGGCTATC TGGCCCAATA CCTGGGGCAG GATCTGGCCA AAACGGTCAC GGTCAAATAC
CAGCCTCTTA TGATGGAGAA AATCCATGAA CTCCGCGCTT TCCTGATTGA ACAGGACTGG
AACAACCTGG CTGACCAGAT GCTTCAGGAA CTTCCGGACC TGCTCAACGG CCAGCGCATC
GTGGCGTTTT CCACAGGGGT GATGGGGCAG TTAGCTTCCA TGCTGACCTT CACCACCCTG
ATTCTGATCC TGATGACTTT CTTCCTGGGG GAAGCCCCCC GCTTCCGGGC GAACATCAAT
AAACTGGGGC ATAACAGCGA CACAGGCATC CGCAAATTCT CCAAGGCCCT GGCCGGAGTT
CAGAAATATC TCATTATTAA AACCTTCATC AGCGCAGTTA CAGGGCTTCT GGCTTTCCTG
CTTTGCTATT ACATGAACGT GGACTTTCCG CTGTTGTGGG GCATCGTGGC TTTCGCCCTC
AACTTCATTC CCACCTTCGG CTCCATCATC GCGGCTATTC CCCCCACGCT TCTCGCCATG
CTTCTGATCA GCCCGACTGC GGGCATCATT GTTGCCGGCG GCTACCTGGT GATTAACACA
GCCCTGGGAA ACTGCCTGGA ACCCATGCTG TTGGGACGAC AATTCGGCAT TGTGACCAGT
ATGGTTCTGC TCTCCGTCAT CTTCTGGGGC TGGGTATGGG GCCCCATCGG CATGCTGCTG
GCCGTACCCA TTACCATGTT GATTAAACTC GGGCTGGAAA GCTCCAAGGA TCTCGCCTGG
ATCGCCCAGC TCATTGACAA CCCTCCCACT CCCAGATTCC CTCTCCCCCC TCTCCATTCC
GGGAAAACCA ACGAAAGCAC AACCAAGGAA TAA
 
Protein sequence
MTFQKESGKE GLKILLMLAS VIIITAGLQA GKPVLLPIVL SGFLAIVSYP LTTFFKSRLC 
FPHWLAVTFT VIMDFGILVG LGYLAQYLGQ DLAKTVTVKY QPLMMEKIHE LRAFLIEQDW
NNLADQMLQE LPDLLNGQRI VAFSTGVMGQ LASMLTFTTL ILILMTFFLG EAPRFRANIN
KLGHNSDTGI RKFSKALAGV QKYLIIKTFI SAVTGLLAFL LCYYMNVDFP LLWGIVAFAL
NFIPTFGSII AAIPPTLLAM LLISPTAGII VAGGYLVINT ALGNCLEPML LGRQFGIVTS
MVLLSVIFWG WVWGPIGMLL AVPITMLIKL GLESSKDLAW IAQLIDNPPT PRFPLPPLHS
GKTNESTTKE