Gene Amuc_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0120 
Symbol 
ID6274912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp148008 
End bp149318 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content56% 
IMG OID642612165 
Productprotein of unknown function UPF0118 
Protein accessionYP_001876746 
Protein GI187734634 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACG AACAACATCA AAATACCCAC ACTTCCGCCG CCCCCGCCAT TCCCTCCCCC 
TTCCAGAAGA AAACCTGCTG GCATGCCCTG ACAGGCGTAT CCATTCTGGT GATGCTGGGC
ATTGCGGCCT TCGTCATTTT TGAAGTAGTG GAACTTCTGG GGTTTCTGGA ACCGGTGCTC
CTGCCCATCC TGATTGCCGC CGTCGTGGCG TATTTGCTGG AACCCATCGT TTCCTGGCTG
GTGCGTCTCA AATTCTCCCG TCCATGGGCC GTAGTCACGG TCATGTTTGC GGCTCTGGCC
GTTCTGGTAG GCTTTGGAGC CACGATTCTC CCTCCCCTGA TCAGGCAGAC GGATGAACTG
ATCGACAACC GGATGGAGCT ATGGGACAAA ACCTCCGAAC TGATTGACTC CACCATTGAA
ATCCCCTTCA TTTCCCGCAC CATTGACAGT GTTTACAGCA CCAGCCTGCG GGAACTGAAT
GCCAGCCATT ATACGGAAGC GGAAGTCCAT GACCTGAGAA ACGCCCGGAC CGCCCGGGAA
AAGCTGGGAG CCTACATGAC CATCAATTCC TCCTTTTACC AGGACAAGCT GATGAGCTGG
CTCACTTCCG GGGGACGGGC CCTGTACAGC ACCATAGGAA TCATGGTCAG CATCCTGATC
ACGCCCATTT TCGCTTTTTA CTTTCTGCTG GAGGCGGATA AAATCAAAGA GAAATGGCCC
AGCATCCTGC CCCTGAAAGT CTCCAAATTC AGAAAAGACG TGGTGGACAC CATGGAGGAA
ATCAACGGCT ACCTGATTTC CTTCTTCCGC GGCCAAATGC TGGTAAGCAT CATTGAAGGC
ATTCTGATTG CCATCTGCCT GAAACTGATA GGCCTGCCGT ACGCCATCAC CATCGGCGCC
GCGGTCTGCG TGCTGGGCAT CGTTCCCTAC CTGGGCATCA TCACCGCCTT TATCCCTGCG
GTGCTGCTGG CCTGGTTCAC ATGGGGGGAT TTCCAGCACG TGCTGATTGT TTCGGGCATC
TTCCTGGCCG TCAACCAATT TGACGGATGG ATCATCCAGC CGAAAATCGT GGGGGATTCC
GTGGAACTCC ACCCGCTCAC GGTCATGTTT TCCGTGTTGA TCTGGACACT CATCCTGGGC
GGCTTGATCG GCGCCCTGCT GGCTGTCCCC CTGACAGCCG CCATCAAGGT CCTTTACAAG
CGGTACATCT GGCAAAATGC CAGCATGCGC CCCATGACGG ACCCCGTGCT TCCGCCGGAA
CATCCCGGCG AACAGCCTCC GGACCCCCCC GAAGGTTCGG CCCACGCTTA A
 
Protein sequence
MNNEQHQNTH TSAAPAIPSP FQKKTCWHAL TGVSILVMLG IAAFVIFEVV ELLGFLEPVL 
LPILIAAVVA YLLEPIVSWL VRLKFSRPWA VVTVMFAALA VLVGFGATIL PPLIRQTDEL
IDNRMELWDK TSELIDSTIE IPFISRTIDS VYSTSLRELN ASHYTEAEVH DLRNARTARE
KLGAYMTINS SFYQDKLMSW LTSGGRALYS TIGIMVSILI TPIFAFYFLL EADKIKEKWP
SILPLKVSKF RKDVVDTMEE INGYLISFFR GQMLVSIIEG ILIAICLKLI GLPYAITIGA
AVCVLGIVPY LGIITAFIPA VLLAWFTWGD FQHVLIVSGI FLAVNQFDGW IIQPKIVGDS
VELHPLTVMF SVLIWTLILG GLIGALLAVP LTAAIKVLYK RYIWQNASMR PMTDPVLPPE
HPGEQPPDPP EGSAHA