Gene Amuc_0088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0088 
Symbol 
ID6275001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp114937 
End bp116889 
Gene Length1953 bp 
Protein Length650 aa 
Translation table11 
GC content59% 
IMG OID642612133 
Producthypothetical protein 
Protein accessionYP_001876714 
Protein GI187734602 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGACT CTCCCCCCAG CACCTCCCCC GCTCCGCGCA TTGCTGAAAC TGCCGCCGGG 
AAAATTCTGG CGGGCCTCCT GGCCCTTTTT TACATCTTCC TGGCAACGAC GGCGGCAGGC
GGGGAATGGC ACACCCTTTT CCTTCCCGCC TGCTTTCTGG CGGCAGCCCT CCTGCTGTTC
TGCTTCTGTA TTCTCCGGGG ATACAAGATT CCCAGCCCCG GACTGCCGGG CTGGCTGGCC
CTGGGCCTGG GAGGCGGCTA TTTCCTTGTC CGGGCATGGT TTTCCCCCTG GTTTTATTAT
GAGAGCGTAG CGGATCTGGG CCTTATCGCC ACGGCCATCG TCATGTTCGC GGCGGGAACG
TATGCCGGAG CCGGAAACGG GGAAAAGAGC ATTCTTCCCG TACTGGCTGC GGCATTGGGG
CTGCTGAATG CCCTGCTGTG GGGGTATCAG AACATAACGG GAACGGAAGC CTCCTGGTTC
AGACCGGATT ATTCCCTGTT CGGCACGGAA ATCCGCAATA TCGGATTGTT CGGATACAAA
AATTTTTCCG CCCATTTCCT GTCCGTCACC GGTTTTTTCC TTTGCGCATA CAGCATGGCT
TCCGCCAGAA AATGGGGCAT CCGCCTGTTC ACGGGATTGG CGCTCATCCT CGTTTCCTTC
ACGTGCGGCT CGCGGTCCGC CTTCCCAAAC GCCCTGGCTG GCGTAACCCT GTGCTTCTTT
ATTTATACCT CCAGCGTTTT CCGCAATAAC AGAAAATTTT ACACGGCGTC CATCCTCTTT
ATTATCCTGC TTTTCCTGGG CACGTCCTAT GCCGTGCTGG ATTTGTCCCG GGGGGCGGGC
AGGCTTGCCG CCCTTCTGGA TACGTTTTCC TTCGGCAACC GGCTGGACCT GTCCAAACTG
GCCTGGGCGC TGGCGGACCA GGCTCCGCTG TTCGGTCACG GCAGCCGGAT GTACACCAAT
CTTTCCACGG AGTTTTTCTC CGGAGCCAAC CTCCCCAATT TTGCCCACCA CGAATACGCA
CAAGCCGCCT GCGACTACGG CTATGCCGGG CTGGGGCTGA TGCTAGCTCT GCTGGCCCTG
TTTCTTATTT TCGGACTCCG GAGCGTTCTG AAATTGTCAG GAGAACATCA ACGGCCCAAT
CCTCTGGGAC CGGCGGCGTT CTGCGTATTG TGCATCGCCG CTTTCCACGC CTATGGGGAA
TTCATCTGGC ATAACCCTGC CTTGCTTGGA GCCAGCGCCC TATGCGGCGG CATTACCTGC
ACGGCCCCTC TTTCCAGGGT GAAGGCTTCG CGCCAAGCCG GCCGCTGGCT TCAAGCGACC
GCCGCACTGC TGATGGCCAT ATTGGCGCTG TGTTATGCGT TTCTGGCTTT TCCGGTCTGG
AAAAACTCCC TTCAAGCCGT CCCGGCTTCC TCCGGCAACC GGCTGCCCAT GCTGGAAGCG
GCCGCCTCCT GCAGCCTGGA CCCGGATTTG GTGCGGCGCA ACATTCTGCA TGCCGCAGGC
AGTTCTCCCC CCCCAAACCC GGCCCGGCTC AAAGCATTGG AGCATCAGGA AGAAAAAGCG
GAATTACTGA GCCCCGGAAA CCACGGCCTG ACGGCGGCTA AAAGCCTCCT TTACATCCTG
CAGGGACGCC TCACGGAAGC GGAGCAACTG CTCCGCCCAT ACGTAGAGAG TCCCGGCAGG
TTTGATGACC GAATGTTCGC CTGGACTACC ATTTACAATA ATATGCTGTA TTCCTGGAGT
ACGGCCATTG CTGCCCAGTC CCCCGGACGC GCCCTGTCCA TGGCCATGAC GGCCCAGCGC
CTGATGTCCG CCCAAACGGA CCGGTGGCTG TATTATGGGG CCCTGGATCC CGAAGTCAGG
AAAAAACACT ACTCGCGCCT CAATGAGCTC AAGATGCTCA TCATGATGCT TCAGGCCCGC
GGAACGACGC CCGACCCTTC CTGGAGGAAA TAG
 
Protein sequence
MPDSPPSTSP APRIAETAAG KILAGLLALF YIFLATTAAG GEWHTLFLPA CFLAAALLLF 
CFCILRGYKI PSPGLPGWLA LGLGGGYFLV RAWFSPWFYY ESVADLGLIA TAIVMFAAGT
YAGAGNGEKS ILPVLAAALG LLNALLWGYQ NITGTEASWF RPDYSLFGTE IRNIGLFGYK
NFSAHFLSVT GFFLCAYSMA SARKWGIRLF TGLALILVSF TCGSRSAFPN ALAGVTLCFF
IYTSSVFRNN RKFYTASILF IILLFLGTSY AVLDLSRGAG RLAALLDTFS FGNRLDLSKL
AWALADQAPL FGHGSRMYTN LSTEFFSGAN LPNFAHHEYA QAACDYGYAG LGLMLALLAL
FLIFGLRSVL KLSGEHQRPN PLGPAAFCVL CIAAFHAYGE FIWHNPALLG ASALCGGITC
TAPLSRVKAS RQAGRWLQAT AALLMAILAL CYAFLAFPVW KNSLQAVPAS SGNRLPMLEA
AASCSLDPDL VRRNILHAAG SSPPPNPARL KALEHQEEKA ELLSPGNHGL TAAKSLLYIL
QGRLTEAEQL LRPYVESPGR FDDRMFAWTT IYNNMLYSWS TAIAAQSPGR ALSMAMTAQR
LMSAQTDRWL YYGALDPEVR KKHYSRLNEL KMLIMMLQAR GTTPDPSWRK