Gene Amuc_0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0201 
Symbol 
ID6275339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp250515 
End bp252080 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content60% 
IMG OID642612247 
Productcarboxyl transferase 
Protein accessionYP_001876826 
Protein GI187734714 
COG category[I] Lipid transport and metabolism 
COG ID[COG4799] Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0217258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.713971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATTG ATCCCAAATT GATTGACGAT CTGAACAGCC GCCGCAAGAA GGTCATCCTC 
AGCGGCGGTC AGGAAAAGAT CGACAAGAGA CATGAAAAGG GCGAAATGAC AGCCCGCGAC
CGCATGGGGT ACCTCTTTGA GGAAGGCACC TTCTCGGAAA TCGGCATGCA TGTGCGCCAC
AACTGCCACA ACTTCGGCAT GGGTAAGAAG GAAATCCCCG GCGACGGCGT CGTTTCCGGT
TTCGGCCTGG TGGACGGCCG CCCGGTAGCC TGCGCGGCCT CCGATTTCCT GGCCCAGGGC
GGTTCCCTCG GCTACATGCA CGCCATGAAA ATTGCGGATG CTCAGAAGTA TGCGCTGAAA
GCCGGCATCC CGATGGTGAC CGTGAACGAC TCCGGCGGCG CGCGCCTCCA GGAAGGAGTG
GCGGCCCTTT CCGGATACGC CCAGGTATTT TACAACAATG TGCTTGCCTC CGGCGTGGTT
CCGCAAATCT CCATGATCCT GGGCCCCTGC GCGGGCGGCG CGGCCTACTC CCCCGCCCTG
ACGGACTTCA TCATCATGCG CAATTCCGGC AACGCGGGCA TGTACATCAC CGGTCCCAAA
GTGATTGAAC AGGTCACGTA TGAAAAATGC ACGATGGACG ACATCGGCTC CGCCGCCATT
CACGCCACCG TGTCCGGCAA CGTCCATTTC GTAGCGGACA GCGACGCGCA CGCCATGGAC
ATCCTGAAGA GGCTTCTTTC CTTCCTTCCT TCCAACAACA CGGAAGAACC GCCCCACAAA
CTGAACACTC CGCTGGACCT GAGCGCTGAC GAAGGCATGA GCGACCTGAT TCCCGGCGAC
AACCGCACGC CGCTGGACGT CCAGCCCATC ATCAGCCGTC TGGTGGACAA CGGGGACTTC
CTGGAAGTAC ACAAGGACTT CGCCAAAAAT GTCGTCGTCG GATTCGGCCG CATCTGCGGC
GTGGTGGTCG GCATCATCGC CAACCAGCCC AATGTGAAAG CCGGCTGCCT GGATATCGAC
TCCTCCGACA AGGCCGCACG CTTCATCCGT TTCTGCAACG CGTTCAATAT TCCGTTGGTG
AACCTGGTGG ACGTGCCCGG CTTCCTGCCC GGCAAGAACC AGGAACGGGG CGGCATTATC
CGCCACGGCG CCAAACTTAT CTTCGCCTAT TCCCAGGCCA CGGTGCCCAA AGTCACCCTG
ATCATGCGGA AAGCCTACGG CGGCGCCTAC ATCGCCATGT GCTGCAAGGA CCTGGGCGCT
GACGCCGTCT TCGCCTGGCC TGGTGCGGAA ATTGCCGTTA TGGGTGCGGA AGGCGCCGTC
CCGGTACTCT ACGGCCGCGA ACTGAAGGCT GTGGAAGACC CGGCGGAAAA AGCCAAGCGC
CAGGGCGAAC TTCTGGAAGA ATACCGGGAA GCCTTTTACA ACCCGTATGT GGCGGCCGGC
ATGGGGCAGA TTACGGAAGT CATCAATCCG GAAGAAACCC GCGCCAAAAT CGCCTTCGCC
CTGCGCACCC TGCTGAACAA GAAGGAAGTG CGCCCGGCCA AGAAACACGG CAACATTCCG
CTCTAA
 
Protein sequence
MAIDPKLIDD LNSRRKKVIL SGGQEKIDKR HEKGEMTARD RMGYLFEEGT FSEIGMHVRH 
NCHNFGMGKK EIPGDGVVSG FGLVDGRPVA CAASDFLAQG GSLGYMHAMK IADAQKYALK
AGIPMVTVND SGGARLQEGV AALSGYAQVF YNNVLASGVV PQISMILGPC AGGAAYSPAL
TDFIIMRNSG NAGMYITGPK VIEQVTYEKC TMDDIGSAAI HATVSGNVHF VADSDAHAMD
ILKRLLSFLP SNNTEEPPHK LNTPLDLSAD EGMSDLIPGD NRTPLDVQPI ISRLVDNGDF
LEVHKDFAKN VVVGFGRICG VVVGIIANQP NVKAGCLDID SSDKAARFIR FCNAFNIPLV
NLVDVPGFLP GKNQERGGII RHGAKLIFAY SQATVPKVTL IMRKAYGGAY IAMCCKDLGA
DAVFAWPGAE IAVMGAEGAV PVLYGRELKA VEDPAEKAKR QGELLEEYRE AFYNPYVAAG
MGQITEVINP EETRAKIAFA LRTLLNKKEV RPAKKHGNIP L