Gene Amuc_2113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2113 
Symbol 
ID6275495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2574808 
End bp2575917 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content59% 
IMG OID642614175 
Productoxygen-independent coproporphyrinogen III oxidase 
Protein accessionYP_001878703 
Protein GI187736591 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.512598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.05877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACCTCT ACGTCCACAT TCCTTTCTGC CACCGTATTT GCCCGTACTG CGCTTTCTTC 
AAGCACACGC CGGCCTCCAC GGACATGAAA TCATTCATCC GTGCTCTGGG CAGGGAGGCG
GAATCCCGCG CAGCCGCTCT GGCCACAAAC CGCGGAGGGG AAACGGCTAC GCTTTATTTT
GGCGGGGGTA CTCCCTCCAT GCTCTCGGAC ACGCATCTGG GGCATTTTAT GGAAACCCTG
GATCGTCTCG TGCCCGTGGA CAAACTGGAC GAATTCTCCT TTGAGGCCAA CCCCGCCACC
TTTACGGAAA AAAAAGTGCG CTTTTGGCGC AGCCTGGGCA TGACACGTGT CTCCCTGGGC
GTGCAGTCCC TGGATTCCGG CATCCTGCAT CTGCTGGGGC GCGAACATAC CCCGGCCCAG
GCTCTGCACT CCGTGGAAAT GCTGAAAAAT GCGGGAATGC CCCATATCAA CATGGATCTC
ATGTTCGCCA TTCCGGGGCA AACCCTCTCC ATATGGGAAG CCACCCTGAA GGAAGCTGTC
CGCGCCGGAA CGGACCATAT CTCCGCCTAC AACCTCACCT ATGAAGAAGA CACGGAATTC
TTCCGGAGCC TGCTGAGGGG AGAGAAAAGG CAGGATCCGG ACGAAGACGC CGCCTTTTTT
GAACTGGCGG AACACATGCT GGAGGCGGCA GGCTTGCGCC ACTATGAAAC CTCCAACTAC
GCCCGGGAGG GCTGCATCTC CCCCCACAAC ATGGCCTACT GGAAGGGAGA GGACTATGTG
GGCATAGGGC CTGGCGCGGT CAGCACCATC AACGGCATAC GGTACTCCAA CACGCGGGAT
ACGGACGCCT ACATACGCAG CACGCTGGAA AACGGCCTCC CCCTTTCCGA ACAGGAACCC
GTCACCGAGG AAGACTACCG CTTGGAACGC ATTGCCCTGA TGCTCCGGAC GGATGAAGGA
TTGCCGCTGA AATACATTCT GCCGGAATCC CGTCCTCTGC TGGAACAATA CCGGGAACTG
GGCTTGGCGG ACATTTCCCC GGAACAAAGG TTCATCCTGA AAGGCCGCGG ACGCCTGCTC
GTGGACGCCA TCGCAGCGGA ACTGTGCTGA
 
Protein sequence
MHLYVHIPFC HRICPYCAFF KHTPASTDMK SFIRALGREA ESRAAALATN RGGETATLYF 
GGGTPSMLSD THLGHFMETL DRLVPVDKLD EFSFEANPAT FTEKKVRFWR SLGMTRVSLG
VQSLDSGILH LLGREHTPAQ ALHSVEMLKN AGMPHINMDL MFAIPGQTLS IWEATLKEAV
RAGTDHISAY NLTYEEDTEF FRSLLRGEKR QDPDEDAAFF ELAEHMLEAA GLRHYETSNY
AREGCISPHN MAYWKGEDYV GIGPGAVSTI NGIRYSNTRD TDAYIRSTLE NGLPLSEQEP
VTEEDYRLER IALMLRTDEG LPLKYILPES RPLLEQYREL GLADISPEQR FILKGRGRLL
VDAIAAELC