Gene Amuc_1072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1072 
Symbol 
ID6274036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1279869 
End bp1281134 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content52% 
IMG OID642613123 
Producthypothetical protein 
Protein accessionYP_001877679 
Protein GI187735567 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.00830956 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGAAAT ATGCTGTGAT CGGAGCCGGG GTTTCCGGAT TGTCCATGGC CGGAATGTTG 
CTGAAGAAAG GGCATGAGGT AGTCGTTTAT GAGAGGGACT CCCGGCCCGG CGGCCTGATT
AAATGCACGG AAGTACAGGG GAACCTGTAT CATCGTGTGG GAGGCCATGT GTTCAATTCC
CGGCGGCAGG AAGTGCTGGA CTGGTTTTGG TCCCGGTTTG ACAGGGAGCG CGATTTTGTT
TCCGCCCGGA GACGGGCTGT TATTTCCCTG GAGGGCGGGG CTGTGGTGGA TTATCCCATT
GAGAACCACC TCGACCAGTT TCCCGAGGCC GTCCGTTCCT CCATCGTCCA TGAGCTGCTG
GAACTTTACA GGAATCCTCC CGCGGAGCCC CGCTCCCTGG GTGAGTTTTT TCTGAACCGT
TTCGGAAAAA CCCTGAACAG CCTTTATTTT ACGCCGTACA ATAACAAAGT GTGGAGGCAG
GATATCAGCC AAATTGCCAT GGATTGGCTG GAAGACAAGC TTCCCATGCC GAGCGTGGCG
GAAATCCTGT TGAACAATAT TGGGCACATC AATGAAAGCG CCATGGTGCA CAGTTCTTTC
TTTTATGCGA AAAACGGCGG TTCCCAGTTT CTGGCGGATA CGCTGGCTCG CGGCCTCAAG
GTCAGGTATC GGCAGGAAGC TGTAAATATC CTCCCGAAGG ACGGGAAATG GCTCGTACAG
GGAGAATTGT TTGACAGAGT TGTCTTTACG GGGAATGTCA GGCAGCTTGG GGATTGTTTT
CCCTGCATGG ATGAATTGCG GCCGTTTTTC CCCCGGATTT CAGAATTGCG CTCTCACGGA
ACCACTTCCG TGCTTTGCCG GATTTCCCCC AATGATTACA GCTGGATTTA CATGCCCTCT
CCATCCCACC GCTCTCACCG GATCATTTGC ACGGGGAATT TTTCCAGAAA TAATAATAAC
GGGGACATCA CCACCGCGAC TATTGAATTT TCCGAGCAAA TGATGGAAGG GGAAATCAGG
CGCCAGCTGG AGCTTATTCC CTTTTCCCCT GTGTATCTGG CCCATCACTG GGAAGAATAT
ACCTATCCCG TTCAGGATGT TTCCAGCCGA ACCCTCATAC GGGAACTAAA GGAGTGCCTG
GAACCGAAAG GCATTTATTT ATTAGGCCGT TTTGCGGAGT GGGAATACTA CAATATGGAT
GCGGCCATGG GGGCGGCCCT TGATTTGGAT AAAAGGCTGG CTGCGGAGCA GATGACACGG
GGATAA
 
Protein sequence
MMKYAVIGAG VSGLSMAGML LKKGHEVVVY ERDSRPGGLI KCTEVQGNLY HRVGGHVFNS 
RRQEVLDWFW SRFDRERDFV SARRRAVISL EGGAVVDYPI ENHLDQFPEA VRSSIVHELL
ELYRNPPAEP RSLGEFFLNR FGKTLNSLYF TPYNNKVWRQ DISQIAMDWL EDKLPMPSVA
EILLNNIGHI NESAMVHSSF FYAKNGGSQF LADTLARGLK VRYRQEAVNI LPKDGKWLVQ
GELFDRVVFT GNVRQLGDCF PCMDELRPFF PRISELRSHG TTSVLCRISP NDYSWIYMPS
PSHRSHRIIC TGNFSRNNNN GDITTATIEF SEQMMEGEIR RQLELIPFSP VYLAHHWEEY
TYPVQDVSSR TLIRELKECL EPKGIYLLGR FAEWEYYNMD AAMGAALDLD KRLAAEQMTR
G