Gene Amuc_2027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2027 
Symbol 
ID6275445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2460342 
End bp2461343 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content60% 
IMG OID642614087 
Productdihydroorotase 
Protein accessionYP_001878618 
Protein GI187736506 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0418] Dihydroorotase 
TIGRFAM ID[TIGR00856] dihydroorotase, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.000191556 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTCTGG AACTGCACTC CCCGCTGGAC ATGCATCTTC ACCTGAGGGA CGGCGATATG 
ATGAAGCTGG TCGCTCCGTT AAGTTCCGCC TCTTTTGCCG GGGCGGTCAT CATGCCCAAC
CTGGTGCCTC CGGTGGCTGA TGCGGGTGCG GTGCAGGCTT ACCGGCAGCG GGTGCTGGAC
GCTTGCGGAG ACGATGTATT CCAGCCGTAC ATGACGGCAT TTTTTCGCTC CTATTCAGAA
AAGGAATTGT CCCGGCTCAA GGAACTGGTG TTCGGCATCA AGCTGTACCC GGCAGGAGCC
ACCACGAACA GCGAGGGCGG CGTGAAGGCC ATGAAGGATG CGGAAGCTAC CCTGTCCATC
ATGCAGGAAA TGGATATTCC TTTGCTGGTG CATGGCGAAA GCCACGGCTT CGTGATGGAC
CGGGAGGCCG AATTCCTGGA TGTTTACCGT GATTTGGCTA CGCGCTTCCC CCGGCTGACT
ATCTGCATGG AACATATTAC CACGGCCGCC GCCGTGCAGC TGCTGGACGA ATTTGAAAAC
CTGGCCGCCA CGGTAACCCT CCAGCATCTT CTCATTACTT TGGACGATGT GGCCGGTGGC
ATGCTGAGGC CGCATCTGTT CTGCAAGCCG ATCGCCAAAA GGCCGGAAGA CCGGGAAGCC
CTGTTGCAGG CTGCCCTTTC CGGGCATCCC CGCCTCATGT TCGGCAGTGA CTCCGCCCCC
CATCCCATCC ATGCCAAGGA AGCGTGCGGA TGCGCCGCCG GCGTGTTTAC CGCCCCCATC
GCTCTTCCTC GTCTGGCGGC CCTGTTTGAC GAACACGGGG CCCTGGACCG GTTGCAGGGC
TTTGTTTCCG GTCATGCCTG CGCTTTGTAC GGGTTGAATC CGCCTGCCAG GACGGTCCGT
CTGCAGCGGC GTGAAATGCT GGTGCCGGAC GCTTATGAAG GACATGGACA GAAAGTGGTG
CCGATGGATG CCGGATGCAC CATTCCTTGG AGACTGATAT GA
 
Protein sequence
MILELHSPLD MHLHLRDGDM MKLVAPLSSA SFAGAVIMPN LVPPVADAGA VQAYRQRVLD 
ACGDDVFQPY MTAFFRSYSE KELSRLKELV FGIKLYPAGA TTNSEGGVKA MKDAEATLSI
MQEMDIPLLV HGESHGFVMD REAEFLDVYR DLATRFPRLT ICMEHITTAA AVQLLDEFEN
LAATVTLQHL LITLDDVAGG MLRPHLFCKP IAKRPEDREA LLQAALSGHP RLMFGSDSAP
HPIHAKEACG CAAGVFTAPI ALPRLAALFD EHGALDRLQG FVSGHACALY GLNPPARTVR
LQRREMLVPD AYEGHGQKVV PMDAGCTIPW RLI