Gene Amuc_1645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1645 
Symbol 
ID6274641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1987924 
End bp1988982 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content57% 
IMG OID642613705 
Productdihydrouridine synthase DuS 
Protein accessionYP_001878246 
Protein GI187736134 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.173532 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAGAC CCATGCTCCT GGCCCTGGCC CCCATGAAAG ATGTGACGGA TCTGGCTTTC 
CTCAACACAT TGAAGGACTT GAATTCCCTG CCGGATTATT TCATCACGGA GTATTTCCGG
ACGGTGGCCC ATCATAAAAA GATGTCGCCA TACATTCTGC GCTCCATTGA TGAAAATCCT
ACAGGCCGCC CCATTTACGG ACAGCTCGTG GGCCATGAAC CGGAATACCT GGCAAGGGAT
GCGCAGGTCC TGATGGAACA CGCCTGTGCA GGCGTGGATC TGAACATGGG CTGCCCCGCT
CCCATCGTAT GCCGGAGAAA TGCGGGCGGA GGCATGCTGC GGTCCCTGAA GGCCATGGAC
GCGGCCCTGG GAGCGCTTCG GGACGTATTG CCCCCCGGAG CCTTCACCGT CAAATGCCGC
CTGGGATACG AAACGCCGGA CGAGTTTGAA CGGATTCTTC CGGTAATTGC CTCCCATTCC
CCGGACAGGG TGTGCATCCA CGCCCGCACC GTCCGTGAGG GCTACCGCTC CCCAGTACAC
CCGGAATGGG TGAAATGGGC CGCAGGAATG CTGAAATGCC CAGTAGTCGC CAACGGAAAT
ATTGTGGATG CAGCCACAGC GGAGGCATGG GTGCGGCTGG CCCGGCCAGC TGGGCTGATG
ATTGGGCGGG CAGCCCTGCG CAATCCCTGG ATATTTTCAC AGCTTCATTC CCGCTTTCAG
GGCCATCCTG CAGCAGACCT TACCTTCCGG AACGTGCTGC ACTACATTCG CCGCCTTTAT
GAACGCACGC GGGAAATGCA GGAACATTAT GTGGAGGAAA AACACATCCA CCGCATGAAA
AAATATCTGG TTTATACCGC GCGGGGACTT CCCGACACTT TTGACCATTA CATGAAAAGG
GCGAAAACCG CCCGCGATTT CATGCGCATT TGCGAGGATA TTCTGGATAA CGACGCACCC
TTTGCCCCCA CTCCGCCGGA AGACACGCAT CTCTTTGCCC ATTTCCATAC CCTTCTTGCG
CAAGAGGAAG CTTGTCTCCC TCCCGGAATT CAGGTATGA
 
Protein sequence
MHRPMLLALA PMKDVTDLAF LNTLKDLNSL PDYFITEYFR TVAHHKKMSP YILRSIDENP 
TGRPIYGQLV GHEPEYLARD AQVLMEHACA GVDLNMGCPA PIVCRRNAGG GMLRSLKAMD
AALGALRDVL PPGAFTVKCR LGYETPDEFE RILPVIASHS PDRVCIHART VREGYRSPVH
PEWVKWAAGM LKCPVVANGN IVDAATAEAW VRLARPAGLM IGRAALRNPW IFSQLHSRFQ
GHPAADLTFR NVLHYIRRLY ERTREMQEHY VEEKHIHRMK KYLVYTARGL PDTFDHYMKR
AKTARDFMRI CEDILDNDAP FAPTPPEDTH LFAHFHTLLA QEEACLPPGI QV