Gene Amuc_1747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1747 
Symbol 
ID6274647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2126764 
End bp2128119 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content53% 
IMG OID642613810 
Productcytidyltransferase-related domain protein 
Protein accessionYP_001878346 
Protein GI187736234 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0615] Cytidylyltransferase
[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID[TIGR00125] cytidyltransferase-related domain
[TIGR01518] glycerol-3-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGG TTATTACTTA CGGAACCTTT GATTTGCTGC ACACGGGGCA CGTGAATCTG 
CTGAAGAGAG CCCGGAAACT GGGGGACCGG CTCATCGTCG GCGTGACTAC GGACAGTTAC
GACCAGAGCC GCGGCAAGCT GAATGTCATG GAGAGCCTGG AGGAACGCAT GGAAAATGTG
CGGAAAACCG GTCTGGCGGA TCTCATTATT AAGGAAGAAC TAGAAGGGCA GAAGATTCAT
GACATACGGA AGTACGGGGC GGATGTTTTT GTGATTGGCT CCGACTGGTC GGGGAAATTT
GACTATCTTC GCGATTATTG TGAGGTGGTT TACCTGGAAC GCACCAAGGG CGTTTCTTCA
ACAGACCTCC GTTCCGCCCG GAATCCCATT GTATATATGG GAATTGCAGG TCACGGGCGC
ATAGCGGGCC GTTTTCTCCG GGAGTCCAAA TATGTCAGCA ATATTGAAAT AACAGCCGTT
TTCGGAAGGA ATGAGGAGAA AGTCCGCCGT TTTGCAGAGT TCCACGCTCT GCTGGAATAT
TATACGGAAT ACGAACAGTT TCTGGACCGG GTTCATGCCG TTTATATTGC CGTCCCCCAT
CATCTCCATT ATGAAATGGC CAGGAGGGCC CTGTTGCGGG GGAAGCATGT ATTGTGCGAG
AAGCCTCTTG CCCTTGCCCG GGAGGAGGCG GAAGAGCTGT TCCGGCTGGC CGAAGAGAAA
GGAGTCGTTT TGCTGGAAGC CCTTAAAACT GCGTTTTGTC CGGCTTTCCA GCAACTGACC
AGTTTGGCGG GGAGCGGCAT TATTGGTTCC ATCAAGGCGG TGGACGCCAC GTTTACCAAG
CTGATAGAGG ATGAGGCTGC CAGGGAGTAT GACCCCATGC AGGCCGGAGG CGCCTGGACG
GAACTGGGTT CCTATCCCGC CTTTGTCATT GGGAAGCTTC TGGGAACCGA GCCCCGCAGG
ATTCGTTTTG TTACTTGCAG AAAGCCTCAT ACGGGCGTGG ACGTGTTCAC GCGCGCGGAA
TTTCTTTATT CCAATGCAGT AGCCACCGCC ACGGCAGCCA TAGGAGCCAA GCAGGAAGGG
GACTTGTGCA TTACCGGAAC GGAGGGGTAT ATTTATGTGC CGGCGCCTTG GTGGAAGACG
GAGATGTTTG AAGTGCGGTT TGAGGATGCC CGGCTCAACA GGAAATATTT TGCCAATTTT
GAAGGGGATG GGCTGCGTTA TGAACTTGGC GCGTTTTTGC GCCTGATTCA TGGCTGCCAG
CACGGCAACC GTCTTTTGAC CCGTGAGGAT TCCGTGTTCA TGGCTGATGT TGCCTCCCGT
TTCAGGAGAG GGTATTGCGT GGAAGAGATC AGTTAG
 
Protein sequence
MKTVITYGTF DLLHTGHVNL LKRARKLGDR LIVGVTTDSY DQSRGKLNVM ESLEERMENV 
RKTGLADLII KEELEGQKIH DIRKYGADVF VIGSDWSGKF DYLRDYCEVV YLERTKGVSS
TDLRSARNPI VYMGIAGHGR IAGRFLRESK YVSNIEITAV FGRNEEKVRR FAEFHALLEY
YTEYEQFLDR VHAVYIAVPH HLHYEMARRA LLRGKHVLCE KPLALAREEA EELFRLAEEK
GVVLLEALKT AFCPAFQQLT SLAGSGIIGS IKAVDATFTK LIEDEAAREY DPMQAGGAWT
ELGSYPAFVI GKLLGTEPRR IRFVTCRKPH TGVDVFTRAE FLYSNAVATA TAAIGAKQEG
DLCITGTEGY IYVPAPWWKT EMFEVRFEDA RLNRKYFANF EGDGLRYELG AFLRLIHGCQ
HGNRLLTRED SVFMADVASR FRRGYCVEEI S