Gene Amuc_0744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0744 
Symbol 
ID6275534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp879379 
End bp880446 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content57% 
IMG OID642612795 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001877361 
Protein GI187735249 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.819882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTGGT TCAAAACAGA CGACATACGC ATTCAAGACA TTGAGCCCCT GATTTCCCCC 
GCCATTTTAA TCAAGGATTA TCCGGCAACC ACCGAAATCG CCAAAATGGT GGCTACGACC
CGCAAGAATG CGGAGAACAT CATTTCCGGC CACGACGACC GCCTGCTGGT GGTGGTGGGG
CCATGCTCCA TCCATGATCC CCAGGCTGCC GTGGACTATG CCTCCCGCCT GAAGGAACAA
ATGGCGCGCT TTGAAAAGGA TCTGGTGATC ATCATGCGCG TGTATTTTGA AAAGCCCCGC
ACCACCGTTG GCTGGAAGGG CCTCATCAAC GACCCGTTCA TGAACCATAC CTTTGACATC
AACCGCGGCC TCCATATGGC CCGCGGGCTG CTGCTGCGCC TGGGGGATAT GGGAGTGCCC
GCAGCTACCG AGTTCCTGGA CACCATCACG CCGCAGTACA TTGCGGACCT GATCACGTGG
GGCGCCATCG GCGCGCGCAC CACGGAAAGC CAGGTACACC GTGAACTGGC TTCCGGGCTT
TCCATGCCTG TGGGATTCAA GAATGGCACC AGCGGCAGCC TGCAAATCGC TGTAGACGCC
ATTGTTTCCT CCTCCTGTCC GCACTGCTTC CTTTCCGTGA CCAAGCAGGG AGTTTCCGCC
ATTGTTTCCA CTACGGGCAA TAAATCCTGC CACCTGATCC TGCGCGGTTC CTCCCTGGGG
CCGAACTTTG ATGAAGATCA TGTAAAAGAA GCGGAAGAAG CCTTGCAGAA GGCCGGCATC
AACAACCGCA TCATGATAGA CTGTTCCCAC GGAAACAGTT GCAAGGATTA TCGCAAACAG
CCGGCTGTGG CCGCCAATAT CGCGGAACAG ATATCCAGCG GGTCCGAACA GGTTGTTGCC
GTGATGATTG AAAGCAACAT TGTGGAAGGG GCCCAGCCGT TGAGTTCCGA CCTGGTGTAC
GGCAAGAGCA TCACGGACCA GTGCATTGGG TGGGAGACGA CAGTGGAAGT GCTGGAAACC
CTTGCCGCCG CTGTCCGCAA ACGCCGTGCC AAACGGCAGG AAGCGTAA
 
Protein sequence
MNWFKTDDIR IQDIEPLISP AILIKDYPAT TEIAKMVATT RKNAENIISG HDDRLLVVVG 
PCSIHDPQAA VDYASRLKEQ MARFEKDLVI IMRVYFEKPR TTVGWKGLIN DPFMNHTFDI
NRGLHMARGL LLRLGDMGVP AATEFLDTIT PQYIADLITW GAIGARTTES QVHRELASGL
SMPVGFKNGT SGSLQIAVDA IVSSSCPHCF LSVTKQGVSA IVSTTGNKSC HLILRGSSLG
PNFDEDHVKE AEEALQKAGI NNRIMIDCSH GNSCKDYRKQ PAVAANIAEQ ISSGSEQVVA
VMIESNIVEG AQPLSSDLVY GKSITDQCIG WETTVEVLET LAAAVRKRRA KRQEA