Gene Amuc_1714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1714 
Symbol 
ID6275758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2082977 
End bp2084275 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content61% 
IMG OID642613777 
ProductHistidinol dehydrogenase 
Protein accessionYP_001878313 
Protein GI187736201 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000013465 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.000000000040894 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAATTT ACCGTCCTTC CGATACCTGC TTTGACGAGA TGAAGCGCCG GATGAACCGC 
CGCGCACTCC CGGAGGATTC CGTAAGGGAT ACCGTGAACG CTATTATCCG GGATGTTTCC
GTGCGCGGAG ATGAAGCCCT GTTTGATTAT GCCGCCAGAT TTGACAAGGC GCATCTGGAT
TCTTCCTCCC TGTTCGTAAC GGAGGCAGAA CTGGCGGAAG CGGAGGCCAT GGTGGAGGAG
TCTGTGAAGG AGGCCATTGC CGTTTCTCTG GCAAACATCC ATTATTTTTC AGACCGCAGC
CGCAGGCGGG ACTGGTCCGG CGTGAATGCG CAGGGCGTGG AGGTAGCGGA GCGGTTCCTT
CCGTACGACC GTGTGGGCAT TTATATCCCC GGAGGAAAGG CTCCCCTGGT ATCTACTTCC
ATCATGACGG GCGGTTTTGC TCAGGCCGCC GGCGTGCGGG AGATTGTGGC CGCCACGCCC
TGCGGGCCGG ACGGCCGGGT GAATCCCGCG CTTTTGTATG CGCTGAAAGC CTCCGGCGCA
ACGGAGATTG TCAAGATAGG GGGAGCCCAG GCGATAGCCG CCCTTGCGCT GGGGACGGAG
AGCGTGAGAC CGGTGGAGAA GATTTTTGGC CCCGGCAACC GTTTTGTGGT GGAGGCCAAG
CGTCAGTTGG TGGGGGCCGT TGCCATTGAT TTGCTGCCCG GCCCCAGTGA AGTAATGGTG
CTGGCAGATG ATACTGCGGA TGCGGAGTTC CTTGCTGCCG ACCTGCTGGC GCAGGGCGAG
CATGGCCCGG ACAGCGTAGT TGTTTTTGTC ACCACATCGA AAGCGTTATT GGAGCAGGTG
GAGGCGGAAG TGGAACGCCA GGCCGCCCTG CTGAGCCGCG GGTCCATCAT CCGGGAGGTG
CTGGACAAGC ATGCCTACGG TTTTCTGGTT TCTTCCATTC AGGAAGGGGT GGAACTGGTT
AATGCTTTTG CGCCGGAACA TTTGGTGCTC GTCACGAGGG ATGAAGAAGC CGTGCTGAAT
GGCATCAGGA CGGCGGGAGC CATTTACGCA GGCTCCCTTT CTACTGTAGC CTGCGGGGAT
TTTCTGGCGG GTCCCAGCCA TACGCTGCCT ACCGGCGGCG CCGGCAAGTC TTTTTCCGGC
TTGCGGGCGG ATCAGTTCCA GCGCCGCACC AGCGTGGTGC GCATGGACCG GAATGCCGTG
CTGAACTCCG CCCCGTATGT GGCAGAGTTC GCCCGGGTGG AGGGGCTGGA CGCCCACAAC
CACTCCATTC AGGTTCGCGC CGCCCGTGTG GACCGGTAA
 
Protein sequence
MKIYRPSDTC FDEMKRRMNR RALPEDSVRD TVNAIIRDVS VRGDEALFDY AARFDKAHLD 
SSSLFVTEAE LAEAEAMVEE SVKEAIAVSL ANIHYFSDRS RRRDWSGVNA QGVEVAERFL
PYDRVGIYIP GGKAPLVSTS IMTGGFAQAA GVREIVAATP CGPDGRVNPA LLYALKASGA
TEIVKIGGAQ AIAALALGTE SVRPVEKIFG PGNRFVVEAK RQLVGAVAID LLPGPSEVMV
LADDTADAEF LAADLLAQGE HGPDSVVVFV TTSKALLEQV EAEVERQAAL LSRGSIIREV
LDKHAYGFLV SSIQEGVELV NAFAPEHLVL VTRDEEAVLN GIRTAGAIYA GSLSTVACGD
FLAGPSHTLP TGGAGKSFSG LRADQFQRRT SVVRMDRNAV LNSAPYVAEF ARVEGLDAHN
HSIQVRAARV DR