Gene Amuc_0329 details


Gene Information

Locus tag: Amuc_0329
Symbol:
ID: 6275023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 387616
End bp: 388983
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 43%
IMG OID: 642612381
Product: putative transcriptional regulator
Protein accession: YP_001876950
Protein GI: 187734838
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
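
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The helper names gene_length_bp and protein_length_aa are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(387616, 388983)
print(length, protein_length_aa(length))  # prints: 1368 455
```

Both results agree with the Gene Length and Protein Length fields listed for Amuc_0329.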


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.024983
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.000000313629
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTAACGG CATCTCAAAT ACTTGACCTT GTAAAAGGAG GCGAAGGCTA CAATGTGGAC 
TTCAAACGCA GTGTGCCTTC AAAAGTAAGA GAACTATTCG AGGAAGTGGC TGGTTTTGCT
AATGCCGCCG GAGGTTATGT TCTCATTGGT GTTAGTGATG ACAACAAGAT TATAGGCGCC
GAGATAGACA ATAATAAGCG TTCAGCAATC CAAGACACCA TCGGTGAAAT TTCTCCTGCC
CTGCATTGCG TCTTTTATCC GATGGATGTT GATGGTAAGA AGATATGGGT TGTTGATGTG
CCAAGCGGTA AGGACAAACC TTACATCGCC GGAGGCGTGA TATATGTAAG AGAGGGTGCC
AACTGCCAAA AGTTGAGAAC GGCTGAAGAA ATACGGGCTT TCTTTGCCGA ATGTTCAAAG
ATATTTTATG ATGCCATTCC ATGCAAATGG TTTGGCATTG ACGAGGACAT AGACAGCCAT
AATTTCCGTT CCTTTCTTGA AAAATCTCAC CTATCTGAAG ATTTACCGAT TCGACAATTA
TTCGACAACC TGGAACTAAC AACCGATGAT GGACGAGTGA AGAATGCCGC TGCTCTTTTC
TTCGGCAAAG AGCCGGAGAG GAAGTTTGCT CATGCTGTCG TAAGATGTTT GAGATTCAAA
GGTTTTGACA AGGTCCACAT CATAGATGAC AAGACTTTTG GAGGACCACT ATATCAGCAG
TATCTCAATA CTCTCTCGTG GATTGAGAGC AAACTTGAAG TAGAGTACAT CATCGAAGGT
ACGGGGCCAC GAAAAGAAAT ATGGGAGATA CCGCTGGATG TTTTCAAGGA GTCGGTTATG
AACGCCATCT GTCATCGCGA CCTGTATGAA GAGGGTGCGA CAGTGATGGT TGAAGTATAT
GATGACCGGG TTGAGATTTC CAATCCCGGA GGTCTTCTTC CCATTGTTGC AGAAAACTTC
GGAGAAAAGA GTATGAGCCG GAATCCACTT ATTTTCGGAC TATTCACAAG AATGCAGCTT
GTTGAGAAAG TAGGCTCCGG CATTCCCCGT ATGCGCCGCC TTATGAAAGA AGCTGGTCTG
CCTGAACCCG AATTTGACAA CAAGGGCTTC TTTACCGTTA CTTTTATGAA GCGAACCAAG
TCCCTAAGAA CCACCGATGA TAAACTAAAT GATAGATTAA ATGATAGGAT AAATTCACGA
GAGAAACAAG TTCTTCTGAT TTTATCTAAA ACTCCTGGTC TAAGGACCAA TGAGTTGTCT
ACAATGATAG AAGTCTGTGT CCCTACGTTG TCTCGCACAT TGAAGAATCT AATCAGTCTA
GGCTTGATAG AATATCGTGG AGCAAAAAAA ACGGAGGATA TTATATAA
 
Protein sequence
MLTASQILDL VKGGEGYNVD FKRSVPSKVR ELFEEVAGFA NAAGGYVLIG VSDDNKIIGA 
EIDNNKRSAI QDTIGEISPA LHCVFYPMDV DGKKIWVVDV PSGKDKPYIA GGVIYVREGA
NCQKLRTAEE IRAFFAECSK IFYDAIPCKW FGIDEDIDSH NFRSFLEKSH LSEDLPIRQL
FDNLELTTDD GRVKNAAALF FGKEPERKFA HAVVRCLRFK GFDKVHIIDD KTFGGPLYQQ
YLNTLSWIES KLEVEYIIEG TGPRKEIWEI PLDVFKESVM NAICHRDLYE EGATVMVEVY
DDRVEISNPG GLLPIVAENF GEKSMSRNPL IFGLFTRMQL VEKVGSGIPR MRRLMKEAGL
PEPEFDNKGF FTVTFMKRTK SLRTTDDKLN DRLNDRINSR EKQVLLILSK TPGLRTNELS
TMIEVCVPTL SRTLKNLISL GLIEYRGAKK TEDII
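
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one possible way to do that and to compute a GC percentage comparable to the GC content field. It is an illustration only: the helper names clean_sequence and gc_percent are hypothetical, and it assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record, as printed.
first_line = "ATGTTAACGG CATCTCAAAT ACTTGACCTT GTAAAAGGAG GCGAAGGCTA CAATGTGGAC"
print(len(clean_sequence(first_line)))  # prints: 60
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of the first line gives a value that can be compared against the GC content field of the record.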