Gene Amuc_2095 details


Gene Information

Locus tag: Amuc_2095
Symbol:
ID: 6275642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2546550
End bp: 2547734
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 49%
IMG OID: 642614157
Product: hypothetical protein
Protein accession: YP_001878685
Protein GI: 187736573
COG category:
COG ID:
TIGRFAM ID:
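
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2546550
END_BP = 2547734


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1185
print(protein_length(length_bp))  # 394
```

Both results match the Gene Length (1185 bp) and Protein Length (394 aa) entries in this record.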


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAAC ACGCACACCT TCCAAAAGCA GGCATTATTA CAAGCCACTA CTATTTCATT 
AAAACAAACT ACGGTTCACT GCTACAAAAT TTTGCCCTCC AGCGTTACTT GGAAAAAATG
GGCCTATTTC CATTCCTGAT CAGACAGGAA GAAATCAGCC AGCCCATTTC CTTCCGGGAG
AAAATAAAGT TTTATCTTCT TCATCCTCTC CAATTGTTCC GCCGGCTTTT CCAAAAACCA
GCACGGGAAG CTGAGGAAAA AGCACAGAGG ATTGCCCGCT TCAACCGGGA GCACCCGCGC
CCCTTTGAAT CTTTCATCAG CAAACACCTT AACACCACCC CCATCACCTA TGACCGCGTT
ACATTGCGCG AGCATCCGCC GGAAGCGGAT GTTTACCTGG CGGGCAGCGA CCAAATATGG
ACCCTTGATG ATTTTGACAA ACTGCTGAAT TTTGCTCCTC CGGGAAAACG AATCGCCTAT
GCGGCCAGCG CCAATTGGGG AAAACAAAGC AAACGATGGT TTATTGAAGC CAGAAAGGAG
CTGCCTTATT TTACAGGAAT CTCCGTCAGG GAAACTGAAG GCAGGGAAAT ATGCCAAAAA
GCCGGTATGG AGCAAGTGGA AGTCGTTCTC GACCCAACCC TGTTGCTGGA TCCTTCAGAA
TACACCTCGC TAGTCACGGC ACAATCCGCC TACCTTCCTC CTGACTCCAT TCTCGGGTAT
TTCCTCAATA CGGACGCCCT TACTGAAATT TACTGGAATC AGATTCTTGA TTCCTTCAAG
GGAAATCCTC TTCGTATCAT TCCCCTGCAG GGAACGGAAC TCTGCATTCC GGAAGACAGC
ATCATCACCC CTGATCCTTA TGAATTCATC CAGGCCTTCA AGGAAGCGAA AAACATCATC
ACCAATTCCT TTCATGGTAC GGTTTTTTCC ATCATCATGC GCAAGCCGTT TCTGAGCATT
CTTCAGGCAG GAGACACGGC CATTCAAAAC ACGCGTTTCT TCTCTCTCCT GAAATCCCTG
GGGCTGGAAG ACAGGATTTA CGCGCCGGAG AGAGGTCTCA TGCGGGAACA GATGGAACAG
AGGATCCAAT GGGAAGCCGT AGAAAACAGG CTGGAACAGC TTCGCGGCCA CTCTGCCGAA
TTTCTGGAAA AGGCCATTCA ACAAAGCATT TGCCGCCATG GCTGA
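
The gene sequence is displayed in space-separated groups of ten bases, so it has to be cleaned before any downstream use. The following is a minimal sketch of that cleanup together with a GC-percentage calculation. The helper names are hypothetical, and the rounding convention behind the reported GC content is not stated in this record, so the computed figure may differ slightly from the listed value.

```python
def clean_sequence(raw):
    """Remove display spaces and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw):
    """Percentage of G and C bases in a sequence given in display format."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence above, used as a short demo input.
first_line = "ATGAACCAAC ACGCACACCT TCCAAAAGCA GGCATTATTA CAAGCCACTA CTATTTCATT"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full multi-line sequence as a single string works the same way, since `str.split()` with no arguments splits on any whitespace.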
 
Protein sequence
MNQHAHLPKA GIITSHYYFI KTNYGSLLQN FALQRYLEKM GLFPFLIRQE EISQPISFRE 
KIKFYLLHPL QLFRRLFQKP AREAEEKAQR IARFNREHPR PFESFISKHL NTTPITYDRV
TLREHPPEAD VYLAGSDQIW TLDDFDKLLN FAPPGKRIAY AASANWGKQS KRWFIEARKE
LPYFTGISVR ETEGREICQK AGMEQVEVVL DPTLLLDPSE YTSLVTAQSA YLPPDSILGY
FLNTDALTEI YWNQILDSFK GNPLRIIPLQ GTELCIPEDS IITPDPYEFI QAFKEAKNII
TNSFHGTVFS IIMRKPFLSI LQAGDTAIQN TRFFSLLKSL GLEDRIYAPE RGLMREQMEQ
RIQWEAVENR LEQLRGHSAE FLEKAIQQSI CRHG