Gene Amuc_0014 details


Gene Information

Locus tag: Amuc_0014
Symbol:
ID: 6275229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 17295
End bp: 18533
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 41%
IMG OID: 642612054
Product: hypothetical protein
Protein accession: YP_001876642
Protein GI: 187734530
COG category:
COG ID:
TIGRFAM ID:
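
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(17295, 18533)
print(length, protein_length_aa(length))  # 1239 412
```

Both results agree with the Gene Length and Protein Length fields of this record.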


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCTTA AAACAAAGGT AAAGAATGGA ATTGGCATTT TTCTTGTCGT GTTGTCCGTA 
TGTTGCGGCA GCCCTCTTTT TGCGGGGCTT CAGCCGGAAG AGGGGATGGA TTTGAGCAAA
CCCCGGATTT GGACGGCGGT GACAGGCCAT CAATTAAAGG CAGCGTATGA GGGAAGAGAT
GGCAACAAAA TCCGGTTGCT TGCAAATGGT GGAAAAATAA AAACCATTTT ACTGGAAAAA
CTTTCAGAAG CTGATCAGCG GTTATTGGAG CATCTGACGC GGGAGGAAAA CCGGGGCGGG
GAAGCTGGCT TTCCGGATTC CGGCAGGAAT GGCAAGGAGC ATCCATTTGT CGGATTTCTT
GCTGAAGCGA AACAAAAACG CCTTGATGGA GAAGAAGGTG TGGAATATTA CCAGAATCTG
ATTGATGCTC TTTATGAAGA AATGAAAAAA CATGAATTCG GGTATCCGAA AGATTTTATT
AAGGAAGATG GTTTGTTCAA TATGAAATAT TTAAAAAGCG TTCACGGTAG GAAAAAACGA
ATTTCTGAGC GCGTAAATCT TTTTACGGGA AAACATTTAT TTACTCCTTC CTATGGTAAT
GAACCTCGTT GGGAATGGGT GGATCAGAAA ATAAATTTTG TCATAGAAGG AGATCTTATC
CGACTTAAAC AGCAAGATAC TCTGTATGCA AAAAATAAGG AAAAGATGGT GGAATTCGGT
ATTCTGATTA ATAAGAAAAA GCGCAATGCT GAAAACTATA AAGCTTGGTA TACGGAAGAC
AGCAACGATT ATTTGTACCG GTTGGAAGCT TGCGTGAATT CATCTGATGA CGGCAGGTTC
GTCATAGACA GAATGAATAG AGAGGAATGG CTGCATTTAT GTGCTCACGA TACGAAGAAA
GGAATTTTCT ATACTTTAAA AATTTTGGTT GATGAAAATG GAAAAATAAC AATCAGAAAT
TCCGAAGAAG GCTCATTCCT GTTTTTTGCC ACAGTTCCCA AAAACAGGGG GTCTAAAGAT
GTTTTTGCTT ATCCGAGAAA TAATGACAGG CTTGTTTTTA ATTCAGTATA CAGCATAGCT
GTTGCTTTGG GGAGATACGA TAGGAACGGC TTTCTTATGG ATTTGGAAAC GCTGTGGGAT
GGCGTGGAAA AGAGGGGGCA TTATATTAAT GCCAAACCTT CGGGCAAACA TGATGATCCT
TTAAATGTTT TTGCGGAACG CAATGTCAAG GACAGGTAA
 
Protein sequence
MSLKTKVKNG IGIFLVVLSV CCGSPLFAGL QPEEGMDLSK PRIWTAVTGH QLKAAYEGRD 
GNKIRLLANG GKIKTILLEK LSEADQRLLE HLTREENRGG EAGFPDSGRN GKEHPFVGFL
AEAKQKRLDG EEGVEYYQNL IDALYEEMKK HEFGYPKDFI KEDGLFNMKY LKSVHGRKKR
ISERVNLFTG KHLFTPSYGN EPRWEWVDQK INFVIEGDLI RLKQQDTLYA KNKEKMVEFG
ILINKKKRNA ENYKAWYTED SNDYLYRLEA CVNSSDDGRF VIDRMNREEW LHLCAHDTKK
GIFYTLKILV DENGKITIRN SEEGSFLFFA TVPKNRGSKD VFAYPRNNDR LVFNSVYSIA
VALGRYDRNG FLMDLETLWD GVEKRGHYIN AKPSGKHDDP LNVFAERNVK DR
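
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the gene sequence into protein. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares; the two differ only in permitted start codons, which are not modeled here), and unrecognized codons are rendered as `X`.

```python
from itertools import product

# Amino acids of the standard genetic code in NCBI order (bases T, C, A, G).
_BASES = "TCAG"
_AMINOS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINOS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean_sequence(text)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGTCCCTTA AAACAAAGGT AAAGAATGGA"))  # MSLKTKVKNG
```

The printed residues match the first block of the protein sequence; passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the Protein Length and GC content fields.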