Gene Information |
Locus tag | Amuc_0555 |
Symbol | |
ID | 6275349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 654455 |
End bp | 655786 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642612605 |
Product | protein of unknown function DUF21 |
Protein accession | YP_001877174 |
Protein GI | 187735062 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
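The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included
    in the gene length and is not counted as an amino acid."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(654455, 655786)
    print(length)                  # 1332
    print(protein_length(length))  # 443
```

Under these assumptions, the values reproduce the Gene Length (1332 bp) and Protein Length (443 aa) fields of this record.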
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.149438 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCTGG CAGTTGCCGG CTTTATCCTT TTTTTGTTGT TGAATGCGTT TTTTGCGGCA GGCGAGTTCG CCTTGATGAA GGTTCGTGAA AGCCAGCTGC ACGCCGGGGA AGGTGTTCCG GCCCGAACTC GGAAAAAACT GGCCCGGGCG CGGAAGGCTG CCAAGCATCC TGATCTTTAT TTGGCCGCCT GCCAGGCGGG CATTACGCTT TCTTCCCTGG CGTTGGGATT CCTGGGAACG TTTTTTGTAT CGGAACTGAC AGCTCCCTTT CTGGTTTCCC TGGGATTGGG AGGCATGGTT TCCGTTTACG GAATCGCTCT GGCCGTTACA TTTATTTTCT TTGCCTGCTG CCAGGTGGTA TTTGGGGAGT TTATCCCCAA GGCTATGGCG ATGCGCCAGC CGGACAAGGC CGCCCTGGCG ACGGTTCCCC TGCTGTATTT CTTTTATACG GTGTTCAGAT ATACGGGTAT TCTTGGCCTG ACGGGCGGAA TGGCGCGGTT TGTGCTGAAA TACCTGCTGG GCATAGACCC CCGTTCCACG GCGTGCACGG TGCACAGCAC GGATGAATTG ATGTATCTGG TGGAAGAAAG CGAACGTTCC CGCGAGCTGA CGAAGCAGGA GGCTGAAATT TCCAAAAATG CCCTTGAACT GAACGATATG TGCGTCAAGG ACGTAATGAC CCCGCGCTCT GAAGTGGATG TGATGGATTT GACGGCTCCC TTTGAGGAAA ACTGGGAGCT TGCCCGGAAA TCCCGCCACA CCCGGTTTCC GCTGGTGGAG GGAGACCACC TGGATGAAGT GAAAGGCTGG GTGCATGTCA AAGATCTGCT CAAACTGGTA GGACGGGAAA ATCCGGATCT GAGGAGCGTG CGGCGTGAAT TGCGCGTGGT GCCGGATACG ATGCCCCTGG ACAGCCTTCT CACGTTCTTT CTGAAAGAAC ATGCCCACTT TGCCCTGGTA GTGGATGAAT TCGGTGATTC TATCGGCCTG GTATTCCTGG ATGATGTGCT GGAACAGATT GTGGGGGATG ACATTCAGGA CGAGTTTGAC CAGGAGGAAA TGCGGGAGTT TGTGAAAACC GGCAAGGATA CATATGCCGT CAATGGGGCT ATTACCCTGT TTGACCTGGC AGATTACCTG CCTGAAATGG ATTTGGATTG TCCGGGCGTT ACTACGCTGG GCGGTTACGT AATCAGCCGG ATGGGTTATA TTCCGGAAGA AGGGGAGGAA TTGCGGATTG GCCGCTACCG GGCTGTGGTG ACGGGGTCTG ACGGCAGGAG AATCACGCAG ATTCTGCTGG CCCGCCTTCC GGAGGAACAG GAGGAGGAAT AG
|
Protein sequence | MMLAVAGFIL FLLLNAFFAA GEFALMKVRE SQLHAGEGVP ARTRKKLARA RKAAKHPDLY LAACQAGITL SSLALGFLGT FFVSELTAPF LVSLGLGGMV SVYGIALAVT FIFFACCQVV FGEFIPKAMA MRQPDKAALA TVPLLYFFYT VFRYTGILGL TGGMARFVLK YLLGIDPRST ACTVHSTDEL MYLVEESERS RELTKQEAEI SKNALELNDM CVKDVMTPRS EVDVMDLTAP FEENWELARK SRHTRFPLVE GDHLDEVKGW VHVKDLLKLV GRENPDLRSV RRELRVVPDT MPLDSLLTFF LKEHAHFALV VDEFGDSIGL VFLDDVLEQI VGDDIQDEFD QEEMREFVKT GKDTYAVNGA ITLFDLADYL PEMDLDCPGV TTLGGYVISR MGYIPEEGEE LRIGRYRAVV TGSDGRRITQ ILLARLPEEQ EEE
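The sequences above are shown in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the coding sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names are hypothetical, the input is assumed to contain only A, C, G and T, and the codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in permitted start codons, which this sketch does not handle).

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Amino acid assignments are those of the standard code, shared by table 11.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    fragment = clean_sequence("ATGATGCTGG CAGTTGCCGG CTTTATCCTT")
    print(translate(fragment))    # MMLAVAGFIL
    print(gc_fraction(fragment))  # 0.5
```

The translated fragment matches the first block of the protein sequence listed above. Passing the full gene sequence through the same functions would allow the Protein Length and GC content fields to be checked in the same way.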
|
| |