Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1688 |
Symbol | |
ID | 6274552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2051564 |
End bp | 2052886 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642613747 |
Product | amidohydrolase |
Protein accession | YP_001878287 |
Protein GI | 187736175 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.26873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.126758 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAAC AAGCGTGTGA TTTGCTGGTG ACCGCTTCCC GCATGCTGGG CGGAGCGGAA CAGTCCGTTG CTCCCGGGGA TGCGGCCGTC GCCGTCCGGG ACGGGAAAAT TCTGGAAACG GGAAGCGCCG CAGAACTGGA GGCCAGGTGG AGGCCCGCCG TCCGCAGGAA TTTGGGGAAT GTGCTGCTGA TGCCGGGGCT TATTAATGCC CATACGCATG TCCCCATGAC CTTTTTGCGC GGTTTTGCGG ATGACTTGCC GCTGATGGAA TGGCTGACCG GACATATTTT CCCAGTGGAA GCCCGGCTGA CGGATAAAAT CGTCTCCCTG GGGGCCCGGC TGGGCATGTA TGAAATGATG CGGACGGGCA CCACGGCATT TGTAGATTCC TACCTGTTGG AAGCCAACGT TCTCCAGGAG GCGGAACGCA TGGGAATGCG CTGTGTGGGC GGAGAGGTTG TTTTTGCCTT TCCCTCTCCG GCGTACGGCG GGTGGGACGG TGCGGAAGCC CTGTACCGGG AGCAGGCGGA GCGCTTTTCC GGCCGCGGCA GGGTACAGGT GGCCCTGATG CCGCACAGCG TGTACACCAC CAGTGACGAA GTGCTCCGGC GTTCCATGAA GCTGGCGGAG GAACTGGACC TGATGCTGCA TATCCATTTG TCCGAATCCG CCGGGGAAGT GGAGCAGTGC CGTTCCCTGC ACGGGGGCAG GCGTCCCGTA GGCTATGCCC GTGATATGGG GCTGCTGAAT GAACGCTCCG TGCTGGCCCA TATGGTGGAC GTGACGGATG AGGAACTGGA ACTGGTGGCC GCTTCCGGAG CGGCGGTTGT CCATAACCCG GTCAGCAACC TGAAACTGGC ATCCGGCTTC GCCCGCGTGC GGGATATGGT GCGGGCCGGC GTTCCCGTCT CTCTGGGAAC GGACGGAGCG TGCAGCAACA ACAGTCTGGA CATGTTTGAA ACAATGAAGC TGGCCGCTAT TCTTGCCAAG GGGTACAGCG GGGACGCTAC CGCGGTCCCC GCCATGCAGG CGTTGAAGAT GGCGACGGAG GAGGGCGCCC GCATTTTCCG GACGCCGGGG CTGGGAACAC TGGTTCCGGG GGCTCCCGCG GATATGATCG CCCTGAACCT GGATGAACCG AACCTGTGCC CGATTTTCAA CGAAACGTCC CATGCCGTTT ATGCCTCTTC CGGCAAGGAT TGCGTGTTCA CCATGGTGGA AGGGAGAATT TTGTATGATC ATGGAATTTA CACGGACGGC CTGTATGCGG ATACCGCCGC GGAAATGCGG GATCTGGTAA AGTGGGTGAA GAATAGTGAT TGA
|
Protein sequence | MEQQACDLLV TASRMLGGAE QSVAPGDAAV AVRDGKILET GSAAELEARW RPAVRRNLGN VLLMPGLINA HTHVPMTFLR GFADDLPLME WLTGHIFPVE ARLTDKIVSL GARLGMYEMM RTGTTAFVDS YLLEANVLQE AERMGMRCVG GEVVFAFPSP AYGGWDGAEA LYREQAERFS GRGRVQVALM PHSVYTTSDE VLRRSMKLAE ELDLMLHIHL SESAGEVEQC RSLHGGRRPV GYARDMGLLN ERSVLAHMVD VTDEELELVA ASGAAVVHNP VSNLKLASGF ARVRDMVRAG VPVSLGTDGA CSNNSLDMFE TMKLAAILAK GYSGDATAVP AMQALKMATE EGARIFRTPG LGTLVPGAPA DMIALNLDEP NLCPIFNETS HAVYASSGKD CVFTMVEGRI LYDHGIYTDG LYADTAAEMR DLVKWVKNSD
|
| |