Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1504 |
Symbol | |
ID | 6274586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1796305 |
End bp | 1797363 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642613563 |
Product | hypothetical protein |
Protein accession | YP_001878106 |
Protein GI | 187735994 |
COG category | [R] General function prediction only |
COG ID | [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1 [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0023514 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCATGG ATTCCAGCTA CCAGACTCTG GAAGGCGCCG TTAAAAAGCT TTCCGAAATA GCCTCCACCA GGCACGATCC CCGCTACCTC CAAGAGTATA TCAAAACGGG AATCAACATG GCCCAGTCCG CCGCCTCGGA CCATGATTTT ACCGTCCTGA TCCGCTCCGG GCGGGAGATG TACCGCGCCA ACTGCGTTTT CGCCCCGTAC CGCCACATCC GCAAGATTTC CGTCTTCGGC TCCGCCCGCA TCAGGAATGA CGAACCGGCG TATGAAACGG CGAGGGAATT CGCCAGGGAA GCCAGCGAAC ACGGCTACAT GGTCATTACC GGAGGCGGAC CGGGCATCAT GCAGGCAGCC AATGAAGGGG CGGGAGAGCA ACGCTCCTTC GGCCTGAACA TCACCCTGCC GTATGAACAG ACCTCCAACC ATGTGGTGGC CCACAGCGAC AAACTCATCA ATTTTTATTA CTTTTTCGTC AGAAAACTGA ACTTCGTGGC GGAAAGCGAC GCCATGGTGG CATTCCCCGG AGGCTTCGGA ACCATGGATG AAGTGTTTGA AACACTCACT CTGATCCAGA CGGGAAAAGC GACCATTTAC CCGATCGTCC TTCTGGATTC CCCCGGCAAA ACCTTCTGGC TGAACTGGCT GGCCTTCATT CGCGTGGAAC TGGTGGATTC CGGACTGATT TCCGCAGACG ATCTTCATCT CATCCATGTC ACTAAAAATC CGGCGGAAGC CATGGAACAC ATCGACCGTT TTTACCGGAT TTTCCACTCC TACCGTTTTG TCGGAGATTC CATCGTCATC CGGCTGAATG CGCAGCTTCC CGCCCAGTGG GTGGAACATC TGGAACGGGA CTTTTCAGAC CTGATTCTGC CTGGGGGGAA AATGATCCAG AGCGGCCCCC TGCCGGATGA AGCGGACGAA CCACACCTGG ACCGCCTGCC CCGGCTCGTC TTCCCCATCA AACGCGGCAA CTACGGCAGG CTGAGGTTGC TGATTGACCG CATCAACCAG ACGCCCAGCC GAACCTATTC CCCGCCCACC CATGCCTGA
|
Protein sequence | MSMDSSYQTL EGAVKKLSEI ASTRHDPRYL QEYIKTGINM AQSAASDHDF TVLIRSGREM YRANCVFAPY RHIRKISVFG SARIRNDEPA YETAREFARE ASEHGYMVIT GGGPGIMQAA NEGAGEQRSF GLNITLPYEQ TSNHVVAHSD KLINFYYFFV RKLNFVAESD AMVAFPGGFG TMDEVFETLT LIQTGKATIY PIVLLDSPGK TFWLNWLAFI RVELVDSGLI SADDLHLIHV TKNPAEAMEH IDRFYRIFHS YRFVGDSIVI RLNAQLPAQW VEHLERDFSD LILPGGKMIQ SGPLPDEADE PHLDRLPRLV FPIKRGNYGR LRLLIDRINQ TPSRTYSPPT HA
|
| |