Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1777 |
Symbol | |
ID | 6274471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2164464 |
End bp | 2166728 |
Gene Length | 2265 bp |
Protein Length | 754 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642613840 |
Product | von Willebrand factor type A |
Protein accession | YP_001878376 |
Protein GI | 187736264 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.582496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.264323 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTCC TTTATCCCAA CGTTCTGTAC GCGCTCATTC TCCCGGTCCT GCTCGCAGCG GCGGCCTGGT GGCTGTGGCG CCGCCGTTCC AGAAAATGGG AAGTGCTCGT TTCGCCGGAA TACCGGCGGG AACTGGTTCA CGCTCCGGCC ACCTGGCACC GGGTGCTCCC CGTCATCTTT GCCGTGCTGG CGTCCATCTT CGCCATCCTC TCCATTGCGC GCCCGGTGGA CGGCTACACG GAGGTAAAGG AAATCCCCAA ATCGCGCAAC ATCCTCATTG CTATTGACTG CTCCCGTTCC ATGCTCAGCA AGGATGCTTC CCCGACCCGC TTGGGAAGGG CAAAAACCGC CGCCTACGAC CTGCTGGACG CCCTGCCGGG GGACAACTTC GGCATCATCA TCTTCTCCGG AGACGCCGTC CTGCTCATGC CCCTTACCCA TGACCATAAC GCATTAAAAG AAACCATTGA ACAACTTCAA TTCGGCTGGG TCTCCCAAGG GGGAACCAAC CTGGAAAACG TTGTGCGGCT GGCCCTTCAA ACCTTCAAAC GGGACAAGGA AGCGGACGCC AAAAACGCCC TGGTCATCCT CAGCGACGGG GAAGACACCG TCAACATCAC CTATAAGACT GCGGAAGCCG CACGGCAGCA CCAGCTTATC ATCGTGACAG CGGGTATCGG TACTACTATC GGTACCACTA TTCCGGATGA GCAATCCCCC TCCGGCCTGT ACCGGGATCG TCGTGGCCAG CACGTCGTCT CCAAGCTCAA TCCGGAAAGC CTGCAATACC TGGCCAGGCA AACGGAAGGC CAATATGTCC AACTCTCCGA CGGAGCGGCG CTGAACCGCT TCGTCAAGGA CATCGCGGAC CGGCTGGATA CCACGGAAGG CAAGGAAGAA GTGCGCCGCG TGCCGAATGA CCGCTACATC ACCTTTGCCG TTCCGGCCCT CATCTGCCTG CTCCTCACCC TTCTGGCGGG AACACGCTGG CGTTCCTTCC GCCGTTCCGG ACGCCGCGGT ATGACATCTC TGACTGCTGC GGCGTTATTC TGCGCCGGAC TGCTGGGCCA GGAAGCGCAG GCGGACACGC ACGCCCTGGA CAATGTCACG GATTTGATCC GGACGGGAAA AACGGAAGAA GCCGTCAAAT CCATTGATGA AATGCTCGCT GTTCCCGACC TGGCGGAAGA AACGCGCCAG GCCTTGGAAT TCGCCAAAGG ATGCCTGGAA CAAAAAGCGG ATAACCCCAA AGAAGCCGCG GAAGCTTTCT CCCAGGCGCT CCTTTCTCCC AAACCAGCCC TCCAGGCGGA TTCCCACTTC AACCTTGGCA ACCTGGAAGC CGCCCAGGCC CGCAAGACCA TGACTTTCTC CAGGCCGGAA GGGGAACAGC AGCAAGCCCG GCTCTCTTCC ATTGACGACC AAATTAAGGA AATTGATGCC CGGCTGGCAA AAATACCCGT TGCGAAAAAA CATGTTAAAG AGGCCGTCAA ACGTTTTGAC GATGCGCTCT CTGCCTATGG TTCCCATGAA GGAGCAGCGT CCAACAAGGA GGAAATGCTC CGTTATGATA AATCCTTGGA CGAATACCGC CGGCAGCTGG AAGAATTAAA GAAGAAATTG GAAGAAGAGA AGAAAAAACA ACAGGACCAG AACAAGGACC AGAACAAGGA CCAGAACAAG GACCAGAACA AGGACCAGAA CAAGGACCAG AACAAGGACC AGAACAAGGA CCAGAACAAG GACCAGAACA AGGACCAGAA CAAGGACCAG AACAAGGACC AGAACAAGGA CCAGAACAAG GACCAGAACA AGGACCAGAA CAAGGACCAG AACAAGGACC AGAACAAGGA CCAGAACAAG GACCAGAACA AGGACCAGAA CAAGGACCAG AACAAGGACC AGAACAAGGA CCAGAACAAG GACCAGAACA AGGACCAAGA CAAGGACCAA GACAAGGACC AAGACAAGGA CCAAGACAAG GATAACGCCG AAACCAAGGA AGGCAAGAGG GACGAAAATG AAATGAAAAA CGCCCGGCTT CCCTCCTCTC CGGAAAAAGA CAGGCTGCCA GAAACGTCCG GAATGGAAAA ACCGCAGCCC CAGCCGGAAG ATAACAAGCC CGTTCCGGCA GCCGCACAAA AGGAAAGCAG GGAGGAAAAG GAACGCAGGG AAGCAAGAGC CATCCTGATG GAACGCAGGG ACATTGAACC CGGCTGCCCC GTTCCCCAGC GTTCTCCGGA AATTCCTCCG GATAAGGATT ACTAA
|
Protein sequence | MTFLYPNVLY ALILPVLLAA AAWWLWRRRS RKWEVLVSPE YRRELVHAPA TWHRVLPVIF AVLASIFAIL SIARPVDGYT EVKEIPKSRN ILIAIDCSRS MLSKDASPTR LGRAKTAAYD LLDALPGDNF GIIIFSGDAV LLMPLTHDHN ALKETIEQLQ FGWVSQGGTN LENVVRLALQ TFKRDKEADA KNALVILSDG EDTVNITYKT AEAARQHQLI IVTAGIGTTI GTTIPDEQSP SGLYRDRRGQ HVVSKLNPES LQYLARQTEG QYVQLSDGAA LNRFVKDIAD RLDTTEGKEE VRRVPNDRYI TFAVPALICL LLTLLAGTRW RSFRRSGRRG MTSLTAAALF CAGLLGQEAQ ADTHALDNVT DLIRTGKTEE AVKSIDEMLA VPDLAEETRQ ALEFAKGCLE QKADNPKEAA EAFSQALLSP KPALQADSHF NLGNLEAAQA RKTMTFSRPE GEQQQARLSS IDDQIKEIDA RLAKIPVAKK HVKEAVKRFD DALSAYGSHE GAASNKEEML RYDKSLDEYR RQLEELKKKL EEEKKKQQDQ NKDQNKDQNK DQNKDQNKDQ NKDQNKDQNK DQNKDQNKDQ NKDQNKDQNK DQNKDQNKDQ NKDQNKDQNK DQNKDQNKDQ NKDQNKDQNK DQNKDQDKDQ DKDQDKDQDK DNAETKEGKR DENEMKNARL PSSPEKDRLP ETSGMEKPQP QPEDNKPVPA AAQKESREEK ERREARAILM ERRDIEPGCP VPQRSPEIPP DKDY
|
| |