Gene Information |
Locus tag | Amuc_1780 |
Symbol | |
ID | 6274531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2168373 |
End bp | 2169263 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613843 |
Product | protein of unknown function DUF58 |
Protein accession | YP_001878379 |
Protein GI | 187736267 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.226086 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAAG CCTCAGACAT TCTCAAGCGC GTACGCCGCA TTGAACTGCG CGCCAGGCAT CTGGCCACGG AAAACTTCGC CGGGCAATAC CAGTCCGGCT TCCGCGGACA GGGGCTGGAC TTTGACGACT TCCGGGAATA CATGCCGGGA GATGACCCCC GCTTCATTGA CTGGAAGGTA ACGGCCAGGA TGAACTCCCC TTTTGTCCGC CGTTTCCGGG AGGAACGGGA ACAGGCCGTC ATTCTGGCGG TGGACGTCAG CGGCTCCATG CACTACGCCT CCTCCGCGGC CCGCGTCTCC AAACTGGACT ATGCGGCGGA AGTAGCGGCA GTGCTCGCCT ACAGCGCTGC CCAGAGCGGA GACAAATGCG GCCTCCTTAT CTACGGGAAC AGCCACTCCC ATTACATCCC CCCGGCCAAG GGAGTCAAGC AGACCCTGCG CATCGTCCGC GAAATCGTAG CCAGTAAAAA CGATGGAGCC GACCAGAACA TTTCCGATGT AGCCCGGCAA CTTGTCCTTT CCCAGAAAAA AGCGGCCATG GTCATCATGA TCAGTGACTT TTGGGGTGAG AACAATAAAG CCGCCCTGGG GCAGCTCAAC TTCAAGCATG ACTTCATCCC CATCCGCATC GCAGACCCGA TGGAACTGCA TCTGCCGGAT GCCGGACGCG TCATCCTGAA AGATCCGGAA ACTGGCAAAA GCATGTTCCT GAACCTTTCC CGTCAGGATG TCCGGGAAAC CCACGCCAAC GTCGTTCATC TGCACCGCGA GAAATGGACG CAGGATTTCC GCCGCCTGGG CATTGACTTC CTGGACTTGC AGACCACAGA CAACTTCATG CCTCCCCTCC GGGCCCTTTT TGCCAGAAGA TCCCGTAAAT TTTCACGCTA A
|
Protein sequence | MDKASDILKR VRRIELRARH LATENFAGQY QSGFRGQGLD FDDFREYMPG DDPRFIDWKV TARMNSPFVR RFREEREQAV ILAVDVSGSM HYASSAARVS KLDYAAEVAA VLAYSAAQSG DKCGLLIYGN SHSHYIPPAK GVKQTLRIVR EIVASKNDGA DQNISDVARQ LVLSQKKAAM VIMISDFWGE NNKAALGQLN FKHDFIPIRI ADPMELHLPD AGRVILKDPE TGKSMFLNLS RQDVRETHAN VVHLHREKWT QDFRRLGIDF LDLQTTDNFM PPLRALFARR SRKFSR
|
| |
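The length fields in the record above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon; both assumptions fit the values shown here but are not stated by the record. The function names are hypothetical and are not part of any database API.

```python
"""Consistency checks for the coordinate and length fields of a gene record.

Assumptions (not stated by the record itself): coordinates are 1-based and
inclusive, and the gene length includes the terminal stop codon.
"""


def clean(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start and end positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose length includes the stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    # Values copied from the Start bp, End bp, and Gene Length fields.
    gene_len = span_length(2168373, 2169263)
    print(gene_len)                  # 891
    print(protein_length(gene_len))  # 296
```

Passing the Gene sequence field to `gc_fraction` gives a value that can be compared with the reported GC content. Strand does not affect that value, because reverse-complementing a sequence swaps G with C and leaves their combined count unchanged.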