Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1120 |
Symbol | |
ID | 6273947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1340007 |
End bp | 1342397 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642613171 |
Product | hypothetical protein |
Protein accession | YP_001877727 |
Protein GI | 187735615 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATTTCC AATCTCTATA TTTATCTGCC GTGGCGGTAA GTGCCGTTTC TTTCGGGTGG GGGGCTCTGG ATAAGCCTTC CGCCTCCAAT TTGATCTGGT CTGATGAACC GGCCGTAGTT GTTTATCCGC AGGAAGACAA AAATTCCGAG GGCAGTTTTG GCAAGTACAG AAAGCCTGCC TCCGTCTGGG AAGCGGAGGG GTATCCCATT GGCAATGGTC GCGTTGGAGC CATGATTTTC AGCGCTCCCG GCCGCGAGCG GCTGGCCCTG AATGAAATCA GCCTTTGGTC CGGAGGTGCC AATCCGGGGG GAGGGTACGG TTACGGGCCT GATGCCGGAA CGAACCAGTT CGGCAACTAT CTTCCCTTTG GGGATTTGTT TGTGGACTTT AAAAAAGGCG ACCAACCGGC TTCCCTGTCT GTGGAAGATT TTACGCGCTC CCTGGATCTC CGGGACGGCA TTCATAAAGT GAATTACAAG GCGGACGGCG TAACGTATGA CCGGGAGGCA TTCTCCAGCA CGCCTGCCAA CGTCCTGGTG CTGAATTATA AAGCCAGCAA ACCCGGCCAA TTCAGCGCGG ATTTTTCCGT TAACAGCCAG CTTGGAGCAG ATATTTCCGC CAAGGGATCC GTCATCACCT GGAAGGGGAT GTTGAAAAAC GGCATGAATT ATGAAGGCCG CGTTTTGATC CGTCCCAAAG GCGGTACGCT TTCTGCCTCG GGAGATAAAA TTTCCGTGAA AAATGCGGAT TCCTGCATGG TCGTCATCGC CATGGAGACG GATTACCTGA TGGATTATAA AAAGGACTGG AAGGGTGAAT CTCCCTCCAG GAAGCTGGAC CGTTATGCGG CCAAAGCCGC TTCTGCGGAT TATGCCGCCC TGAAACAGGC CCACATTTCC CAGTACAAGT CCATGTTTGA CCGGGTGAAG GTCAACTTCG GAAAAACGGA GGAGGATGTA GCCAAGCTGC CTACGCCAAA ACGTCTGGAG GCCTATAAGA AAAATCCGGC AGACCCCGAT TTGGAGGAAA CCATGTTCCA GTTTGGCCGA TATCTGCTGT TGTCCAGTTC CCGGCCCGGC ACGCTACCGG CCAACCTGCA AGGGTTGTGG AACGATTATG TCAAACCGCC GTGGGCCTGC GACTACCATA ACAACATCAA CGTCCAGATG GCGTATTGGG GGGCGGAACC CGCCAATCTT TCCGAATGCC ATGAGGCCCT GGTCAATTAT GTGGAGGCAA TGGCCCCCGG CTGCCGGGAC GCTTCCCAGG CGAACAAGGG GTTCAATACC AAGGACGGTA AACCCGTGCG CGGCTGGACG GTGCGCACCT CCCAGAATAT CTTCGGTGGC AACGGCTGGC AGTGGAACAT TCCCGGTGCG GCCTGGTATG CGCTGCATAT ATGGGAACAT TATGCATTTA CCGGTGATAG AAAGTATCTG GAAAAACAGG CGTATCCCCT GATGAAGGAA ATCTGTCATT TCTGGGAAGA CCATTTGAAG GAACTGGGAG CCGGGGGCGA AGGGTTCAAG ACAAACGGCA AGGATCCGAG CGAAGAGGAG AAGAAGGATC TGGCCGATGT GAAAGCCGGA ACTCTGGTGG CTCCCAATGG CTGGTCTCCC GAACATGGGC CGCGTGAAGA CGGTGTGATG CATGACCAGC AGCTCATTGC GGAGCTTTTC TCCAATACCA TCAAAGCCGC CCGCATTCTG GGCAAGGACG CCGCCTGGGC CAAGAGCCTG GAGGGCAAGC TGAAAAGGCT GGCCGGCAAC AAGATAGGCA AGGAAGGGAA TCTTCAGGAA TGGATGATTG ACCGCATTCC CAAGACGGAC CACCGCCATA CGTCCCACCT TTTTGCCGTT TTCCCCGGCA ACCAGATCAG CAAGCTCAAG ACGCCCAAGC TGGCGGAAGC CGCCCGCCTT TCCTTGGAAT GGCGCGGCAC GACCGGAGAC AGCCGCCGTT CCTGGACGTG GCCGTGGCGC ACGGCTCTGT GGGCCCGCCT GGGCGAGGGG AACAAGGCTC ATGAAATGGT TCAGGGACTC TTGAAATTCA ACACTTTGCC GAATATGCTG ACTACTCATC CCCCTATGCA GATGGACGGC AACTTCGGCA TTGTAGGCGG CATTTGTGAA ATGTTGGTGC AATCCCATGC CGGGGGCCTG GACATCATGC CCTCTCCCGT GGAAGCGTGG CCTGAAGGTT CCGTGAAAGG GCTGAAAGCC CGGGGCAACG TCACGGTGGA TTTTTCCTGG AAGGACGGCA AGGTAAGCAA CGTGAAGTTG TATTCCGCCC AACCCAAGGT GTTGCCCGTG CGCGTTAACG GGAAGATGAC CCGCATGAAA ACCCTGCCGC TGAAATCGGG AGCGGGATCT TCGCAGCCCG CGGCCAGGTA A
|
Protein sequence | MHFQSLYLSA VAVSAVSFGW GALDKPSASN LIWSDEPAVV VYPQEDKNSE GSFGKYRKPA SVWEAEGYPI GNGRVGAMIF SAPGRERLAL NEISLWSGGA NPGGGYGYGP DAGTNQFGNY LPFGDLFVDF KKGDQPASLS VEDFTRSLDL RDGIHKVNYK ADGVTYDREA FSSTPANVLV LNYKASKPGQ FSADFSVNSQ LGADISAKGS VITWKGMLKN GMNYEGRVLI RPKGGTLSAS GDKISVKNAD SCMVVIAMET DYLMDYKKDW KGESPSRKLD RYAAKAASAD YAALKQAHIS QYKSMFDRVK VNFGKTEEDV AKLPTPKRLE AYKKNPADPD LEETMFQFGR YLLLSSSRPG TLPANLQGLW NDYVKPPWAC DYHNNINVQM AYWGAEPANL SECHEALVNY VEAMAPGCRD ASQANKGFNT KDGKPVRGWT VRTSQNIFGG NGWQWNIPGA AWYALHIWEH YAFTGDRKYL EKQAYPLMKE ICHFWEDHLK ELGAGGEGFK TNGKDPSEEE KKDLADVKAG TLVAPNGWSP EHGPREDGVM HDQQLIAELF SNTIKAARIL GKDAAWAKSL EGKLKRLAGN KIGKEGNLQE WMIDRIPKTD HRHTSHLFAV FPGNQISKLK TPKLAEAARL SLEWRGTTGD SRRSWTWPWR TALWARLGEG NKAHEMVQGL LKFNTLPNML TTHPPMQMDG NFGIVGGICE MLVQSHAGGL DIMPSPVEAW PEGSVKGLKA RGNVTVDFSW KDGKVSNVKL YSAQPKVLPV RVNGKMTRMK TLPLKSGAGS SQPAAR
|
| |