Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_2163 |
Symbol | |
ID | 6274756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2633224 |
End bp | 2635986 |
Gene Length | 2763 bp |
Protein Length | 920 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642614223 |
Product | Sel1 domain protein repeat-containing protein |
Protein accession | YP_001878751 |
Protein GI | 187736639 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.591838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.290265 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGCCC GTCTTTTGCT TATTCCCTTT ATTTTTTCCG TTCCGGCTTA TGGCCAGGAC ATTCCGTCCG CCGCAGCGGC AAATCCCCCC GGGCAGGTGA AATACGCTCC GGTCCGGAAC CTCTCCCAAC TGCCTATGGA CGGCGGCTGG ACGTACGGGG AAGTGCTGGA CAAAGCCGCG GCTGGAGACT TCCGGGCCAA TATGCTGGCC GGGGAATACT GGATGGGGCT TTATTACCTG AACCGTGGAG ACCTGCCGCT GGCCTGCCTG GAACATATGC AGCATTTTTT CCGTGATGCG GCCACGGCAT CCGGCCATCC GGCACCCAAG CTCATGATGA CGGATCCTTA CGGGATTTTC GGGATTTTCA AGGCGCAACC CATGCAACCG GAACATGCGG AATGGCTGGC TGCCTGCCGC AAGCTTGCGG AAAAGGGAGA CATGGATGCC GTTACGGCCC TGTACACCGT CACGGATCTT CCGGAAGCCG AGTTGAGACG CTACCTGGCC CCCCTGGAAA AGAAGGCCGC TTCCGGCGAC CAGGATGCGC TGTCCCAGCT CGCTTTCATC AAAGTGGATT GGCTGAAGGA AAATCCCGCC AACGTTATCG AGACCCTGAA GCAATACCGG AAAAACGCCA AACCGGTTCT TCTGGCGCGG ATAGGGGAAC TGGCCCTTGC CCTGGACGGC AAGGACGGCA GGGGATGGAG GGATACGGCC ATTCAATCCC TTTCCCGGGC TGCGGAAAAG GGGCACGCGG GGGCCATGCG GCTTCTGGCG GAATTGACGA AGGGGGCCGC ACCGGCACAG GCCGGAAAGC TTTTGGCGGA ACTGCGCTCC AGAGGGGATC TGGAAACTCT GCTGGCGGAC CTCTCCGCCG GCATGAAGGA AAAAACAGGG GGAGGTAAAA GCTCCCTCCT CCAGCCTTTG TTCGAGCAGG CCAGGAAACT GGGCAGCAAT AACGCCTGTG CCTGGTACGC CCAGTTAATT TCCGAAGCGC AGCCCCCGCA GGAACAGGAG CTGCGCAAGG CGGCCCTTGA TCTGCTGTCC CTTCATGACG AACGGGGCAT GCCATACCTG AAGGAACCTC TGTCCTTCCC CCAGAAAAAA CTCAACCAGC TTCCCCCCTA CGCCAAGACA CATAAGGAGC TTGAGCTGGC CTCTCAGCTT GGGGACGCAG ATTCCAGCGT ACAGTTGGGC GAGCTGTGGA AGGAGGAACT GGCCAATACA CTGGATTCCG TAGGGAAGGC CCATGAAGCG CGGGAAAATA TGCGCGTATG GTACAGGCAG GCGGCGGAGG AAGGCCACCC CCTGGCCAGG CTGCGCCTGG CCCAAATGGA AAACCCGGAC TTAATGGACA AACCGTCCGC AGAACCTGCC AGAACGCTGC TGAACGATTG CCTGGCAAAA GCCGCCGCAG GAGATTTCAT CACGCTGAAA GGAATTTCCG AATACGTCAC GCCGGAACAA TGGGAACAGC TGACCGCCAC CCTCCGTAAA AAAGCTCAGG GGGGGAACGC CCGCAGCCAG GCGGATCTGG CATACCTGAT GTTATTCAGT TACAGGAGCG AAGAATCCGG CTACCCGGAA GCCCTGGCCT GGGCCCGGCT TTCCGCCGGA CAGGGAGATC CCTGCGGCAT GTACGTGCTC GGCCGCGCCC TCATTTATGA ATTCGGCCTG GAAAAACGCA TGGCCGAGGG GGATTCCTGG ATGTTCCGGG GAGCCTTGAA AGGCCATGTG GATGCGGCGG ACATGTGGAG TTCCAACAAT GAGAACCTGC AGCCTTACAG CGCCATGATG CACGGTCTGG CGGCACGGGG GCATATCCCC TCCCTGCTTT TCCTGGCCAA CCAGGCGGCC TATCCCCAGA ATACGGGGGA AACAGCGGCC CTGGACAACG GATTCACTTC CGCCTGCCAG CAGGTCAAGC TGACCCTGAA CGGAGGGGAC GCCAACACTC CGCCGTGGCT CCGGAAACCG GAACAGAAGG GTAAGGGGGA AGATATTCCG GCGGATGCCG GAACCGTTCC TCCCTGGATG ACCCCGTCCG GCAGGGAACG CCAGGCCCCG GCCCCGAAAA CCATCAATCC GGAAGCCGTG AAGTTCTGGG AACAAGCTGT GGAACTCGGC AGCCTGACGG CACTGGACGA GCTGGCCAAT TATTATGAAA CCGTGGCAGG GAATGTGCAG GAGCCTGCGG AACGGCAGCA GCTTCTGGCC TCTGCCATGA CGGCAGCCAC CGCCCTGGTG AAAAAGGAGG ACGTACGCGG TTTCAAGAGG CTGGCCCGCT ATTACGAGCA GGGCATCGGC GTGCAGCCGA ACAGGGAACT GTACAAGGAT TATGTACTGA AAGCCGCGGA AACCAGGGAT CCGGAAGCTC TGGTGGAGAA GGCGCGCCTG CTCATCAAGG GAGAGGATCT GGAGCCGGAC CCGAAAATGG CGCTGGATAT TCTGACCGAT CTGGAAGAGA ACCAGACGCA TGCGGTGCCC GGCATGCATT TCCTGCTCGG CTACTTGCAT GAAGAAGGCC TCGGCACCCC GCAGGATATG GCCCTGGCCT ATCAGTTTTA CATCAAGGGG GCGGAACAGG ACGACGCCAA GTGCATGAAC AACCTGGGTT CCATGTATGA ACGCGGAACT GGCGTAGCCA AGAACCTGGC CGAAGCCCAA AAGTGGTATG AACGGGCGGC GGAGCTGGGG AATGAAGACG CTCTTGCCAA TGTGGAACGG GTAAGGGAAA AACTCAACAC GAAGAAGAAA TAG
|
Protein sequence | MRARLLLIPF IFSVPAYGQD IPSAAAANPP GQVKYAPVRN LSQLPMDGGW TYGEVLDKAA AGDFRANMLA GEYWMGLYYL NRGDLPLACL EHMQHFFRDA ATASGHPAPK LMMTDPYGIF GIFKAQPMQP EHAEWLAACR KLAEKGDMDA VTALYTVTDL PEAELRRYLA PLEKKAASGD QDALSQLAFI KVDWLKENPA NVIETLKQYR KNAKPVLLAR IGELALALDG KDGRGWRDTA IQSLSRAAEK GHAGAMRLLA ELTKGAAPAQ AGKLLAELRS RGDLETLLAD LSAGMKEKTG GGKSSLLQPL FEQARKLGSN NACAWYAQLI SEAQPPQEQE LRKAALDLLS LHDERGMPYL KEPLSFPQKK LNQLPPYAKT HKELELASQL GDADSSVQLG ELWKEELANT LDSVGKAHEA RENMRVWYRQ AAEEGHPLAR LRLAQMENPD LMDKPSAEPA RTLLNDCLAK AAAGDFITLK GISEYVTPEQ WEQLTATLRK KAQGGNARSQ ADLAYLMLFS YRSEESGYPE ALAWARLSAG QGDPCGMYVL GRALIYEFGL EKRMAEGDSW MFRGALKGHV DAADMWSSNN ENLQPYSAMM HGLAARGHIP SLLFLANQAA YPQNTGETAA LDNGFTSACQ QVKLTLNGGD ANTPPWLRKP EQKGKGEDIP ADAGTVPPWM TPSGRERQAP APKTINPEAV KFWEQAVELG SLTALDELAN YYETVAGNVQ EPAERQQLLA SAMTAATALV KKEDVRGFKR LARYYEQGIG VQPNRELYKD YVLKAAETRD PEALVEKARL LIKGEDLEPD PKMALDILTD LEENQTHAVP GMHFLLGYLH EEGLGTPQDM ALAYQFYIKG AEQDDAKCMN NLGSMYERGT GVAKNLAEAQ KWYERAAELG NEDALANVER VREKLNTKKK
|
| |