Gene Amuc_2163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2163 
Symbol 
ID6274756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2633224 
End bp2635986 
Gene Length2763 bp 
Protein Length920 aa 
Translation table11 
GC content60% 
IMG OID642614223 
ProductSel1 domain protein repeat-containing protein 
Protein accessionYP_001878751 
Protein GI187736639 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.591838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.290265 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGCCC GTCTTTTGCT TATTCCCTTT ATTTTTTCCG TTCCGGCTTA TGGCCAGGAC 
ATTCCGTCCG CCGCAGCGGC AAATCCCCCC GGGCAGGTGA AATACGCTCC GGTCCGGAAC
CTCTCCCAAC TGCCTATGGA CGGCGGCTGG ACGTACGGGG AAGTGCTGGA CAAAGCCGCG
GCTGGAGACT TCCGGGCCAA TATGCTGGCC GGGGAATACT GGATGGGGCT TTATTACCTG
AACCGTGGAG ACCTGCCGCT GGCCTGCCTG GAACATATGC AGCATTTTTT CCGTGATGCG
GCCACGGCAT CCGGCCATCC GGCACCCAAG CTCATGATGA CGGATCCTTA CGGGATTTTC
GGGATTTTCA AGGCGCAACC CATGCAACCG GAACATGCGG AATGGCTGGC TGCCTGCCGC
AAGCTTGCGG AAAAGGGAGA CATGGATGCC GTTACGGCCC TGTACACCGT CACGGATCTT
CCGGAAGCCG AGTTGAGACG CTACCTGGCC CCCCTGGAAA AGAAGGCCGC TTCCGGCGAC
CAGGATGCGC TGTCCCAGCT CGCTTTCATC AAAGTGGATT GGCTGAAGGA AAATCCCGCC
AACGTTATCG AGACCCTGAA GCAATACCGG AAAAACGCCA AACCGGTTCT TCTGGCGCGG
ATAGGGGAAC TGGCCCTTGC CCTGGACGGC AAGGACGGCA GGGGATGGAG GGATACGGCC
ATTCAATCCC TTTCCCGGGC TGCGGAAAAG GGGCACGCGG GGGCCATGCG GCTTCTGGCG
GAATTGACGA AGGGGGCCGC ACCGGCACAG GCCGGAAAGC TTTTGGCGGA ACTGCGCTCC
AGAGGGGATC TGGAAACTCT GCTGGCGGAC CTCTCCGCCG GCATGAAGGA AAAAACAGGG
GGAGGTAAAA GCTCCCTCCT CCAGCCTTTG TTCGAGCAGG CCAGGAAACT GGGCAGCAAT
AACGCCTGTG CCTGGTACGC CCAGTTAATT TCCGAAGCGC AGCCCCCGCA GGAACAGGAG
CTGCGCAAGG CGGCCCTTGA TCTGCTGTCC CTTCATGACG AACGGGGCAT GCCATACCTG
AAGGAACCTC TGTCCTTCCC CCAGAAAAAA CTCAACCAGC TTCCCCCCTA CGCCAAGACA
CATAAGGAGC TTGAGCTGGC CTCTCAGCTT GGGGACGCAG ATTCCAGCGT ACAGTTGGGC
GAGCTGTGGA AGGAGGAACT GGCCAATACA CTGGATTCCG TAGGGAAGGC CCATGAAGCG
CGGGAAAATA TGCGCGTATG GTACAGGCAG GCGGCGGAGG AAGGCCACCC CCTGGCCAGG
CTGCGCCTGG CCCAAATGGA AAACCCGGAC TTAATGGACA AACCGTCCGC AGAACCTGCC
AGAACGCTGC TGAACGATTG CCTGGCAAAA GCCGCCGCAG GAGATTTCAT CACGCTGAAA
GGAATTTCCG AATACGTCAC GCCGGAACAA TGGGAACAGC TGACCGCCAC CCTCCGTAAA
AAAGCTCAGG GGGGGAACGC CCGCAGCCAG GCGGATCTGG CATACCTGAT GTTATTCAGT
TACAGGAGCG AAGAATCCGG CTACCCGGAA GCCCTGGCCT GGGCCCGGCT TTCCGCCGGA
CAGGGAGATC CCTGCGGCAT GTACGTGCTC GGCCGCGCCC TCATTTATGA ATTCGGCCTG
GAAAAACGCA TGGCCGAGGG GGATTCCTGG ATGTTCCGGG GAGCCTTGAA AGGCCATGTG
GATGCGGCGG ACATGTGGAG TTCCAACAAT GAGAACCTGC AGCCTTACAG CGCCATGATG
CACGGTCTGG CGGCACGGGG GCATATCCCC TCCCTGCTTT TCCTGGCCAA CCAGGCGGCC
TATCCCCAGA ATACGGGGGA AACAGCGGCC CTGGACAACG GATTCACTTC CGCCTGCCAG
CAGGTCAAGC TGACCCTGAA CGGAGGGGAC GCCAACACTC CGCCGTGGCT CCGGAAACCG
GAACAGAAGG GTAAGGGGGA AGATATTCCG GCGGATGCCG GAACCGTTCC TCCCTGGATG
ACCCCGTCCG GCAGGGAACG CCAGGCCCCG GCCCCGAAAA CCATCAATCC GGAAGCCGTG
AAGTTCTGGG AACAAGCTGT GGAACTCGGC AGCCTGACGG CACTGGACGA GCTGGCCAAT
TATTATGAAA CCGTGGCAGG GAATGTGCAG GAGCCTGCGG AACGGCAGCA GCTTCTGGCC
TCTGCCATGA CGGCAGCCAC CGCCCTGGTG AAAAAGGAGG ACGTACGCGG TTTCAAGAGG
CTGGCCCGCT ATTACGAGCA GGGCATCGGC GTGCAGCCGA ACAGGGAACT GTACAAGGAT
TATGTACTGA AAGCCGCGGA AACCAGGGAT CCGGAAGCTC TGGTGGAGAA GGCGCGCCTG
CTCATCAAGG GAGAGGATCT GGAGCCGGAC CCGAAAATGG CGCTGGATAT TCTGACCGAT
CTGGAAGAGA ACCAGACGCA TGCGGTGCCC GGCATGCATT TCCTGCTCGG CTACTTGCAT
GAAGAAGGCC TCGGCACCCC GCAGGATATG GCCCTGGCCT ATCAGTTTTA CATCAAGGGG
GCGGAACAGG ACGACGCCAA GTGCATGAAC AACCTGGGTT CCATGTATGA ACGCGGAACT
GGCGTAGCCA AGAACCTGGC CGAAGCCCAA AAGTGGTATG AACGGGCGGC GGAGCTGGGG
AATGAAGACG CTCTTGCCAA TGTGGAACGG GTAAGGGAAA AACTCAACAC GAAGAAGAAA
TAG
 
Protein sequence
MRARLLLIPF IFSVPAYGQD IPSAAAANPP GQVKYAPVRN LSQLPMDGGW TYGEVLDKAA 
AGDFRANMLA GEYWMGLYYL NRGDLPLACL EHMQHFFRDA ATASGHPAPK LMMTDPYGIF
GIFKAQPMQP EHAEWLAACR KLAEKGDMDA VTALYTVTDL PEAELRRYLA PLEKKAASGD
QDALSQLAFI KVDWLKENPA NVIETLKQYR KNAKPVLLAR IGELALALDG KDGRGWRDTA
IQSLSRAAEK GHAGAMRLLA ELTKGAAPAQ AGKLLAELRS RGDLETLLAD LSAGMKEKTG
GGKSSLLQPL FEQARKLGSN NACAWYAQLI SEAQPPQEQE LRKAALDLLS LHDERGMPYL
KEPLSFPQKK LNQLPPYAKT HKELELASQL GDADSSVQLG ELWKEELANT LDSVGKAHEA
RENMRVWYRQ AAEEGHPLAR LRLAQMENPD LMDKPSAEPA RTLLNDCLAK AAAGDFITLK
GISEYVTPEQ WEQLTATLRK KAQGGNARSQ ADLAYLMLFS YRSEESGYPE ALAWARLSAG
QGDPCGMYVL GRALIYEFGL EKRMAEGDSW MFRGALKGHV DAADMWSSNN ENLQPYSAMM
HGLAARGHIP SLLFLANQAA YPQNTGETAA LDNGFTSACQ QVKLTLNGGD ANTPPWLRKP
EQKGKGEDIP ADAGTVPPWM TPSGRERQAP APKTINPEAV KFWEQAVELG SLTALDELAN
YYETVAGNVQ EPAERQQLLA SAMTAATALV KKEDVRGFKR LARYYEQGIG VQPNRELYKD
YVLKAAETRD PEALVEKARL LIKGEDLEPD PKMALDILTD LEENQTHAVP GMHFLLGYLH
EEGLGTPQDM ALAYQFYIKG AEQDDAKCMN NLGSMYERGT GVAKNLAEAQ KWYERAAELG
NEDALANVER VREKLNTKKK