Gene Amuc_1297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1297 
Symbol 
ID6274175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1572892 
End bp1573965 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content49% 
IMG OID642613353 
Productputative substrate-binding protein of aliphatic sulfonate ABC transporter 
Protein accessionYP_001877902 
Protein GI187735790 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAATT TTAATTTTAT CCCATTTAAA TACCTGTTCT ACCATTTTAT GAAAACTCTG 
CCTCTTGCAT TAGATGCTCT TCTTTGTATT GGCGCTCTCC TGGTTCCAGC ATTTCTGGCA
TCCTGTCGGG AAGCATCCGA ACGGCAGGAT CAAATACGCT TTGGCCTTTT CCCCAACGTA
ACCCATGTCC AGGGGTTGGT TGCCAGGCAT TTCAGCCGCA CGGGGGAGGG ATGGTTTGAA
AAACGCATCT TTGAACGGAC GGGGAAAAAC ATCTCCATCC TATGGTATGC TTACAACGCC
GGCCCCAGTG CCATGGAAGC CATGTTCGCC AATTCCCTGG ATTTTACCTA CGTTGGTTCG
AGCCCTGCCA TTAATGCGTA TTCCAAATCC AATGGAACTC TCCTGCAAAT AGTAGCAGGA
GCCGTACAGG GAGGGTCCGG ACTGGTCGTC CCCACCCATT CCGAAGCACG AACACAAAAA
GACTTTCAAG GGAAAATCAT CGCTACCCCC CAATTGGGCA ATACTCAGGA TATAGCCTGC
CGGACATGGT TGGCCCTGGG AGGCGTAGCA GTCACTCAAT CCAGAGGAGA CGCAAGCATA
CTTCCTACCC CCAATCCAGA ACAGATATCT CTCTTTCGTC AAGGAAAACT AGACGGATCC
TGGACTGTAG AACCATGGAT TAGCCGCCTG GAAAAGGAAG CCGGAGGCAA GCTCCTGTTT
CTTGAAACGG ATGCCGTAAC TACTGTTCTG ACAGCCCAAA AGAGAATTCT GGAAAAACAG
CCGGACGTGG CACAGGCCAT TATTGAAGCT CACAGAGAAC TCACTTTCTG GATTATCGAA
CATCCCCAGG AAGCTCAAAA AATCGTCGTA GAAGAACTGA GGGAACTCAC CCGTTCCAGC
ATTGATCCCT CTTTAATTCT CCATGCATGG CCCAGGTTAG TTCCCACTAA TAAAATATCC
GAAGAAAAGC TCCAAGCATT TGCCAAGGAT ATGGTACGGA CTGGATTTTA CAAGGAACTT
CCCCGTGTCG AAGGCATCAT CCGGAATGAT AAAAATAATC CCAGCCATGA GTAG
 
Protein sequence
MWNFNFIPFK YLFYHFMKTL PLALDALLCI GALLVPAFLA SCREASERQD QIRFGLFPNV 
THVQGLVARH FSRTGEGWFE KRIFERTGKN ISILWYAYNA GPSAMEAMFA NSLDFTYVGS
SPAINAYSKS NGTLLQIVAG AVQGGSGLVV PTHSEARTQK DFQGKIIATP QLGNTQDIAC
RTWLALGGVA VTQSRGDASI LPTPNPEQIS LFRQGKLDGS WTVEPWISRL EKEAGGKLLF
LETDAVTTVL TAQKRILEKQ PDVAQAIIEA HRELTFWIIE HPQEAQKIVV EELRELTRSS
IDPSLILHAW PRLVPTNKIS EEKLQAFAKD MVRTGFYKEL PRVEGIIRND KNNPSHE