Gene Amuc_0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0037 
Symbol 
ID6275166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp51964 
End bp53448 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content55% 
IMG OID642612078 
Productamino acid permease-associated region 
Protein accessionYP_001876665 
Protein GI187734553 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.877998 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGTAT TCACTCTCGC CATGATCAAC GTTTCCGCCA TTGTCAGCCT GCGCGGCATG 
CCTGCGGAAA GCACTTACGG ACTGAGTTCC GTTTTTTATT ACATTTTCGC CGCGGTATTC
TTTCTGGTGC CTGTTTCCCT GGTCGCCGCG GAGCTTACTA CCGGATGGCC CCAAAAGGGC
GGCGTTTACC GCTGGGTAGG CGAGGCATTC GGGAAAAAAT GGGGGTTCCT GGCCATCTGG
CTGCAATGGA TTGAGAGTAC CATCTGGTTC CCTACCGTTC TGACATTCGC CGCCGTTTCC
CTGGCTTTCA TGGGGCCCGG ACAAAGATGG GATGAAGCGC TTGCCGCCAA TAAATGGTAT
GTTCTCATCG TGGTGCTGTG CGTGTACTGG GCGGCCACCC TGCTTAATCT GCGCGGCATG
AAGACTTCCG CAGGCGTCAC CAAATGGGGA ACCATCATCG GAACCATTAT TCCCGGAGCC
ATCCTGATCC TGCTGGGCCT GGGCTATTGG GCCGGCGGCA ACCCGATCCT GCTGGATATG
AGCTGGGACA AGCTGGTGCC GGACATGAGC AATTTCAACA ACCTCGTTCT GGCAGCCAGC
ATCTTCCTGT TTTACGCGGG GATGGAAATG TCCGCCGTGC ATGTGAAGGA TGTGAATAAT
CCCGGACGCA ATTATCCGCT GGCCATTCTG ATTTCCGCCA TCATTACGGT GCTTATTTTC
GTTCTAGGCA CGCTGGCCAT CGGCTTCATC ATTCCCAATT CCCAGATTAA TCTGGTGCAG
AGCCTGCTGA TTACTTATGA CAGCTATTTC AGCTTCTTCG GCCTCGGCTG GATGAACTGG
ATTCTGGCGC TTGCGCTGGC CGTCGGCGTT CTGGCCCAGG TAACCGCATG GGTGGGAGGC
CCCTCAAAAG GCCTGTACCA AGTGGGCCTG GCCGGCAACC TTCCGCCTGT CATGCAGAAG
CGGAACAAGA ACAACGTCCA GATGGGCATC CTTTTTATCC AGGGGGGAAT CGTCACCCTG
CTTTCCATCA TGTTTGTGAT CATGCCTTCC GTGCAATCCG CCTACCAGAT TATTTCCCAG
CTGACCATCA TTCTGTACCT CATCATGTAC ATGCTGATGT TCGCGTCAGG CATTTACCTG
CGCTACCGGG AACCGAATAC GCCCCGTACT TTCCGCATTC CCGGCGGCAG AACCTTCGGC
ATGTGGATTG TCGGAGGGCT CGGCTTCCTG GCCAGCCTGG CGGCTTTCCT GGTGAGCTTC
ATCCCGCCTA ACCAAATTAC CGTAGGCAGC AGTACCATGT ACATTCTCCT TCTGGTGGTG
GGTACCTTCA TTTTCGCGGG TATTCCCTTT ATCATCCATG CCATGGCCAA ACCCTCCTGG
AAACGGCCGG TGGATCCCGA AGACGCCTTT GAACCCTTCG GATGGGAGAA AAACAATGAT
TCCCATTCCG CAGCAACCCC TAGCCATTCC ATATCCCATG AGTGA
 
Protein sequence
MGVFTLAMIN VSAIVSLRGM PAESTYGLSS VFYYIFAAVF FLVPVSLVAA ELTTGWPQKG 
GVYRWVGEAF GKKWGFLAIW LQWIESTIWF PTVLTFAAVS LAFMGPGQRW DEALAANKWY
VLIVVLCVYW AATLLNLRGM KTSAGVTKWG TIIGTIIPGA ILILLGLGYW AGGNPILLDM
SWDKLVPDMS NFNNLVLAAS IFLFYAGMEM SAVHVKDVNN PGRNYPLAIL ISAIITVLIF
VLGTLAIGFI IPNSQINLVQ SLLITYDSYF SFFGLGWMNW ILALALAVGV LAQVTAWVGG
PSKGLYQVGL AGNLPPVMQK RNKNNVQMGI LFIQGGIVTL LSIMFVIMPS VQSAYQIISQ
LTIILYLIMY MLMFASGIYL RYREPNTPRT FRIPGGRTFG MWIVGGLGFL ASLAAFLVSF
IPPNQITVGS STMYILLLVV GTFIFAGIPF IIHAMAKPSW KRPVDPEDAF EPFGWEKNND
SHSAATPSHS ISHE