Gene Amuc_0020 details


Gene Information

Locus tag: Amuc_0020
Symbol:
ID: 6275218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 25433
End bp: 26587
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 60%
IMG OID: 642612060
Product: putative MFS family transporter protein
Protein accession: YP_001876648
Protein GI: 187734536
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
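
The start, end, and length fields of this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(25433, 26587)
print(length)                  # 1155
print(protein_length(length))  # 384
```

Both values agree with the Gene Length and Protein Length fields of this record.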


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTACAT TTACCTGGCC TATTGCCCTG CTGCTGGCGG GGCTTCTTTT CCAGACAGTG 
GCGTTTGCCG TGTTGAATAC GGTTGTTCCC CTGTGGATGG AGCAATTTGA TGCCGCCACC
TGGGAGGCAG GGCTGGTGGG GGCTTTCTTT TTTCTCGGGA ACCTGGCCGG AACGCTGCTG
GCTGGCGGCG TGATTCGCCG GGCCGGGTTC AAAGGAAGTT ACCAATATGC ATGCCTTTTA
TGCGCAGTCT CCACCGTTCT GCTGCCTGTG TTTCCCGGCG TGCCGGCCTG GAGCGGCCTC
CGGCTGCTGG CGGGGATCAG CTGCGCCCTG GTCTGGGTGG TGGTGGAAAG CGCCCTGCTG
AGGGCCGGAA CGCTGCAAAC CCGCGGCATT CTGCTGGCTT CCTACATGGT GGTTTATTAT
CTGGGTACGG TGCTGGGGCA GTTGCTTCTG GGCTGGTTCC CCAGCGATAT GCCCCTGATT
GTGACGGAAG TCTGCATTTT ATCAGTGGCG GGCATGGTTC CGCTGATGTT TGCGCGTCTG
GAGCCGGGCA ATGGACAGGT TTCATCCTCC TCCCATATAG AGATTCGGAC ACTGCTGAGA
CGCCGCAGCG TCTTTCTGGG TGTTGTGGGA TGTGTGATTT CCGGCGTGGT ATTGGGTACT
ATTTATTGCC TGATGCCCCT GTTCCTGAAG CACCAGGGAA TGGACCACTC TTCCGTGGGA
TACTGGATGG CCCTGCTGAT TGCCGCTGCC ATTCTGGGGC AGTGGCCCAT GGGGCGGCTG
GCGGACAGGT ACGGCCGCGC TTTCGTCATG AAATGCCAGT CCCTGCTGGT GGCGGCGGCC
TGTGCCGGGC TGATGCTGAA GGGGGGGCTG ATGGCTCCCT CCCTGATTGC TCTGGGGCTG
GCCGGATTTT CCCTGTACCC TGTTGCCATG GCCTGGGGAT GCGAGGAAGC TTCCCGGGAT
GAACTGGTGA CCATGAACCA GCTTCTGCTG TTGAGTTATT CCCTGGGCAC GCTGGCCGGC
CCTTCCCTGA CTTCGTTCCT GATGCAGAGG TATTCCGACA ATTGGATGCC TATGGTTATT
GCGCTGGTGG CCCTTTCCTT CATGCCTGTG CTGATGCTGG GCGGCGGCCA CGGAAGGAGA
AAGCTGTCCC GGTAA
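
The GC content reported for this gene can in principle be recomputed from the sequence text. The sketch below assumes the sequence is pasted as shown here, in space-separated blocks of ten bases; `gc_content` is a hypothetical helper name, and the example call uses only the first ten bases of the gene, not the full sequence.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First ten bases of the gene sequence, used only as a small demonstration.
print(gc_content("ATGCGTACAT"))  # 0.4
```

Passing the full gene sequence to the same function would give the gene-wide fraction, which can then be compared with the rounded percentage in the record.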
 
Protein sequence
MRTFTWPIAL LLAGLLFQTV AFAVLNTVVP LWMEQFDAAT WEAGLVGAFF FLGNLAGTLL 
AGGVIRRAGF KGSYQYACLL CAVSTVLLPV FPGVPAWSGL RLLAGISCAL VWVVVESALL
RAGTLQTRGI LLASYMVVYY LGTVLGQLLL GWFPSDMPLI VTEVCILSVA GMVPLMFARL
EPGNGQVSSS SHIEIRTLLR RRSVFLGVVG CVISGVVLGT IYCLMPLFLK HQGMDHSSVG
YWMALLIAAA ILGQWPMGRL ADRYGRAFVM KCQSLLVAAA CAGLMLKGGL MAPSLIALGL
AGFSLYPVAM AWGCEEASRD ELVTMNQLLL LSYSLGTLAG PSLTSFLMQR YSDNWMPMVI
ALVALSFMPV LMLGGGHGRR KLSR
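
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation in plain Python. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted), and it does not handle alternative start codons or ambiguous bases. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCGTACAT TTACCTGGCC TATTGCCCTG"))  # MRTFTWPIAL
```

The last line of the gene sequence, `AAGCTGTCCC GGTAA`, translates to `KLSR` followed by a stop codon, matching the end of the protein sequence.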