Gene Amuc_0171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0171 
Symbol 
ID6275385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp210072 
End bp211634 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content58% 
IMG OID642612217 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_001876796 
Protein GI187734684 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.0745729 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACGG ACCTTTTCGT CATCGTCATT TACTTTCTTG CTATCTTCTG CATTGGCATT 
TATGCGGGAC GAAAGCAAAA CTCCCTGACA GACTATGCCC TGGGCAACCG CTCCCTGCCC
TGGTGGGCTA TTCTGGCCTC TATCCTGGCG GCGGAAATCA GCGCGGCCAC CTTCCTGGGG
GCTCCCGGAG AAGGGTACCA TACCCGCAAC TTCACATACG CCCAGCTCTG CATCGGCACC
ATCCTGGGCC GCATCATCGT CGGCAGGCTC TTTCTCAAGC CCTATTATGA CTACAAGGTT
GTTTCCATCT ATGAATACCT GGAAAAAAGA TTCGGGCTCC TGACGCGGCG GACAGCCTCC
ATGGTCTTCC TGATCAGCCG GGTGCTCGCC AGCGGAACCA GGCTCTATTT CGCGGGCATT
CTGCTGGTCA TTGCCTACCA GTTCCTGACA GGCGTCACGG CAGACGCCGA CCAGATTGTC
CTGCTCTACA TTGCCGCCCT GGTTGCCATC TCCGTCGCCA CGACAATTTA TACCGCGATC
GGAGGATTGA AAGCCGTCGT CTGGACGGAC GTCCTCCAGG CCGTCGTTCT GGGAGTCTCC
ATGCTTTCCG CCCTGTGGGT ACTGTTCTCC CATATCCCCG GAGGGTGGAG CTCCATCTCC
GCCGCCATGA ATGGCGGAGA CGATTGGAAA TTCTTCTCCT GGGGAACCGG AGAAGGCCTG
GACTTCCTCC AACAAGGCGC CCGCGTGCTG GGCCAGGAAT ACACGGTGTG GGCAGCCTTC
CTGGGAGCAA CCTTCATCAC CATGGCTACG CATGGCACGG ACCAGGACAT GGTGCAGCGC
ATGCTGGCGG CAAAAAACAG CAAGGCGGGA ACGCGTGCAG TCATCGTATC CGGCTTGCTG
GACTTCCCCA TCGTCATCAT TTTCCTCTTC ACGGGCATTC TTCTCTATGT CTTCTACCAA
TACAACCCGG CAAACCTCCC TGCAGACACG CCGCAGCTGC ATGTGTTTCC CTACTTCATC
ATCCATGAAC TTCCCAACGG CATCCGCGGG CTGCTGATCG CCGGACTTCT GGCGACGGCC
ATGGGCTCCC TCTCCACGGC TCTAAACTCC CTGGCGACTA CCGCCACCAA GGACTGGTAC
CAGGGGATAT TCAAACCGGA AGCCACGGAA CGGCAGCTGC TCCGGTGCGT GCGCTGGGGA
ACGGCCGTTT TCTCCCTGCT GCTCATCCTC GTGGGTTCCA TCACAGCGTG GTATGTGGTG
CATCACCCGG AAGTCCGCAT CATCCAAATA GCCCTGGGCA TTTTCGGCTA TACCTATGGC
TCCCTGCTTG GCATTTTCCT GTTGGGAATG CTGACCCGCA CCAGAGGCAG CGACTCAGGA
AACATCATTG CCATGGCGGC AGGATTCTTT GTCATCGCTT TACTGACGGA ACTCATCCCT
CTGCCCTCCG GCTGGCAACA ATATGTTCCG GAAATAGCAT TTCCGTGGCG CGTTACCATC
GGAACTCTGG TTACTTTTAC CGTCGGCTTC TGTTTCAGAA AACGCCGCCT TCCCATGCGC
TGA
 
Protein sequence
MLTDLFVIVI YFLAIFCIGI YAGRKQNSLT DYALGNRSLP WWAILASILA AEISAATFLG 
APGEGYHTRN FTYAQLCIGT ILGRIIVGRL FLKPYYDYKV VSIYEYLEKR FGLLTRRTAS
MVFLISRVLA SGTRLYFAGI LLVIAYQFLT GVTADADQIV LLYIAALVAI SVATTIYTAI
GGLKAVVWTD VLQAVVLGVS MLSALWVLFS HIPGGWSSIS AAMNGGDDWK FFSWGTGEGL
DFLQQGARVL GQEYTVWAAF LGATFITMAT HGTDQDMVQR MLAAKNSKAG TRAVIVSGLL
DFPIVIIFLF TGILLYVFYQ YNPANLPADT PQLHVFPYFI IHELPNGIRG LLIAGLLATA
MGSLSTALNS LATTATKDWY QGIFKPEATE RQLLRCVRWG TAVFSLLLIL VGSITAWYVV
HHPEVRIIQI ALGIFGYTYG SLLGIFLLGM LTRTRGSDSG NIIAMAAGFF VIALLTELIP
LPSGWQQYVP EIAFPWRVTI GTLVTFTVGF CFRKRRLPMR