Gene Amuc_0840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0840 
Symbol 
ID6274329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp998943 
End bp1000622 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content58% 
IMG OID642612895 
Productsulfate transporter 
Protein accessionYP_001877454 
Protein GI187735342 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00377] anti-anti-sigma factor
[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAAAC CTGCTCTTCT TTCCTCTCTC AAAACTTATA CCAAACAAAC CTTTCTGGCG 
GACCTTTTTG CGGGACTGAC CGTCGGTGTA GTAGCCATTC CGCTGGCCAT GGCCTTTGCC
ATCGCATGCG GACTCTCCCC AACCCAGGGC CTCATCACCG CCATTGTGGC CGGGTTCCTC
ATCTCCCTGT TCAGCGGAAG CAAATATCAA ATAGGCGGCC CCACCGGAGC CTTCGTGATC
ATTATCATGG GCGTCCTGGA GCAATACCAC GCATCCGGTC TGCTGGTCTG CACATTGATG
GCGGGCCTCT TCCTCATCAT CTTTGGGTTC TGCCGCATGG GGGCGCTCAT CCGCTTTATT
CCATTCCCTG TCACCACAGG GTTCACCTCC GGCATCGCCG TGGTAATCTT TTCCACGCAA
ATTAAAGACA TCTTCGGCCT CACCATCACG GAAAAAATTC CCGGAGAGTT CATTGAAAAA
TGGGCGTGTT ACTTCCATTA CTTCCACACC ATCAACTGGG CGGCGCTGGG GCTGGCCGCC
GGCACCGTAA TCATTACCCT GCTGAGCCGC CGCTTCTGGC CCAGAATACC GGCCATGCTA
GTGGGCATGC TGGGCATGAC GGCCGTTTCC GTGGCGTTTT CGTTGCCTGT GACAACCATC
GGGCAAGCCT TCGGCAGCCT CCCGAATACA CTCCCCCTGC CCTCCCTGCC CAGCATTGAC
TGGAGTACCC TGGGGGCGCT GACGGCCCCT GCTTTCACCA TCGCGCTGCT GGCGGCGATC
GAATCCCTGT TAAGCGCCTC CGTGGCGGAC GGCATGACCG GAGGGCGCCA CAAGCCCAAC
ATGGAGCTGA TTGCACAAGG CATCGGCAAC ATCGGCTCCG CCCTGTTTGG CGGCATTCCG
GCCACCGGAG CCATTGCCCG CACCGCCACT AACATCAAGG CTGGAGCTAA AAGCCCGGTT
TCCGGCATGA TTCACGCCCT GACCCTGCTA GCCATTCTGA TGGCCTTTGC CCACTATGCC
CAGCAGATTC CCCTGGCTGT CCTGGCGGGC ATTCTGACGG TAGTGTGCTA CAACATGAGT
GAAATACACA CGTTCAGCCG TCTGCTGAAA GGGCCCAGGC AGGATGCGGC GGTGCTGGTA
ATCACCTTCC TGCTGACCGT GTTTGTGGAC CTCGTTGTAG CCGTGGAAGT AGGCGTGGTG
CTGGCCGCCC TGCTCTTCAT GGGCCGCATG GCCCAAATCA GCGATGTTTC CGCCATCAAA
AACGAACTGC TGGAAAATGA TGAGGAAGAT GATGGAAACC GCTCTGCCGC CAAGCTGGAC
ATCCCGGAAG GTGTGGAAGT TTTCGACGTG AAAGGTCCCT TCTTCTTCGG TGCCGTGGAG
CAATTCAAGG ACCAGGTGCT GGAAACGCTG GAACATGATA CCAAGGTGGT TATCCTGCGC
ATGCGCCTGG TTCCCGCGCT GGACGCCACC GGCCTGAACG TCCTTTCCGA CTTCTGCCAC
CAGTGCCGGG AACACGGTTC CACCCTGCTG GTTTGCGGCG TGCAGCCCCA GCCTCTGGAC
GTCATCCGCC ACGCGCCCTT TTACCGGGAG CTGAAACGCT ACAATATCTG CGAGAATATT
GACGCCGCCC TGAACCGGGC CTGCAAAATC ATCAACGGCC CTGCGCCCAA ACACCTGTAA
 
Protein sequence
MFKPALLSSL KTYTKQTFLA DLFAGLTVGV VAIPLAMAFA IACGLSPTQG LITAIVAGFL 
ISLFSGSKYQ IGGPTGAFVI IIMGVLEQYH ASGLLVCTLM AGLFLIIFGF CRMGALIRFI
PFPVTTGFTS GIAVVIFSTQ IKDIFGLTIT EKIPGEFIEK WACYFHYFHT INWAALGLAA
GTVIITLLSR RFWPRIPAML VGMLGMTAVS VAFSLPVTTI GQAFGSLPNT LPLPSLPSID
WSTLGALTAP AFTIALLAAI ESLLSASVAD GMTGGRHKPN MELIAQGIGN IGSALFGGIP
ATGAIARTAT NIKAGAKSPV SGMIHALTLL AILMAFAHYA QQIPLAVLAG ILTVVCYNMS
EIHTFSRLLK GPRQDAAVLV ITFLLTVFVD LVVAVEVGVV LAALLFMGRM AQISDVSAIK
NELLENDEED DGNRSAAKLD IPEGVEVFDV KGPFFFGAVE QFKDQVLETL EHDTKVVILR
MRLVPALDAT GLNVLSDFCH QCREHGSTLL VCGVQPQPLD VIRHAPFYRE LKRYNICENI
DAALNRACKI INGPAPKHL