Gene Amuc_0970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0970 
Symbol 
ID6274183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1157874 
End bp1160171 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content59% 
IMG OID642613024 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_001877583 
Protein GI187735471 
COG category[R] General function prediction only 
COG ID[COG4146] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000363528 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.258421 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAA CACTAGCAAC CATATTCGGG GCGCTCGCCC TGATGACGTG GGGCGCAGCT 
GCCCAGGAGG GTCCCGTACC TGCAACGGCG GCTGATGCGC AGCCCGCCTT GGCCGCTCCG
GCCGCGATTC CCGCTGTTGA AGTCCAGCCG GCAGCGGAAA TTCAGCCGGC GGCTCCCGCC
GCAGAGGCAG TACAGGCTGA ACAGACCGCC TCTGCGGAGC CTAAAACGGC CCCTCTTTCC
ATGGTGGAAG TCATTCTCTT CTGCGTGGTC GTGGTGGGCG TGATCGCCCT GGGCATCTGG
AAGAGCCGTG ACCCCGAAGA AACGGAAGAG GAAAAGAAGG CCAAGGGCGC TTCAGACTAT
TTCCTTGCCG GCCGCGGTTT GACCTGGTGG CTGGTGGGTT TCTCCCTGAT TGCCGCCAAC
ATTTCCACGG AACAGTTTGT GGGGATGTCC GGCAAATCGG CCAACTGGGT GGGCATGGCC
ATTGCCGGGT ATGAATGGCT GGCCGCTATC ACGCTGGTGG TTGTAGCCTT CTGCTTCCTC
CCCAAGCTGT TGAAAGGCGG CGTATATACG ATTCCCGAAT TCCTGGAATA CCGGTATAAC
ACGCTGGCCC GTTCCCTGAT GGCCATTGCT ACGCTGCTGA TTCTGGTAGG CGTGCCGACT
GCGGGCGTGA TTTACGCCGG CGCCAAGGTG ATTTCCGTGT TCTTCACGGG ATACTCTGCC
ATGGGCATTG ACTTCGGCGA CATCACCGTC GGCTGCGTTA TCATTGCCTT CTGCGCCACG
GTATACGTGT TCGTGGGCGG GTTGAAAGCC TGCGCCTGGA CGGACCTGTT CTGGGGCGCC
GCCCTGATTG TGGGCGGCGG CGTGGTGGCC TATTTTGCCC TGACGGAGCT CAGCGGCGCA
GACCCGAACC ATTTGATTCA ATCCGCCGCC GCCAATTCCG GCGCGACGGT CGCCTCCCTG
GGGAATCCTT CGGACAGCCT CTGGCATGGG GTGACGCGCT TTTTTGAGCT GAACTCCGGT
GACGCGGCCA GCGGCGTAAA TACCGTGGGC GGCAAGCTGC ACATGATCCG TCCTGCGGAT
GATGCGGAAA TCCCGTGGAC GGCTCTTTGC CTGGGCCTGT GGATTCCCAA CTTCTTTTAC
TGGGGCCTCA ACCAGTACAT CATGCAGCGT ACGCTGGCTT CCAAATCCCT GGCGGAAGGT
CAGATGGGCA TTGTGTTCGC CGCATTCCTC AAGCTCATCA TCCCATTCGT GGTGGTGGTC
CCCGGCATCC TGGCCTATAA CCTGTACCGC AATGACCTGA AGGAACAGGC GGAAGTAAAA
TACGCGGCGG AAATCCGTAA GACGGAAGAT CCCGCCGCAG TCAAGGGCCG CCCCGTCATC
TACAAGCTTA CGGACAGCTT CCTGGTGGAA AACGTGGAGG AAGGCTGTGC CCATGCCATT
CATAACGCGG AAGTAATGAA GGTGGGCGAA GATGTTATGG CCAATTTGAA ACAGGCCTGC
GCCGATTTGA AGGCGGATGC CGCCAACGAC CAGACTACGC TGGCGGAACG CGCCCCGTTC
GTGGAAAAAA TCGCTTCCCT TAACAACAAA ATCATCAAGC CGGCCGTGGA CAACTCGGAT
AACTACTATC TGACGGATAC GCTGGTAGGC TTTGACTATG ACTCCGCCTT CGGCACGTTG
ATCAGGAAGT TGCTTCCCGG CACGGGCTGG ACATGGTTTG TGCTTGCGGC CCTCTTTGGA
GCGGTGGTGT CTTCCCTGGC ATCCATGTTG AATTCCGCGT CCACCATCTT TACGATGGAT
ATTTACAACA AGCTGCGCAA AAATGCGGGG CCCACGGAGC TGGTTACCGT CGGCAAGATT
GGTTTGCTGG TGTGCGCCGT GATCGCCCTG ACCATTGCTC CGTTCCTGGA CAGCCCGGCC
TTTGGCGGCA TCTTCAACTT CATTCAGGAA TTCCAGGGCT TCCTGAGCCC GGGCGCCCTG
TGCGTGTTCC TCTTCGGCTT CTTTGTGCCC AAGTGCCCGC GCATCTTCGG TTGGCTGGGT
ATCGTCATTA ATGCCCTCCT GTACGGAATC CTGAAGGTAT GGCAGCCGGA AATGGCCTTC
CTGAACCGCA TGGCCGTGTG TTTCATCACG GTAGTGGTCA TCGGCTTCAT CTTCACGGCG
GTGAACGCCG CCCGCGGCGG ACAGCCTATC GTGCTGCCCG ACAGGGGCGT GGTTGCCCTT
CAGTCCTCTT CACGGGCTAA AATCTTCGGA TGGCTCGTGG TTGCCGCGAC GGTTGCCCTG
TACATCATCT TCTGGTAA
 
Protein sequence
MMKTLATIFG ALALMTWGAA AQEGPVPATA ADAQPALAAP AAIPAVEVQP AAEIQPAAPA 
AEAVQAEQTA SAEPKTAPLS MVEVILFCVV VVGVIALGIW KSRDPEETEE EKKAKGASDY
FLAGRGLTWW LVGFSLIAAN ISTEQFVGMS GKSANWVGMA IAGYEWLAAI TLVVVAFCFL
PKLLKGGVYT IPEFLEYRYN TLARSLMAIA TLLILVGVPT AGVIYAGAKV ISVFFTGYSA
MGIDFGDITV GCVIIAFCAT VYVFVGGLKA CAWTDLFWGA ALIVGGGVVA YFALTELSGA
DPNHLIQSAA ANSGATVASL GNPSDSLWHG VTRFFELNSG DAASGVNTVG GKLHMIRPAD
DAEIPWTALC LGLWIPNFFY WGLNQYIMQR TLASKSLAEG QMGIVFAAFL KLIIPFVVVV
PGILAYNLYR NDLKEQAEVK YAAEIRKTED PAAVKGRPVI YKLTDSFLVE NVEEGCAHAI
HNAEVMKVGE DVMANLKQAC ADLKADAAND QTTLAERAPF VEKIASLNNK IIKPAVDNSD
NYYLTDTLVG FDYDSAFGTL IRKLLPGTGW TWFVLAALFG AVVSSLASML NSASTIFTMD
IYNKLRKNAG PTELVTVGKI GLLVCAVIAL TIAPFLDSPA FGGIFNFIQE FQGFLSPGAL
CVFLFGFFVP KCPRIFGWLG IVINALLYGI LKVWQPEMAF LNRMAVCFIT VVVIGFIFTA
VNAARGGQPI VLPDRGVVAL QSSSRAKIFG WLVVAATVAL YIIFW