Gene Amuc_2000 details

Gene Information

Locus tag: Amuc_2000
Symbol:
ID: 6274785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2429606
End bp: 2430790
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 59%
IMG OID: 642614060
Product: major facilitator superfamily MFS_1
Protein accession: YP_001878592
Protein GI: 187736480
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2807] Cyanate permease
TIGRFAM ID:
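
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming Start bp and End bp are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue (both assumptions are consistent with the values in this record; the variable names are illustrative):

```python
# Values copied from the record above.
start_bp = 2429606
end_bp = 2430790

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1185
print(protein_length)  # 394
```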


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.121235
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAAGG CAGGGACTTC CGGAGGCATC AGGGGAAAGA TGCTGCTGTT TGTCGCCTTT 
TTAATCGTGT GCTTCAATTT GCGTACCGGG TTTGATTCTC CCGATCCGCT GCTGGGAACG
ATTGAAAGGG ATATGGGGCT TTCCCTGGAA AATTCAGGGC TGTTCGCCCT GTTGCCCGTC
TTTGTGCTGG GGGTGGCGGC TCCCATTTCT CCACGCGTGG CGCGCTGGAT GACGCCGTGG
AAAATCATTT TCTGGTTCCA GCTTCTGGCG GTGGCCGGAA TTTTCTGGCG CAGCTGGGAC
GGCGTAGCGG GGTTATACGG CGGCATGGTT CTGATGGGGC TGGGCATGGG AATTGCCGGG
GCGGCCATTC CCGGACTGAT TAAGCACCAG TTTCCTGACC ATGCTTCCGC CATGATGGGG
GTTTACAGCG CCATGATCGG CGTGGGCAGC GCTGCGGCTT CCGGATTGTC CGTTCCCATT
TCCAACATGC TGGGAGGCTG GCGTTTCGGC CTGGGGGTCT GGATTATCCC CATTCTGCTG
GGAATGCTGG TATGGGGCGC TTATTTTCTC AGGCACCCGG CCGGCGTGTT TCAAAGCGAT
CCGGCGGCTT CCGGTCACAA CCTGCTGCGC AGCGGCAAAG CATGGCAGGT TACCGTTTTT
TACTTGAGCC GCGTTGGGGC GGCCTATTTC TTTTATACTT GGATTCCCAT CTTTCTGAGG
CAGCGGGGCA TGAGCTATGA GGATGCCGGG TTCATCCTTT CCGTGGCCAT GTTCGCCCAG
CTGCCCGCCA CGCTTTCCGC ACACGTGCTG GAAAAGGCTA CGGGCGGCCG GGGGCTCCTC
ATCGTGATGG CCATGGCATT GGCGGCCCTT TCCTGCTGGG GCATCCTGTA TCTCCCTCCG
GACTGGGCTC TCTGGATGGC TATTCTGTTC GGTCTGGCGA CGGGAACCGT ATTCAGCCGC
GGAATGGCGC TGATGGTGGA ACGGGCCCGG ACCCCTTCCG AATCAATCAG GCTTTCCGGC
ATGTCCCAGG GAATAGGCTT TACCATGGGC GCCCTGTTGA GCCTGCTGTT TACCTCCTTC
CTGCATCAGG GCGGCTCCTT TTTGCCGTTC TGCCTGGTTT ATACCTTTTT CTGCGTGCTC
GGCATGGTTT CCGGCCGCAT GTCCGCCCGG CCGGGATACG TGTAA
 
Protein sequence
MDKAGTSGGI RGKMLLFVAF LIVCFNLRTG FDSPDPLLGT IERDMGLSLE NSGLFALLPV 
FVLGVAAPIS PRVARWMTPW KIIFWFQLLA VAGIFWRSWD GVAGLYGGMV LMGLGMGIAG
AAIPGLIKHQ FPDHASAMMG VYSAMIGVGS AAASGLSVPI SNMLGGWRFG LGVWIIPILL
GMLVWGAYFL RHPAGVFQSD PAASGHNLLR SGKAWQVTVF YLSRVGAAYF FYTWIPIFLR
QRGMSYEDAG FILSVAMFAQ LPATLSAHVL EKATGGRGLL IVMAMALAAL SCWGILYLPP
DWALWMAILF GLATGTVFSR GMALMVERAR TPSESIRLSG MSQGIGFTMG ALLSLLFTSF
LHQGGSFLPF CLVYTFFCVL GMVSGRMSAR PGYV
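
The protein sequence corresponds to the gene sequence translated under translation table 11. A minimal sketch in plain Python, assuming the sequence is supplied as the space-grouped blocks shown above. The function names (`clean`, `translate`, `gc_fraction`) are illustrative and do not belong to any existing tool. Table 11 assigns the same amino acids to codons as the standard code, so a single 64-letter lookup string in TCAG order is enough here.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(raw):
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Raises ValueError on any base other than T, C, A or G.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * a + 4 * b + c]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = clean("ATGGATAAGG CAGGGACTTC CGGAGGCATC")
print(translate(first_blocks))  # MDKAGTSGGI
```

Passing the full gene sequence through `clean` and `translate` gives a string that can be compared with the protein sequence listed above, and `gc_fraction` gives a value that can be compared with the GC content field.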