Gene Amuc_2029 details


Gene Information

Locus tag: Amuc_2029
Symbol:
ID: 6275503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2462442
End bp: 2463578
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 56%
IMG OID: 642614090
Product: efflux transporter, RND family, MFP subunit
Protein accession: YP_001878620
Protein GI: 187736508
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
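
The length fields in this record follow from the coordinates. The sketch below is an illustrative check, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 2462442
end_bp = 2463578

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1137 378
```

Under these assumptions the result agrees with the reported gene length of 1137 bp and protein length of 378 aa.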


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.109803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.00040221
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAAAC ACACGCATCA TCGGAAGGCG CCCCTCCGCC TGCTGGCCGG TTCCGGCCTG 
CTTTTTTCCC TGTTTGTTCC GGGTTATTCC CAAGGAACGC CGGGAGGGGC TGCAGCCAAA
CCCAGTACCG TACTCGTCCA GAAAGCCGCC GCTATTGACA GCGCGGTGAA CAAGAAGTAC
ATCGGCCAGG TGGAAGCCAT TGACCGGGTG ACTGTGCAGC CCCGCGTCTC CGGCAACATC
GTAGCCACCC GTTTCCGGGA AGGAAAGGTC GTGAAGAAGG GGGATCTCCT GTTCGAAATT
GAGGATACGC GCTACAGGGC GGCTGTGGAG GAAGCGGTAG CCAAAAAGGC CCAGCTTGAA
GCCAAACTGC TTTACGCAAA AAATAGTTTT GAACGCTATA ACAGGTTGCT GGCCTCCAAA
TCCGTCTCCA TGGACACGGT GGAAAACGCC AAGAGCACCA TGCATGCCCT GGAAGCGGAA
ATCCAGTCCG CAAACGCCGC CATTACGGTA GCGAAGGACG ATCTTAACTA TACCAGGCTC
ACGGCTCCCA TCACGGGCCG CACAGGCCGC GTCACCTTTT CCACGGGCAA TTACATCACC
CCCACCTCCG GTTCCCTGGT GACCATTACA GGCATCGATG AAGTGTACGT GAAGTTCCCT
ATCAGCGAAC GCGATTTCCT TTCCCTGTTC GGTACCCAGG AAAATATGAA GAAAGACGCC
CTTGTGTCCG TCAACCTCGC CAACGGCAAG GCATACGACC AGCCGGGCAG GATTTTCATG
ACGGACAATA CCGTCCAGAC GACCACGGAC ACCCTGAATG TCTGGGCAAA ATTCCCCAAT
CCGGAAGACG TGCTGACGCC CGGCGGCGTA GTCACGGTTA ATCTTTCCAA GAAAAACGTG
GACCGCTTCC CGGCTGCCAA CATTTCCTCC GTGATGCACG ATGCCTACAA GAGCTACGTT
TACATCGTCA ACGACCAGGG CGTCGTGGAA CGCCGGGACG TTACGCTGGG CAACACGGTC
AATAATGAAC AGTGTTTCAG TTCCGGCGTT AAGGAAGGGG AAGTCGTCAT CATCGACGGC
ATGCACAAGG TGCGTCCCGG CGCCAAGGTG AATCCGGTAT ATTCCGTTCA AAACTGA
 
Protein sequence
MKKHTHHRKA PLRLLAGSGL LFSLFVPGYS QGTPGGAAAK PSTVLVQKAA AIDSAVNKKY 
IGQVEAIDRV TVQPRVSGNI VATRFREGKV VKKGDLLFEI EDTRYRAAVE EAVAKKAQLE
AKLLYAKNSF ERYNRLLASK SVSMDTVENA KSTMHALEAE IQSANAAITV AKDDLNYTRL
TAPITGRTGR VTFSTGNYIT PTSGSLVTIT GIDEVYVKFP ISERDFLSLF GTQENMKKDA
LVSVNLANGK AYDQPGRIFM TDNTVQTTTD TLNVWAKFPN PEDVLTPGGV VTVNLSKKNV
DRFPAANISS VMHDAYKSYV YIVNDQGVVE RRDVTLGNTV NNEQCFSSGV KEGEVVIIDG
MHKVRPGAKV NPVYSVQN
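
The gene and protein sequences can be cross-checked by translating the nucleotide blocks. The following is a minimal sketch, not the database's own tooling. It assumes the 10-character blocks are layout only, and it builds the codon table from the standard NCBI ordering (translation table 11 assigns the same amino acids to codons as the standard table; the two differ in permitted start codons). The helper names clean, gc_fraction and translate are hypothetical. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# Each full line holds 60 bases, so every line starts in frame.
first_line = "ATGAAAAAAC ACACGCATCA TCGGAAGGCG CCCCTCCGCC TGCTGGCCGG TTCCGGCCTG"
last_line = "ATGCACAAGG TGCGTCCCGG CGCCAAGGTG AATCCGGTAT ATTCCGTTCA AAACTGA"

print(translate(clean(first_line)))  # MKKHTHHRKAPLRLLAGSGL
print(translate(clean(last_line)))   # MHKVRPGAKVNPVYSVQN
print(round(gc_fraction(clean(first_line)), 3))
```

The two translated fragments match the start and end of the protein sequence listed above. Running clean and gc_fraction over the full gene sequence would give a figure to compare against the GC content reported in the record.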