Gene Amuc_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2043 
Symbol 
ID6274753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2484591 
End bp2486024 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content61% 
IMG OID642614104 
ProductRND efflux system, outer membrane lipoprotein, NodT family 
Protein accessionYP_001878634 
Protein GI187736522 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01845] efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.186335 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACAA ACCCATACAT CACACGCGCC ATCCTGCTGG GAACGGCTTT TGCCGCCTCC 
TCCTGCATGA TGGGCCCCGA CTTCAAACCG GTGGACATGC CCATGCCTGC GGCATTCAGG
GGAGCCCCTG CAGCGACGGA ATCCATTGCG GACCTTCCCT GGTGGAAAGT TTTCAAAAAC
AAGGACCTTC AGGACCTCCT GACGGACACC TACAATAACA ACCGCGATTT GAAAGCTACC
ATGGCGCGCG TGGAAAAGGC ACGCCAGTAC ATCACCATCA CGGAGGCCCC GCTCTTCCCG
TGGGCGGATT ATTCCGGCTC CGTCAGCAAA GGCTCCAACT ATACCGGCGG CAGCATCGCC
CAGACTACCG GAACCACACT GACGCCTGGA GCGATTGATG CCGGCATTTC CTGGGAACTG
GATATCTGGG GCAAAACGCG CCGGATGACG GAAGCGGCCC GCGCGGATTA TCTGGCTTCC
GACGAAGGCC AGCGCGCGCT CATGCTTTCC CTGCTCCGCC AGGTGGCGGA CTCCTACCTC
CAGCTCCTCC AGCTGGACGA ACAGCTTGCC ATCGTGCAGA AATCCGTGGA ATCCTATTCC
GAAAGCCTGC GCCTGTTTGA CGAACAGCTT GAAGGCCAGG TAGGCGACAG GCTTCAGGTG
GCTTCCGCCA AGGCGGCTCT GGCCTCCTCC CAGGCCCAGA TTCCCGCCAT TCAGGTGCAA
ATTGCCAATC TGGAAAATGC AGTCTCCGTC CTGGCCGGAC GCGCTCCCGG CCATATCCGC
CGTTCCGGCA GCACCCGGGA CATCGCCTAT AACGTCAAGG TTCCCGCCGG CATTCCGGCC
TACATCCTTT CCAGAAGGCC TGACGTCCGC CAGAGCGAAT ACCAGCTGCG CGCCGCCAAT
GCGGAAGTGG GCGTGGCCAT TGCCAATTAC TTCCCGACCA TCTCCCTGAC GGCGGCGGGC
GGCCTCGCCT CCGCGGATCT GCGCCACGTG CAGGGGCGCC GCGGCGGCTG GGGCCTGGGA
GCCAATCTGA CCGGCCCCCT CTTCCAGGCG GGCAAGCTGA CGGCTTCCGA AAAGGCGGCC
AAGGCCGAAT TCCTGGCGGC CAGAAACGAT TATGAACAAA CGGTTCTCAA TGCCCTGGCG
GAAGTTTCCA GCACGCTTAT CCAGAGGGAC AAGCTGCGCA GCATCACCGC CACCCAGTCC
GAGGCCGTGG AAGCCTACCA CACGGCGGTG AAACTCTCCT TTGAACGCTA CCGCACGGGC
CTTTCCAATT ACATTGAAGT ATTGTACGCC CAGCAGAACC TGTACCCCGC TCAGATTCAG
CTTTCACAGT ATTATTACCA GCACGCCAGC ACGCTGGTTT CCCTGTACAC GGCCCTGGGC
GGCGGCTGGA ACATGAGCCA CAAGGCTATC ATGGACGGCC CTGCCAGGCA GTAA
 
Protein sequence
MRTNPYITRA ILLGTAFAAS SCMMGPDFKP VDMPMPAAFR GAPAATESIA DLPWWKVFKN 
KDLQDLLTDT YNNNRDLKAT MARVEKARQY ITITEAPLFP WADYSGSVSK GSNYTGGSIA
QTTGTTLTPG AIDAGISWEL DIWGKTRRMT EAARADYLAS DEGQRALMLS LLRQVADSYL
QLLQLDEQLA IVQKSVESYS ESLRLFDEQL EGQVGDRLQV ASAKAALASS QAQIPAIQVQ
IANLENAVSV LAGRAPGHIR RSGSTRDIAY NVKVPAGIPA YILSRRPDVR QSEYQLRAAN
AEVGVAIANY FPTISLTAAG GLASADLRHV QGRRGGWGLG ANLTGPLFQA GKLTASEKAA
KAEFLAARND YEQTVLNALA EVSSTLIQRD KLRSITATQS EAVEAYHTAV KLSFERYRTG
LSNYIEVLYA QQNLYPAQIQ LSQYYYQHAS TLVSLYTALG GGWNMSHKAI MDGPARQ