Gene Amuc_0336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0336 
Symbol 
ID6274996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp394920 
End bp397142 
Gene Length2223 bp 
Protein Length740 aa 
Translation table11 
GC content56% 
IMG OID642612387 
ProductTonB-dependent receptor 
Protein accessionYP_001876956 
Protein GI187734844 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01783] TonB-dependent siderophore receptor 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.000135521 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTTTAT CCAACATGAA TACCCCCCGC CATCGCCGCC AGGGGCGCGG CCTGCACCGT 
GCTGCCGCCA TTGGGAGCCT GCTTGTTCTC GGACAATCTG CCTATGCCGA GACAAAGGCC
GCCGCCGCAG AGGAAGGCGT GCAGGAAATG CCGGAACGCG TGATGACCAT CCGTTCCAGG
TCCCTGCGGG CGGAAACGGT CAGTTCCGCT ACTCTCACGA ACATGAAGAC GGAGGAAGTT
CCGCAAACGG TAAACGTCAT TACAAGGGAC CTGATGGATT CCAAGGGGTC CGATTCCCTG
GTGGAAGCGC TGCGTATGGA TTCCTCCGTC AATACGGGTG GGGACATGCT TTTGTCCCGT
ACGGCGGACC AGTACACCAT CCGCGGCTTT GCCGGCAGTG ATGTTCAGAT CGGCAATATG
CCCCTCCCGC GAGGCATGGG GTATGGCATG GATACGTCCC TGATAGAAAA TATTGAGATT
GTCAAAGGCC CCATCGGTTC CATTTCCGGC GGTCAGACCA GTACGCTCGG GGCATACGGC
GCCGGCGGTT CCATCAACCT GATCCTGAAG GAGCCTGATT TCCTGGAACG GACGGAGTTG
ACGGCTTACG CCCGCCTTTC CCATCACGGG CAGAAATACC GCGCAACGAT TGACGATACC
CGGTACAGGG GGGATGAAAC GAATGGATTT GCCCTGCGCA CGGTGGTAGC CGCCGAGTAT
GAACGCCCGT TCTGGCTGAG CAACGGAGCC AACGGAGGCC AGAAGTACAC GGTGTCCCCC
ATTTTCCGCT GGCAGCATGA TTCCCGCACC AAGACGGTGC TGACGACCAG TTTCCAGTAT
CAGAATTCTC CGACGACCAT GGGCATCCCT GTGCTGGGCG GCCATTTTGT GGGGCCGTAT
GACGCTTGGT ACGGTTCTCC GAGCGGAAGG CTTAATTCCA AATCCCTGCT GGCGATGCTG
GATTTCGAAC GCAAGCTGGA AAAGATATGG ACGATCCGCA TTGGCGGCGG TATTGGGTAC
AGCGATGTGG ACTATAATGT CTGGGGCATT TCCTCTTCCG CTGGGAGAGG AACCAGCACG
GCGGATTATT ACAACCAGAT GATTGCTTCC GGAAAGGCCA AATATGAAGC CGCCTGGAGC
GACGAGTGGA ATATCAACTG GAATTTTTAT TCCAACGCAC TGGCGGAATT TAAGACCGGG
CAGGTGAAGC ATGAAGCCCT AATGGGCGTT TCTTACACGG GGAGCAGCAC TTATGGAGAC
GGTTCCAGCC TGGTGACAAA CGCCACCGCA AATACGAACG GCTATTTTTC CCTGTACAAT
CCTCCTCCTT TTTTCCCTGC AGGGCGCGAT TATTCCGGCG CCAATGCGAC GGATACCGTG
GTGCAGAGGG CTGGTTTTCT GCTGCAGGAT GTCCTCAGTT ATGGACAGTG GCGTTTTCTG
GCCGGCGTTC GCGGTGACGC GCATTTCAGC CTGGACAATA ATTATGCGTT TGCGTGGAGC
CCCCGCTTCG GAATTACCCG CATGTTCGGC GAACGCGTAG CGTTGTTCGC CAACGCAGCC
CGTACCTCCG CCCCCAACTT CGGGTATCTG GATGAAAACG GGAAGGAATT GACCGATTCC
TGGCGTACGG ACCAGATGGA ATTCGGGTTC CGGGTCAGCC CGGTTGATAA GGTGTGGTTT
TCCGCTTCCT GGTTTGACAT CATCCAGAAT AATACGCCTG TTGCCATAGA CGGATATACC
AACCGGTACT ATTCGGACGG CTCCAAGAGG GCGGAAGGCG TGGAACTTTC CCTGAACGGG
GAAATAACCA AGAATTGGAG TTCCTATCTT TCCTACACCT ACACCCGCAC CAAGAACCGG
ACCACCGGGG AAGTGTATCC CACGATTGCT CCGAATGCTC TGGCCCTCTG GCAGAAGTAC
CGCATTGACG GAGGGTTGCT GAACGGTACG GTGCTGGGCT TGGGCTACCG CTGCAAGGAT
TCCTATTACG CTACGTTCCG GGGCGCAAAA ATTGCGGACA ACTACACCAT CCCCTCCTAT
AGCGTATTCG ACTTTACGGT GGAAATCCCC CTGCCGGAAT CCAAATGGCT GAAAGATGCC
ACGCTGAGGC TGGCTGTGTA TAATATCTTC GATAAAAAGT ACGTGCAGTC CACCCGCCAT
GCCGTGCAGT GCACGGTGGG AGAGCCGCGC ACGTTTGAAG TAGGCCTGAA GACCACGTTT
TAA
 
Protein sequence
MRLSNMNTPR HRRQGRGLHR AAAIGSLLVL GQSAYAETKA AAAEEGVQEM PERVMTIRSR 
SLRAETVSSA TLTNMKTEEV PQTVNVITRD LMDSKGSDSL VEALRMDSSV NTGGDMLLSR
TADQYTIRGF AGSDVQIGNM PLPRGMGYGM DTSLIENIEI VKGPIGSISG GQTSTLGAYG
AGGSINLILK EPDFLERTEL TAYARLSHHG QKYRATIDDT RYRGDETNGF ALRTVVAAEY
ERPFWLSNGA NGGQKYTVSP IFRWQHDSRT KTVLTTSFQY QNSPTTMGIP VLGGHFVGPY
DAWYGSPSGR LNSKSLLAML DFERKLEKIW TIRIGGGIGY SDVDYNVWGI SSSAGRGTST
ADYYNQMIAS GKAKYEAAWS DEWNINWNFY SNALAEFKTG QVKHEALMGV SYTGSSTYGD
GSSLVTNATA NTNGYFSLYN PPPFFPAGRD YSGANATDTV VQRAGFLLQD VLSYGQWRFL
AGVRGDAHFS LDNNYAFAWS PRFGITRMFG ERVALFANAA RTSAPNFGYL DENGKELTDS
WRTDQMEFGF RVSPVDKVWF SASWFDIIQN NTPVAIDGYT NRYYSDGSKR AEGVELSLNG
EITKNWSSYL SYTYTRTKNR TTGEVYPTIA PNALALWQKY RIDGGLLNGT VLGLGYRCKD
SYYATFRGAK IADNYTIPSY SVFDFTVEIP LPESKWLKDA TLRLAVYNIF DKKYVQSTRH
AVQCTVGEPR TFEVGLKTTF