Gene Amuc_0714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0714 
Symbol 
ID6273871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp843134 
End bp844183 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content62% 
IMG OID642612766 
Producthypothetical protein 
Protein accessionYP_001877332 
Protein GI187735220 
COG category[S] Function unknown 
COG ID[COG4864] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0285775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACC TGCTGTCAGC AAACATTTCC AACGCAACCG CCTGGGGGAC CATTCTGGTC 
GTCGTCCTCG TCATCTTTCT GGTCATCTTT TTCGCCATCA TCGCCAAATT CTTCAAGACA
TGGCTGCGCG CCCGTCTGGC GAAAGCGCCC GTCTCCATGA GCAACATGCT GGGGATGTGG
CTCCGGAAAG TGCCCTATCC CCTGGTGGTG GACACCCGCA TTACGGCCGC CAAGGCGGGC
CTGGACATCA GCACGGATGA GCTGGAGGCC CACTTCCTGG CCGGCGGCGA CATCGTGGAC
TGCGTGCTTG CCCTGATTGC GGCGGAGAAA GCCGGCATTC CGCTGAGCTA CGACCGCGCC
TGCGCCATTG ACCTGGCCGT GAAAGGCACT TCCAAGACCG TGCTGGAAGC CGTGCGCACC
TCCATCAATC CCCGAGTCAT CGACTGCCCG AACCCCAGCT CCGGCCAAAC GCGCCTGACG
GCGGTGGCCC GTGACGGCAT TGCAGTGGCG GTGCGCGCCC GCGTAACGGT GCGCACCAAC
CTGGATCTCT TCGTAGGGGG CGCCACGGAG GAAACGGTGG TCGCCCGCGT CGGGGAAGGC
ATCGTCTCCG CCGTGGGTTC TGCCCCCTCC TACAAGGATG TTCTGGAAAA ACCGGAGGTG
ATCTCCCGTA CCGTAAGCGA CAAGGGGGTG GATGCCGCCA CGGCGTTTGA AGTGCTTTCC
ATTGACATTG CGGATGTGGA TGTAGCCGGC AATGTGGGCG CCCGCCTCCA GGCCGAGCAG
GCGGAAGCGG ACAAACAAAT CGCCCAGGCC AAAGCGGAAG TCCGCCGCGC CGCCGCCGTA
GCCACGGAAC AGGAAATGGC TGCAAAAACG CAGGAAATGC GCGCCAAGGT GGTGGAAGCG
GAAGCCCAGA TTCCCATGGC CATGGCGGAA GCCTTCCGCA ACGGCAATCT GGGAGTGCTG
GACTACGCCC GTTACCAAAA CGTGGTGGCT GACACCAAAA TGAGAGATTC CATCGCTCAA
CCGGATACGC CTCCCTCTTC CATCAAATAA
 
Protein sequence
MNNLLSANIS NATAWGTILV VVLVIFLVIF FAIIAKFFKT WLRARLAKAP VSMSNMLGMW 
LRKVPYPLVV DTRITAAKAG LDISTDELEA HFLAGGDIVD CVLALIAAEK AGIPLSYDRA
CAIDLAVKGT SKTVLEAVRT SINPRVIDCP NPSSGQTRLT AVARDGIAVA VRARVTVRTN
LDLFVGGATE ETVVARVGEG IVSAVGSAPS YKDVLEKPEV ISRTVSDKGV DAATAFEVLS
IDIADVDVAG NVGARLQAEQ AEADKQIAQA KAEVRRAAAV ATEQEMAAKT QEMRAKVVEA
EAQIPMAMAE AFRNGNLGVL DYARYQNVVA DTKMRDSIAQ PDTPPSSIK