Gene Amuc_1120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1120 
Symbol 
ID6273947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1340007 
End bp1342397 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content56% 
IMG OID642613171 
Producthypothetical protein 
Protein accessionYP_001877727 
Protein GI187735615 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTTCC AATCTCTATA TTTATCTGCC GTGGCGGTAA GTGCCGTTTC TTTCGGGTGG 
GGGGCTCTGG ATAAGCCTTC CGCCTCCAAT TTGATCTGGT CTGATGAACC GGCCGTAGTT
GTTTATCCGC AGGAAGACAA AAATTCCGAG GGCAGTTTTG GCAAGTACAG AAAGCCTGCC
TCCGTCTGGG AAGCGGAGGG GTATCCCATT GGCAATGGTC GCGTTGGAGC CATGATTTTC
AGCGCTCCCG GCCGCGAGCG GCTGGCCCTG AATGAAATCA GCCTTTGGTC CGGAGGTGCC
AATCCGGGGG GAGGGTACGG TTACGGGCCT GATGCCGGAA CGAACCAGTT CGGCAACTAT
CTTCCCTTTG GGGATTTGTT TGTGGACTTT AAAAAAGGCG ACCAACCGGC TTCCCTGTCT
GTGGAAGATT TTACGCGCTC CCTGGATCTC CGGGACGGCA TTCATAAAGT GAATTACAAG
GCGGACGGCG TAACGTATGA CCGGGAGGCA TTCTCCAGCA CGCCTGCCAA CGTCCTGGTG
CTGAATTATA AAGCCAGCAA ACCCGGCCAA TTCAGCGCGG ATTTTTCCGT TAACAGCCAG
CTTGGAGCAG ATATTTCCGC CAAGGGATCC GTCATCACCT GGAAGGGGAT GTTGAAAAAC
GGCATGAATT ATGAAGGCCG CGTTTTGATC CGTCCCAAAG GCGGTACGCT TTCTGCCTCG
GGAGATAAAA TTTCCGTGAA AAATGCGGAT TCCTGCATGG TCGTCATCGC CATGGAGACG
GATTACCTGA TGGATTATAA AAAGGACTGG AAGGGTGAAT CTCCCTCCAG GAAGCTGGAC
CGTTATGCGG CCAAAGCCGC TTCTGCGGAT TATGCCGCCC TGAAACAGGC CCACATTTCC
CAGTACAAGT CCATGTTTGA CCGGGTGAAG GTCAACTTCG GAAAAACGGA GGAGGATGTA
GCCAAGCTGC CTACGCCAAA ACGTCTGGAG GCCTATAAGA AAAATCCGGC AGACCCCGAT
TTGGAGGAAA CCATGTTCCA GTTTGGCCGA TATCTGCTGT TGTCCAGTTC CCGGCCCGGC
ACGCTACCGG CCAACCTGCA AGGGTTGTGG AACGATTATG TCAAACCGCC GTGGGCCTGC
GACTACCATA ACAACATCAA CGTCCAGATG GCGTATTGGG GGGCGGAACC CGCCAATCTT
TCCGAATGCC ATGAGGCCCT GGTCAATTAT GTGGAGGCAA TGGCCCCCGG CTGCCGGGAC
GCTTCCCAGG CGAACAAGGG GTTCAATACC AAGGACGGTA AACCCGTGCG CGGCTGGACG
GTGCGCACCT CCCAGAATAT CTTCGGTGGC AACGGCTGGC AGTGGAACAT TCCCGGTGCG
GCCTGGTATG CGCTGCATAT ATGGGAACAT TATGCATTTA CCGGTGATAG AAAGTATCTG
GAAAAACAGG CGTATCCCCT GATGAAGGAA ATCTGTCATT TCTGGGAAGA CCATTTGAAG
GAACTGGGAG CCGGGGGCGA AGGGTTCAAG ACAAACGGCA AGGATCCGAG CGAAGAGGAG
AAGAAGGATC TGGCCGATGT GAAAGCCGGA ACTCTGGTGG CTCCCAATGG CTGGTCTCCC
GAACATGGGC CGCGTGAAGA CGGTGTGATG CATGACCAGC AGCTCATTGC GGAGCTTTTC
TCCAATACCA TCAAAGCCGC CCGCATTCTG GGCAAGGACG CCGCCTGGGC CAAGAGCCTG
GAGGGCAAGC TGAAAAGGCT GGCCGGCAAC AAGATAGGCA AGGAAGGGAA TCTTCAGGAA
TGGATGATTG ACCGCATTCC CAAGACGGAC CACCGCCATA CGTCCCACCT TTTTGCCGTT
TTCCCCGGCA ACCAGATCAG CAAGCTCAAG ACGCCCAAGC TGGCGGAAGC CGCCCGCCTT
TCCTTGGAAT GGCGCGGCAC GACCGGAGAC AGCCGCCGTT CCTGGACGTG GCCGTGGCGC
ACGGCTCTGT GGGCCCGCCT GGGCGAGGGG AACAAGGCTC ATGAAATGGT TCAGGGACTC
TTGAAATTCA ACACTTTGCC GAATATGCTG ACTACTCATC CCCCTATGCA GATGGACGGC
AACTTCGGCA TTGTAGGCGG CATTTGTGAA ATGTTGGTGC AATCCCATGC CGGGGGCCTG
GACATCATGC CCTCTCCCGT GGAAGCGTGG CCTGAAGGTT CCGTGAAAGG GCTGAAAGCC
CGGGGCAACG TCACGGTGGA TTTTTCCTGG AAGGACGGCA AGGTAAGCAA CGTGAAGTTG
TATTCCGCCC AACCCAAGGT GTTGCCCGTG CGCGTTAACG GGAAGATGAC CCGCATGAAA
ACCCTGCCGC TGAAATCGGG AGCGGGATCT TCGCAGCCCG CGGCCAGGTA A
 
Protein sequence
MHFQSLYLSA VAVSAVSFGW GALDKPSASN LIWSDEPAVV VYPQEDKNSE GSFGKYRKPA 
SVWEAEGYPI GNGRVGAMIF SAPGRERLAL NEISLWSGGA NPGGGYGYGP DAGTNQFGNY
LPFGDLFVDF KKGDQPASLS VEDFTRSLDL RDGIHKVNYK ADGVTYDREA FSSTPANVLV
LNYKASKPGQ FSADFSVNSQ LGADISAKGS VITWKGMLKN GMNYEGRVLI RPKGGTLSAS
GDKISVKNAD SCMVVIAMET DYLMDYKKDW KGESPSRKLD RYAAKAASAD YAALKQAHIS
QYKSMFDRVK VNFGKTEEDV AKLPTPKRLE AYKKNPADPD LEETMFQFGR YLLLSSSRPG
TLPANLQGLW NDYVKPPWAC DYHNNINVQM AYWGAEPANL SECHEALVNY VEAMAPGCRD
ASQANKGFNT KDGKPVRGWT VRTSQNIFGG NGWQWNIPGA AWYALHIWEH YAFTGDRKYL
EKQAYPLMKE ICHFWEDHLK ELGAGGEGFK TNGKDPSEEE KKDLADVKAG TLVAPNGWSP
EHGPREDGVM HDQQLIAELF SNTIKAARIL GKDAAWAKSL EGKLKRLAGN KIGKEGNLQE
WMIDRIPKTD HRHTSHLFAV FPGNQISKLK TPKLAEAARL SLEWRGTTGD SRRSWTWPWR
TALWARLGEG NKAHEMVQGL LKFNTLPNML TTHPPMQMDG NFGIVGGICE MLVQSHAGGL
DIMPSPVEAW PEGSVKGLKA RGNVTVDFSW KDGKVSNVKL YSAQPKVLPV RVNGKMTRMK
TLPLKSGAGS SQPAAR