Gene Amuc_1260 details


Gene Information

Locus tag: Amuc_1260
Symbol:
ID: 6275415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1520651
End bp: 1522081
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 49%
IMG OID: 642613317
Product: glycoside hydrolase family 37
Protein accession: YP_001877866
Protein GI: 187735754
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1626] Neutral trehalase
TIGRFAM ID:
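
The Start bp, End bp, Gene Length, and Protein Length values in this record are consistent with one another. The short Python sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. Both assumptions match the reported numbers but are not stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1520651, 1522081)
print(length)                  # 1431
print(protein_length(length))  # 476
```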


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGGG ATTGGAAAAA AGAGATTCCC CTGCCAGTAT ATCCAAACCG GGAAATGACG 
GAACTTTATC ATCAAACATG GGAAATTGCG GCTGGGCGCG TCAGGAAAGG GCCGGAAGGG
CTCCCCGCGT CTCCCTATAT GGATGAAAAT TGCTACGAAG ATCAAATCTG GATATGGGAC
ACCTGTTTCA TGGTGCTTTT TGCCAAGTAT GCGCCCAGGG CGTTTCCCGG AATAGAAAGT
CTGGATAATC TTTACAAACC TATCCATGAA AAAGCCGCTA CGCCCCTTAG AATCCATTTA
GTGGATAATC CCCCGCTGTT TGCTTGGGTG GAAAAGGAAT ATTTTGATTT CACAGGAGAT
AAGAGGCGGC TTAATCATCT TCTTAATGAA AAGCGGTATT TGCAGAAGCA TTTCAAATGG
TTTGCCCGGG CTAAGGCTGG TGAACGGTTT GAATGTTCCC CCCAGCGTAT TTACTTGAAT
TCCATTGGGG ATGATGGCTT TACCTGGACG GGGAGAGCGA GCGGTATGGA CAATACTCCC
CGCGGGCGTG ATGCCGGAGG ATACCATAAG GTGCTATGGG TGGATGCCAT TTCCCAGCAG
GCTCTTAGCG CCCACTGCAT TGCTACCATG GAACAGGCTT TGGGAAATGA GAATGAAGCA
AGAAAATGGA ACGCTGAATA TGAAGCGCTT AAGAAAAAGA TCAACCATCT TTACTGGGAT
GAGCGGGATG GATTTTATTA CGATGTCACC ATTGCGGACA AACAGCCCTG CCGCGTTAAA
ACCATTGCTT CCTATTGGCC CCTTCTGGCC CGGATTGCGT CCAGGGAACA GGCGCGGAGC
ATGGTAAATC ATCTGATGAA TCCCGGGGAA TTCGGAGGCA GTTATCCTAC TCCTTCCCTG
GCCCGCTCGG ATAAGGATTA TCATCATCAA ACCGGGGATT ACTGGCGGGG AGGAATTTGG
CTGCCGACGA CATATATGGC GATTAAGGCC ATTGAAAAGT ATGGCTACCA TGAGGAGGCC
GATGCTATTG CCGAGAAGGT TATCAACCAG CAGCTTGCCG CTTACAGGAA TATGGAACCG
CATACTGTCT GGGAGTGCTA TAGCCCAAGC GGAGATGCCC CCTCCACAGA ACACGGACGC
CGTGTAAGAC CGGAATTTTG CGGCTGGTCA GCCCTGGGGC CGATTGCGTT GTTTATTGAA
AATGTGCTGG GATTTAAGAA AGTGTCTGCC GCCGGAAAGG AAGTCCGGTG GAGGTTGAAA
AAAAACAAGG GCCGCCATGG AATCAGGAAT TTGAGGTTTG GCGATATTGT AACCGATATT
GTTTTTGATG GTAAAGGCAC GGTGTCGGTC ACGTCGAATG CTTCTTACTC TTTAATCATT
AATGGCAATA CTTATTCAGT AAGGAAGGGG GATACTGAAA TTAAGCTGTA A
 
Protein sequence
MNRDWKKEIP LPVYPNREMT ELYHQTWEIA AGRVRKGPEG LPASPYMDEN CYEDQIWIWD 
TCFMVLFAKY APRAFPGIES LDNLYKPIHE KAATPLRIHL VDNPPLFAWV EKEYFDFTGD
KRRLNHLLNE KRYLQKHFKW FARAKAGERF ECSPQRIYLN SIGDDGFTWT GRASGMDNTP
RGRDAGGYHK VLWVDAISQQ ALSAHCIATM EQALGNENEA RKWNAEYEAL KKKINHLYWD
ERDGFYYDVT IADKQPCRVK TIASYWPLLA RIASREQARS MVNHLMNPGE FGGSYPTPSL
ARSDKDYHHQ TGDYWRGGIW LPTTYMAIKA IEKYGYHEEA DAIAEKVINQ QLAAYRNMEP
HTVWECYSPS GDAPSTEHGR RVRPEFCGWS ALGPIALFIE NVLGFKKVSA AGKEVRWRLK
KNKGRHGIRN LRFGDIVTDI VFDGKGTVSV TSNASYSLII NGNTYSVRKG DTEIKL
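
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch of how the two relate, the Python below strips the block spacing and translates DNA codon by codon. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which do not matter here because this gene begins with ATG). The helper names `normalize` and `translate` are hypothetical, not part of any database tool.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def normalize(blocked: str) -> str:
    """Remove the spaces and line breaks used for ten-character blocks."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = normalize(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGAACCGGG ATTGGAAAAA AGAGATTCCC CTGCCAGTAT ATCCAAACCG GGAAATGACG"
print(translate(first_line))  # MNRDWKKEIPLPVYPNREMT
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` in the same way would allow the whole protein to be compared.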