Gene Amuc_1700 details


Gene Information

Locus tag: Amuc_1700
Symbol:
ID: 6274086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2065567
End bp: 2066790
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 57%
IMG OID: 642613759
Product: tryptophan synthase subunit beta
Protein accession: YP_001878299
Protein GI: 187736187
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
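
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about how this record is laid out, not something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2065567
end_bp = 2066790
gene_length_bp = 1224
protein_length_aa = 407

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # 1224 0 407
```

Under those assumptions the span matches the listed gene length, divides evenly into codons, and gives the listed protein length.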


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.605674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0000000413571
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATCTCC ATACTTATTT ACGCAATTTT CCAGACGCCC AGGGCCGTTT TGGCGAATAC 
GGCGGCGTTT ACCTGCCGGA CGAGCTGGTT CCCGCTTTTG AAGAAATTAC GGAAGCCTAC
CAGACGATAG CCCACTCCGC CCAGTTCATT AATGAATTGC GGCGCATCCG CAAACAGTTT
CAGGGACGTC CCACCCCAGT TTACCACTGC GAACGCCTTT CCCGCCACCT GGGCACCTCT
CAAATTTACC TGAAGAGGGA AGATTTGAAC CATACAGGCG CCCATAAGCT CAACCATTGC
ATGGGGGAAG GCCTTCTGGC CAAATACATG GGCAAGAAGC GCATTATCGC GGAAACGGGC
GCAGGACAGC ATGGCGTGGC GCTGGCTACA GCCGCCGCCT TCTTCGGCCT GGAATGCGAA
ATTCACATGG GAGCGGTAGA TATTGCCAAG CAGGCTCCGA ATGTCACCCG CATGAAAATC
CTGGGTGCCA AGGTGGTTCC TGTCACGCAC GGCCTCCAAA GCTTGAAGGA AGCTGTGGAT
TCCGCCTTTG AATCTTACCT GAACAGTTAT CAGGATTCCA TTTACTGCAT CGGTTCCGTG
GTAGGCCCCC ACCCTTTCCC GCAAATGGTG CGCGATTTCC AAATGTGCAT CGGCGTGGAA
GCCCGGGAAC AATTCCTGGA AATGACGGGA CTTCTGCCGG ACGCCGTCTG CGCCTGCGTG
GGCGGCGGCA GCAATTCCAT GGGCATGTTT ACCGCCTTCC TGGGAGACCC GCTGGATATC
TATGGCGTAG AACCCCTCGG CAAAGGTCCC AGGCTTGGGG ATCATTCCGC CTCCATCACC
TATGGACGCA AGGGCGTCCT GCACGGTTTT GAAAGCATCA TGCTCCAGGA TGAAGACGGT
AATCCGGGAC CGGTTCATTC CGTAGCCAGC GGCCTGGATT ATCCTTCCGT AGGGCCGGAA
CACGCCTATC TGCACGACAT CGGCCGCGTT AACTATGTCA CCGCCACAGA CGAGGAAGCC
GTGGACGCCT TCTTCAAACT TTCCCGTTAC GAGGGGATCA TTCCCGCTCT GGAAAGCTCC
CACGCTATCG CGTATGCCAT GAAGTGGGCA CGGGAAAACA GAGGAGGCGC CATTCTGGTA
AACTGCTCCG GCCGCGGAGA CAAGGATGTG GATTACGTCG TGGAGCATTA CGGCTATGGG
GAAGACCACC AGTTCCCCGC CTGA
 
Protein sequence
MDLHTYLRNF PDAQGRFGEY GGVYLPDELV PAFEEITEAY QTIAHSAQFI NELRRIRKQF 
QGRPTPVYHC ERLSRHLGTS QIYLKREDLN HTGAHKLNHC MGEGLLAKYM GKKRIIAETG
AGQHGVALAT AAAFFGLECE IHMGAVDIAK QAPNVTRMKI LGAKVVPVTH GLQSLKEAVD
SAFESYLNSY QDSIYCIGSV VGPHPFPQMV RDFQMCIGVE AREQFLEMTG LLPDAVCACV
GGGSNSMGMF TAFLGDPLDI YGVEPLGKGP RLGDHSASIT YGRKGVLHGF ESIMLQDEDG
NPGPVHSVAS GLDYPSVGPE HAYLHDIGRV NYVTATDEEA VDAFFKLSRY EGIIPALESS
HAIAYAMKWA RENRGGAILV NCSGRGDKDV DYVVEHYGYG EDHQFPA
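
The gene and protein sequences above are printed in space-separated blocks of ten, so they need the whitespace removed before use. The following is a minimal sketch of how the two listings relate to each other, not the method used to produce this record. It builds the codon table by hand rather than relying on a library; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first and last rows of the gene sequence are used here.

```python
from itertools import product

# Codon assignments in TCAG order. These are shared by the standard code and
# translation table 11 (table 11 differs only in its permitted start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the listing above.
first_row = "ATGGATCTCC ATACTTATTT ACGCAATTTT CCAGACGCCC AGGGCCGTTT TGGCGAATAC"
last_row = "GAAGACCACC AGTTCCCCGC CTGA"

print(translate(first_row))  # MDLHTYLRNFPDAQGRFGEY
print(translate(last_row))   # EDHQFPA
```

The two printed strings correspond to the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the Protein Length and GC content fields.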