Gene Amuc_1841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1841 
Symbol 
ID6274738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2238138 
End bp2239082 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content51% 
IMG OID642613904 
ProductThioredoxin domain 
Protein accessionYP_001878439 
Protein GI187736327 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4232] Thiol:disulfide interchange protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.928879 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAATT CATTTTTTCC TGCAATTGGT GTTCTTTCCT TGGCCTCCGC TTTGATTTGT 
TCCGCGTCTT CCTCCTGGGA GACTGATTGG AATAAAGCGC TGGAGAAGGC CGGAAAGGGC
GGACATCCTG TGCTGGCTGA TTTTACCGGT TCCGACTGGT GTCCCGGATG CATTTACCTG
CGCAAAAATA TTTTTGACAC GGATGCGTTC GCCAAATATG CGGCGGATCA TCAATTCGTG
CTGCTGGAAC TGGATTTCCC CAAGGCTGCC GGGAAAATGC CGCCGGAACA GTTAAAATTC
CATGAAGAGC TGATGCGGCG TTATGGCGTT TCCTCGTTCC CATCCGTTCT GTTGATGGAA
GGAAATGGCG CTCCCTACGC TAAAATAGTG GGTGCCACCA GAACTCCGGA GGAATATCTG
AAAAAACTGG AGGCTGCCGG AGAAACGAGG AGGAAGTTGA AAGAGGCCGT AGCGGCGGCC
CAGCCATTGA AAGGAAAGGA AAAACTGGAG CAACTGGTTA AGGCCTTGAA CGTGCTTCCG
GAAGATTTGC AGCCTTTCCA GAAGGGGTTG ATTGCAGAAA TTTCCGCTCT GGACCCGGAG
GACAAATACG GTTTTGCAAA GAAGTCTGAA AAAGCCGCAG CCATGGAGAA GCAGCGGCTT
GTGTGGGAAC AGTTCTGCCA AAAATATTCG GGGAGGCTCT CCGCAGAAGA AACGCGCGCC
GGCCGGGAGG AAGCATTGCA GATGTTGGAA AAAAAGGATA CGCTTCCTCC CATCCGCCTG
AAGATCGCCA AATATATCAG TGATGGGTAT ACCTTGGAAC GTAATTTGCC CAAGGCTTTG
GAATACCTGG AAATTGCCCG TGATGCCGAT CCGGAGTCTC AAGCCGCCAA AAAACTGGAA
CCGTGGATTG ACAATATGCG GAAACATATC AATCAGGAGA AGTAA
 
Protein sequence
MRNSFFPAIG VLSLASALIC SASSSWETDW NKALEKAGKG GHPVLADFTG SDWCPGCIYL 
RKNIFDTDAF AKYAADHQFV LLELDFPKAA GKMPPEQLKF HEELMRRYGV SSFPSVLLME
GNGAPYAKIV GATRTPEEYL KKLEAAGETR RKLKEAVAAA QPLKGKEKLE QLVKALNVLP
EDLQPFQKGL IAEISALDPE DKYGFAKKSE KAAAMEKQRL VWEQFCQKYS GRLSAEETRA
GREEALQMLE KKDTLPPIRL KIAKYISDGY TLERNLPKAL EYLEIARDAD PESQAAKKLE
PWIDNMRKHI NQEK