Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1841 |
Symbol | |
ID | 6274738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2238138 |
End bp | 2239082 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642613904 |
Product | Thioredoxin domain |
Protein accession | YP_001878439 |
Protein GI | 187736327 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.928879 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAATT CATTTTTTCC TGCAATTGGT GTTCTTTCCT TGGCCTCCGC TTTGATTTGT TCCGCGTCTT CCTCCTGGGA GACTGATTGG AATAAAGCGC TGGAGAAGGC CGGAAAGGGC GGACATCCTG TGCTGGCTGA TTTTACCGGT TCCGACTGGT GTCCCGGATG CATTTACCTG CGCAAAAATA TTTTTGACAC GGATGCGTTC GCCAAATATG CGGCGGATCA TCAATTCGTG CTGCTGGAAC TGGATTTCCC CAAGGCTGCC GGGAAAATGC CGCCGGAACA GTTAAAATTC CATGAAGAGC TGATGCGGCG TTATGGCGTT TCCTCGTTCC CATCCGTTCT GTTGATGGAA GGAAATGGCG CTCCCTACGC TAAAATAGTG GGTGCCACCA GAACTCCGGA GGAATATCTG AAAAAACTGG AGGCTGCCGG AGAAACGAGG AGGAAGTTGA AAGAGGCCGT AGCGGCGGCC CAGCCATTGA AAGGAAAGGA AAAACTGGAG CAACTGGTTA AGGCCTTGAA CGTGCTTCCG GAAGATTTGC AGCCTTTCCA GAAGGGGTTG ATTGCAGAAA TTTCCGCTCT GGACCCGGAG GACAAATACG GTTTTGCAAA GAAGTCTGAA AAAGCCGCAG CCATGGAGAA GCAGCGGCTT GTGTGGGAAC AGTTCTGCCA AAAATATTCG GGGAGGCTCT CCGCAGAAGA AACGCGCGCC GGCCGGGAGG AAGCATTGCA GATGTTGGAA AAAAAGGATA CGCTTCCTCC CATCCGCCTG AAGATCGCCA AATATATCAG TGATGGGTAT ACCTTGGAAC GTAATTTGCC CAAGGCTTTG GAATACCTGG AAATTGCCCG TGATGCCGAT CCGGAGTCTC AAGCCGCCAA AAAACTGGAA CCGTGGATTG ACAATATGCG GAAACATATC AATCAGGAGA AGTAA
|
Protein sequence | MRNSFFPAIG VLSLASALIC SASSSWETDW NKALEKAGKG GHPVLADFTG SDWCPGCIYL RKNIFDTDAF AKYAADHQFV LLELDFPKAA GKMPPEQLKF HEELMRRYGV SSFPSVLLME GNGAPYAKIV GATRTPEEYL KKLEAAGETR RKLKEAVAAA QPLKGKEKLE QLVKALNVLP EDLQPFQKGL IAEISALDPE DKYGFAKKSE KAAAMEKQRL VWEQFCQKYS GRLSAEETRA GREEALQMLE KKDTLPPIRL KIAKYISDGY TLERNLPKAL EYLEIARDAD PESQAAKKLE PWIDNMRKHI NQEK
|
| |