Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0409 |
Symbol | |
ID | 6274827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 489481 |
End bp | 490527 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642612459 |
Product | putative substrate-binding protein of aliphatic sulfonate ABC transporter |
Protein accession | YP_001877028 |
Protein GI | 187734916 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.830213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGTT CCAGATTCCT GATGATTTTA CCGGCCCTGT TTTTATGCCT GCTGGCTGTT TCCTGCGGAA AGAAGGAGAA GGAAGATCCC AATGTCGTGG AGTTGAACTT CGGTCATTTT CCCAACGTGA CACATGTGCA GGGCCTGGTG GCGCATCATT TTTCCCGGCA GGGGAAGGGG TGGTTTGAAG AACGGCTGAA GGAGGCTACC GGAAAGGATG TCAGGATCAA CTGGTACGTG TACAATGCGG GCCCCAGCGC CATGGAAGCC GTGTTCGCCC GGTCCATTGA ACTGGCTTAT GTGGGGCCCA GCCCGGCCAT CAATGCGTTT GTGCGTTCCC GCGGGGAGGA TATCAGGATG ATTGCCGGAG CTGTGGAAGG AGGCGCCGCC CTGGTGGTTC CGAAGGATTC CTTGCTGAAG GAGCCTGCGG ATTTCCGCGG CAAGGTGATT GCTACTCCCC AACTGGGGAA TACGCAGGAT GTTTCCGCCC GCGCCTGGTT TTCCCGCGGC GGCCTGCATG TGACGCAGCG CGGCGGGGAC GTGACGATTC TGCCGACTCC CAACCCCGAA CAGCTCAGCC TGTTCCGGCA GGGCAAGCTG GACGGCGTGT GGACGGTGGA ACCCTGGGTG AGCCGCCTGG TGCTGACGGC GGGAGGCAGG GTGCTGGTGG ACGAGAAGGA GTCCATCGCG ACCGTACTGG TATGCGGAGC GGAGTTTCTC AGGGAAAAGC CGGAGGTGGC GAAAGCCCTG GTGCAGGCGC ATGAGGAACT GAACGAATGG ATAAGGCTGC ATCCGGATGA GGCCCAGTTA ATCGTGGTCA GGGAGCTGGA GGAGCTGACG CATTCCAGAA TAGACCCGGC ATTGATTGCG GAGGCCTGGA AAAGCATTGT CATCAAGGAC AGGATTTCCA TTCCCAAGCT CCGGCAGTTT GTGCAGGACG CCCATCAGGC CGGATTTATG AAAGAGGTTC CGGATATAGG CGGCCTGGTG GTGCCGGAGG CGGCGGAGGA ACAGTTGACC ATGGCGAAGG AGGCGGGCGG AAGATGA
|
Protein sequence | MTRSRFLMIL PALFLCLLAV SCGKKEKEDP NVVELNFGHF PNVTHVQGLV AHHFSRQGKG WFEERLKEAT GKDVRINWYV YNAGPSAMEA VFARSIELAY VGPSPAINAF VRSRGEDIRM IAGAVEGGAA LVVPKDSLLK EPADFRGKVI ATPQLGNTQD VSARAWFSRG GLHVTQRGGD VTILPTPNPE QLSLFRQGKL DGVWTVEPWV SRLVLTAGGR VLVDEKESIA TVLVCGAEFL REKPEVAKAL VQAHEELNEW IRLHPDEAQL IVVRELEELT HSRIDPALIA EAWKSIVIKD RISIPKLRQF VQDAHQAGFM KEVPDIGGLV VPEAAEEQLT MAKEAGGR
|
| |