Gene Information |
Locus tag | Athe_2326 |
Symbol | |
ID | 7407745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2463875 |
End bp | 2465107 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643716690 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002574169 |
Protein GI | 222530287 |
COG category | |
COG ID | |
TIGRFAM ID | |
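
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2463875, 2465107)
print(length)                     # 1233
print(protein_length_aa(length))  # 410
```

Both results agree with the Gene Length (1233 bp) and Protein Length (410 aa) fields in this record.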
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATATG TATTATTTTC AGGACCTTTT AGAGCATTAC GGCACAAAAA TTACAGATAC TACTGGTTTG GGCAGGCAAT ATCGGTGATT GGCTCATGGA TGCAGAACAT GGCAATGCAG TGGCTGGCTT TAAGCATTAC AAATTCAGCA CTGCTTCTTA GTATTGTCAC TGCATGTGAA CAAGTACCTG TAATGTTTAT TTCGCTTTTT GCAGGGGCAA TACTTGATAA AAGGCAAAAA AGAAGGATTA TTTTGTTAAC TCAAAGCCTT CTTCTATTCT TTGCTTTCAT TCTATTTTTG ATTACATATA CTCACACAGT TCGCTACTGG CACTTAGTAG TTTTAGCTAT TTTAAGAGGT CTTGTAACAA CATTTGATAA CCCTGCAAGA CAGTCTTATA TGATAACTCT CGTTGGAAAA GAAGACCTGC CGAACGCTGT TGGTCTTAAC TCTATGATTT TTAATCTTGC AAGAATCATA GGCCCTGCTG TCGCAAGCTT GGTTATATCA ACAGCAGGAA TTGAAATGTG TTTTTTGGCA AATGCTATAA GTTTTGTGCC AGTTATTATA GGTGTATTTC TGATTGACGC CAAAGAGCCT CAAAAAGAGG AAAATGGTAA AAGCGTTTTT TCAGAGGTGG TTGAAGGGCT CAAGTATGTA TATATGAACA AAGTGCTTCT GAGAGCAATA TCGCTTGTTT TAATCATGGG CATATTTATT CTCAATTTTA ATGTTCTTAT TCCTGTGTAT GCAAAACTTG CTCTGGGCAG AAATGAAACA GGTTTTGGTT TTTTGATGTC ATCGATGGGC ATTGGCTCAC TGATGGGCGC ATTTTTGACA GCTACAAGAA GAAAGGAAAA GATTAATTTA AATCTCCTTT TTAAGTTCAT CCTCTCTGTG TCAATAGTTT ACATTTTTCT TGGTCTTAAC AAAAGCTATG CAGTTGCTTG CGTACTATTT GTGTTTGTAG GGCTTCTTGC AATAAGCTTT AACACAAGCG CAAACGCACT TTTGCAGCTT TCATCAAGTG ATGACTTCAG AGCAAGGGTT CTGAGTATCT ACTTTCTTTG CAATGCTGGA ACAACACCAA TTGGAAATCT ATTTACAGGA ACAATTTCAC AAAAAATCTC TCCATGGGCT GGATTTTACA TACCTGGCCT TGCTACAATA GCTTTGACCA CAATGGTTCT TATCACCACA TTTAAGAAAA AGAACCTTGA AAAAACTAAA TAA
|
Protein sequence | MQYVLFSGPF RALRHKNYRY YWFGQAISVI GSWMQNMAMQ WLALSITNSA LLLSIVTACE QVPVMFISLF AGAILDKRQK RRIILLTQSL LLFFAFILFL ITYTHTVRYW HLVVLAILRG LVTTFDNPAR QSYMITLVGK EDLPNAVGLN SMIFNLARII GPAVASLVIS TAGIEMCFLA NAISFVPVII GVFLIDAKEP QKEENGKSVF SEVVEGLKYV YMNKVLLRAI SLVLIMGIFI LNFNVLIPVY AKLALGRNET GFGFLMSSMG IGSLMGAFLT ATRRKEKINL NLLFKFILSV SIVYIFLGLN KSYAVACVLF VFVGLLAISF NTSANALLQL SSSDDFRARV LSIYFLCNAG TTPIGNLFTG TISQKISPWA GFYIPGLATI ALTTMVLITT FKKKNLEKTK
|
| |
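
The gene and protein sequences above can be related with a short translation routine. This is a sketch under stated assumptions: it uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code, it ignores the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here), and the helper names `translate` and `gc_content` are hypothetical. Whitespace is stripped so the 10-base groups shown in the record can be pasted directly.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCAATATG TATTATTTTC AGGACCTTTT"))  # MQYVLFSGPF
```

The printed residues match the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field.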