Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2645 |
Symbol | |
ID | 7407009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2776967 |
End bp | 2778391 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643717014 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002574483 |
Protein GI | 222530601 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000155318 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACACTC AAGAAGACAG AAAAATAAAT AAAAACATAC TCATCGCAAC AACACTTTCT TCTTTTTTAG TGCCGTTTAT GTCAAGCGCA GTCAATATTG CCGCACCAGA TATAGCAAAA AGTTTTAAGC TCAACGCTGA AGAGCTGAAC CTTGTGATAA GCATATTTTT GATATTCTCT GCAGCCTTCA TTCTTCCCAT GGGAAAGCTC TCTGACACAT TTGACAGGAC CAAGATATTC AAAACAGGGC TTTTGCTGTT TACACTTTCA ACCCTGATGT GTGCACTCTC AAACACAGTA GAAATTCTTT TTGTCTTCCG CGCACTTCAG GGATTTTTCT CAGCATTCAC ATTTGTGACA TCTATGCCAA TCTTGATTGA AGAACACTTA CCACAAATAA GGGGAAGGCT TTTAGGGATA AACACAGCAG TTGTGTACTT GGGGACATCC TTAGGACCTT TTTTGGGTGG TTTGCTTGTA AAACTTTGGG GATACAGAAG CATATTTTTG TTTGGATTTG CCATAGGACT TGTTGGTTCA TTTGTGAGTT TATTTTTACT CCAAAAAGAA GTGAAAAATA CAAGGCAGGC AAAACTACTT GACAGCCTTA AATCGCTTGA CAAAATGGGC ACAATCCTGT CGATGACAGG GCTTTTTCTT TTAATGTACG GAGCCTCCAC ATTTGAACTG GGAAATACCT CTAAAATTTT GTTCTTTGCA GGGTTAATTT TGATGGTAAT TTTTGTTGTT GCAGAGGCAA AACTTCAAAA TCCCATTTTG GACGTAAAAC TGTTTGTAAA AATCCCGCAG TTTGGATTTT CAAACTTAGC AGCGCTCATA AACTACAGCT GCACATTTTC TGCGTCTTAC CTTATGTCGC TGTACCTTCA ACTTGTAAAA GCTCTGCCAT CCCAGCTTGC AGGCTCTATT TTGATTGTTC AACCACTGTC GCAGGTTATT ACTTCATTAA TTTCCGGCAG AGCCTCTGAA AAGATAGAAC CAAGAAAGCT TGCAACATCT GGCATGGTTT TGACCACAGC TGGTCTTTTT ATTTTTTCAA CTTTTGCTGC TAAAACAAAC CTTGTTATTG TTATCTTAAA TCTGTTTATC ATGGGGATTG GTTTTGGACT TTTCTCATCG CCAAACACAA ATGTTGTTAT GAGCTGTGTA CCAAAATCAC TCTATGGCAC AGCATCATCA ACAATATCTG TCATGAGAGT TATAGGACAG GCATTCTCAA TGGCAATTGT TTCGTTTGTA TCAATCATGT TTTTGAAAGG CGTAAAACTT TCGCACGAAA ACTATCTTCT TATTCTAAAG AGCATGAAGA CAAGCTTTTT GGTGTTTGCA CTTCTCTCTA TTCTGGGAAT TGTTGCGTCA TACAAAAGAG GAAATATTTA CTCTGAAGTA AAACAAAGCA AATAA
|
Protein sequence | MHTQEDRKIN KNILIATTLS SFLVPFMSSA VNIAAPDIAK SFKLNAEELN LVISIFLIFS AAFILPMGKL SDTFDRTKIF KTGLLLFTLS TLMCALSNTV EILFVFRALQ GFFSAFTFVT SMPILIEEHL PQIRGRLLGI NTAVVYLGTS LGPFLGGLLV KLWGYRSIFL FGFAIGLVGS FVSLFLLQKE VKNTRQAKLL DSLKSLDKMG TILSMTGLFL LMYGASTFEL GNTSKILFFA GLILMVIFVV AEAKLQNPIL DVKLFVKIPQ FGFSNLAALI NYSCTFSASY LMSLYLQLVK ALPSQLAGSI LIVQPLSQVI TSLISGRASE KIEPRKLATS GMVLTTAGLF IFSTFAAKTN LVIVILNLFI MGIGFGLFSS PNTNVVMSCV PKSLYGTASS TISVMRVIGQ AFSMAIVSFV SIMFLKGVKL SHENYLLILK SMKTSFLVFA LLSILGIVAS YKRGNIYSEV KQSK
|
| |