Gene Athe_2645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2645 
Symbol 
ID7407009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2776967 
End bp2778391 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content38% 
IMG OID643717014 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002574483 
Protein GI222530601 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000155318 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACACTC AAGAAGACAG AAAAATAAAT AAAAACATAC TCATCGCAAC AACACTTTCT 
TCTTTTTTAG TGCCGTTTAT GTCAAGCGCA GTCAATATTG CCGCACCAGA TATAGCAAAA
AGTTTTAAGC TCAACGCTGA AGAGCTGAAC CTTGTGATAA GCATATTTTT GATATTCTCT
GCAGCCTTCA TTCTTCCCAT GGGAAAGCTC TCTGACACAT TTGACAGGAC CAAGATATTC
AAAACAGGGC TTTTGCTGTT TACACTTTCA ACCCTGATGT GTGCACTCTC AAACACAGTA
GAAATTCTTT TTGTCTTCCG CGCACTTCAG GGATTTTTCT CAGCATTCAC ATTTGTGACA
TCTATGCCAA TCTTGATTGA AGAACACTTA CCACAAATAA GGGGAAGGCT TTTAGGGATA
AACACAGCAG TTGTGTACTT GGGGACATCC TTAGGACCTT TTTTGGGTGG TTTGCTTGTA
AAACTTTGGG GATACAGAAG CATATTTTTG TTTGGATTTG CCATAGGACT TGTTGGTTCA
TTTGTGAGTT TATTTTTACT CCAAAAAGAA GTGAAAAATA CAAGGCAGGC AAAACTACTT
GACAGCCTTA AATCGCTTGA CAAAATGGGC ACAATCCTGT CGATGACAGG GCTTTTTCTT
TTAATGTACG GAGCCTCCAC ATTTGAACTG GGAAATACCT CTAAAATTTT GTTCTTTGCA
GGGTTAATTT TGATGGTAAT TTTTGTTGTT GCAGAGGCAA AACTTCAAAA TCCCATTTTG
GACGTAAAAC TGTTTGTAAA AATCCCGCAG TTTGGATTTT CAAACTTAGC AGCGCTCATA
AACTACAGCT GCACATTTTC TGCGTCTTAC CTTATGTCGC TGTACCTTCA ACTTGTAAAA
GCTCTGCCAT CCCAGCTTGC AGGCTCTATT TTGATTGTTC AACCACTGTC GCAGGTTATT
ACTTCATTAA TTTCCGGCAG AGCCTCTGAA AAGATAGAAC CAAGAAAGCT TGCAACATCT
GGCATGGTTT TGACCACAGC TGGTCTTTTT ATTTTTTCAA CTTTTGCTGC TAAAACAAAC
CTTGTTATTG TTATCTTAAA TCTGTTTATC ATGGGGATTG GTTTTGGACT TTTCTCATCG
CCAAACACAA ATGTTGTTAT GAGCTGTGTA CCAAAATCAC TCTATGGCAC AGCATCATCA
ACAATATCTG TCATGAGAGT TATAGGACAG GCATTCTCAA TGGCAATTGT TTCGTTTGTA
TCAATCATGT TTTTGAAAGG CGTAAAACTT TCGCACGAAA ACTATCTTCT TATTCTAAAG
AGCATGAAGA CAAGCTTTTT GGTGTTTGCA CTTCTCTCTA TTCTGGGAAT TGTTGCGTCA
TACAAAAGAG GAAATATTTA CTCTGAAGTA AAACAAAGCA AATAA
 
Protein sequence
MHTQEDRKIN KNILIATTLS SFLVPFMSSA VNIAAPDIAK SFKLNAEELN LVISIFLIFS 
AAFILPMGKL SDTFDRTKIF KTGLLLFTLS TLMCALSNTV EILFVFRALQ GFFSAFTFVT
SMPILIEEHL PQIRGRLLGI NTAVVYLGTS LGPFLGGLLV KLWGYRSIFL FGFAIGLVGS
FVSLFLLQKE VKNTRQAKLL DSLKSLDKMG TILSMTGLFL LMYGASTFEL GNTSKILFFA
GLILMVIFVV AEAKLQNPIL DVKLFVKIPQ FGFSNLAALI NYSCTFSASY LMSLYLQLVK
ALPSQLAGSI LIVQPLSQVI TSLISGRASE KIEPRKLATS GMVLTTAGLF IFSTFAAKTN
LVIVILNLFI MGIGFGLFSS PNTNVVMSCV PKSLYGTASS TISVMRVIGQ AFSMAIVSFV
SIMFLKGVKL SHENYLLILK SMKTSFLVFA LLSILGIVAS YKRGNIYSEV KQSK