Gene Athe_2326 details


Gene Information

Locus tag: Athe_2326
Symbol:
ID: 7407745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2463875
End bp: 2465107
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 37%
IMG OID: 643716690
Product: major facilitator superfamily MFS_1
Protein accession: YP_002574169
Protein GI: 222530287
COG category:
COG ID:
TIGRFAM ID:
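
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(2463875, 2465107)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

With the values in this record, the sketch reproduces the listed Gene Length (1233 bp) and Protein Length (410 aa).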


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATATG TATTATTTTC AGGACCTTTT AGAGCATTAC GGCACAAAAA TTACAGATAC 
TACTGGTTTG GGCAGGCAAT ATCGGTGATT GGCTCATGGA TGCAGAACAT GGCAATGCAG
TGGCTGGCTT TAAGCATTAC AAATTCAGCA CTGCTTCTTA GTATTGTCAC TGCATGTGAA
CAAGTACCTG TAATGTTTAT TTCGCTTTTT GCAGGGGCAA TACTTGATAA AAGGCAAAAA
AGAAGGATTA TTTTGTTAAC TCAAAGCCTT CTTCTATTCT TTGCTTTCAT TCTATTTTTG
ATTACATATA CTCACACAGT TCGCTACTGG CACTTAGTAG TTTTAGCTAT TTTAAGAGGT
CTTGTAACAA CATTTGATAA CCCTGCAAGA CAGTCTTATA TGATAACTCT CGTTGGAAAA
GAAGACCTGC CGAACGCTGT TGGTCTTAAC TCTATGATTT TTAATCTTGC AAGAATCATA
GGCCCTGCTG TCGCAAGCTT GGTTATATCA ACAGCAGGAA TTGAAATGTG TTTTTTGGCA
AATGCTATAA GTTTTGTGCC AGTTATTATA GGTGTATTTC TGATTGACGC CAAAGAGCCT
CAAAAAGAGG AAAATGGTAA AAGCGTTTTT TCAGAGGTGG TTGAAGGGCT CAAGTATGTA
TATATGAACA AAGTGCTTCT GAGAGCAATA TCGCTTGTTT TAATCATGGG CATATTTATT
CTCAATTTTA ATGTTCTTAT TCCTGTGTAT GCAAAACTTG CTCTGGGCAG AAATGAAACA
GGTTTTGGTT TTTTGATGTC ATCGATGGGC ATTGGCTCAC TGATGGGCGC ATTTTTGACA
GCTACAAGAA GAAAGGAAAA GATTAATTTA AATCTCCTTT TTAAGTTCAT CCTCTCTGTG
TCAATAGTTT ACATTTTTCT TGGTCTTAAC AAAAGCTATG CAGTTGCTTG CGTACTATTT
GTGTTTGTAG GGCTTCTTGC AATAAGCTTT AACACAAGCG CAAACGCACT TTTGCAGCTT
TCATCAAGTG ATGACTTCAG AGCAAGGGTT CTGAGTATCT ACTTTCTTTG CAATGCTGGA
ACAACACCAA TTGGAAATCT ATTTACAGGA ACAATTTCAC AAAAAATCTC TCCATGGGCT
GGATTTTACA TACCTGGCCT TGCTACAATA GCTTTGACCA CAATGGTTCT TATCACCACA
TTTAAGAAAA AGAACCTTGA AAAAACTAAA TAA
 
Protein sequence
MQYVLFSGPF RALRHKNYRY YWFGQAISVI GSWMQNMAMQ WLALSITNSA LLLSIVTACE 
QVPVMFISLF AGAILDKRQK RRIILLTQSL LLFFAFILFL ITYTHTVRYW HLVVLAILRG
LVTTFDNPAR QSYMITLVGK EDLPNAVGLN SMIFNLARII GPAVASLVIS TAGIEMCFLA
NAISFVPVII GVFLIDAKEP QKEENGKSVF SEVVEGLKYV YMNKVLLRAI SLVLIMGIFI
LNFNVLIPVY AKLALGRNET GFGFLMSSMG IGSLMGAFLT ATRRKEKINL NLLFKFILSV
SIVYIFLGLN KSYAVACVLF VFVGLLAISF NTSANALLQL SSSDDFRARV LSIYFLCNAG
TTPIGNLFTG TISQKISPWA GFYIPGLATI ALTTMVLITT FKKKNLEKTK
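
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used to produce the record: it uses the standard codon assignments (translation table 11 shares these with the standard code for sense and stop codons), strips the 10-base grouping spaces, and stops at the first stop codon. The names `translate` and `gc_percent` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group the sequence display."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGCAATATG TATTATTTTC AGGACCTTTT"))
```

Applied to the first three 10-base groups of the gene sequence, `translate` yields the first ten residues of the protein sequence, MQYVLFSGPF. Passing the whole gene sequence to `gc_percent` gives a value that can be compared against the GC content field.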