Gene Athe_1067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1067 
Symbol 
ID7409624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1162590 
End bp1163861 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content33% 
IMG OID643715433 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002572941 
Protein GI222529059 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0207259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACATCTG AAGACCAAGA ACGGGAATTT AAAAGACTTG AGGTTTTTGC CTATAAAAAT 
TTAAAGAAAA ATTCTATCAT TTCGATTGCA GATGGAGCAG TATTTGCAAT AGGAAGCGGT
ATGCTTCCAG TTTCTACTGT GATAGTTTAT TTTATTTCAC ATTATGTTCA CTCAAATACG
CTGATTGGAC TTTTAACCAC CTTGAATGTA CTTTTATCTA ACTCTCCGCA GATTCTTGTT
GCTAAAAAAT TAGAGATGCT TGATAGCTAC AAAGAGTATT TTATTAAAGT TGCCTTACTT
ATGAGACTTA TGTGGTTTTT ACTGGCAATT GATGTGTTTG TGTTTGCAAC CACAAATGAG
CTTTTATTTG TAATTCTCTT TTACCTAATT TTTAGTCTTC AAGGTTTTTT TGCTTCATTT
GCCAATATAA CATGGTTCAA TCTTATTCTA AAGCTTGTTC CTGAAAGACA AAGGAGCAAG
TTTTTTGGTA TAAGGTCTTC GATAGGGGGA CTGTGTGAGA CATTTGGAGC CTTTTTGATG
GGAAGAATAT TGAGGCTTTT ACACTTTCCT TATAACTATG GTCTTTTATT TTTAATTTCG
TTTTTGATAA TGATGCTCTC ATTGTACATA GCTTCTATGA TGAAAGAGAT TCCTATCAAG
AAACCCAAAA AGGTGATTGA TAATAAGCAT TATTTTAGGA GCATGTTTTT GATACTGAAA
GAAGATAGAA ATTTTAAATA TTATCTTCTT TCAGTTTTAT TTATTGGCGC ACTGGGTAAG
ATGCCATTTG GTTTTCAAAC CATATTTGCA AAAAATAGCC TGAGTATTTC AACACAACAT
GTTGCAATTG CAACCACAAT ATTGCTTTTT TCTCAGACAA TAGGATATAT GCTATGGGGA
ATAATCGGTT CTAAGTATGG GTTTAAAAGT ACTCTTTTGA TTTCTGCTTT GATGTTTTTA
CCTGCAATAT ATTTTACATA CCTTATGAGT TCTATAAGCG TTTATTATCT TTCTGTTGCT
CTGTTTGGGA TTGCTCAAAG TGCAAGGAAC GTAAACGAAA GCAATATGGC TGCAAAACTT
TGCAAGGACC CTTTAAAGCA GCCATCTTAT ATTGGTCTTA GAAATTTTTT GATGGGACCA
TTTTTTGCTT TTAATTCTAT AATAGCTGGA GGTATAATTG ATACTCTTGG TAAAAACATT
CTCTTTTTAA TTTCATTTAG CTGCATGGTG CTCGGATTTT TTATTCTGTG TTTTTTAGTC
AGAGAGGACT AA
 
Protein sequence
MTSEDQEREF KRLEVFAYKN LKKNSIISIA DGAVFAIGSG MLPVSTVIVY FISHYVHSNT 
LIGLLTTLNV LLSNSPQILV AKKLEMLDSY KEYFIKVALL MRLMWFLLAI DVFVFATTNE
LLFVILFYLI FSLQGFFASF ANITWFNLIL KLVPERQRSK FFGIRSSIGG LCETFGAFLM
GRILRLLHFP YNYGLLFLIS FLIMMLSLYI ASMMKEIPIK KPKKVIDNKH YFRSMFLILK
EDRNFKYYLL SVLFIGALGK MPFGFQTIFA KNSLSISTQH VAIATTILLF SQTIGYMLWG
IIGSKYGFKS TLLISALMFL PAIYFTYLMS SISVYYLSVA LFGIAQSARN VNESNMAAKL
CKDPLKQPSY IGLRNFLMGP FFAFNSIIAG GIIDTLGKNI LFLISFSCMV LGFFILCFLV
RED