Gene Athe_0890 details


Gene Information

Locus tag: Athe_0890
Symbol:
ID: 7407465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 990500
End bp: 993580
Gene Length: 3081 bp
Protein Length: 1026 aa
Translation table: 11
GC content: 33%
IMG OID: 643715264
Product: MMPL domain protein
Protein accession: YP_002572773
Protein GI: 222528891
COG category: [R] General function prediction only
COG ID: [COG2409] Predicted drug exporters of the RND superfamily
TIGRFAM ID:
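
The Gene Length and Protein Length fields follow from the Start bp and End bp coordinates. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the listed 3081 bp but is not stated in this record), the arithmetic is:

```python
# Coordinates copied from the record above.
start_bp = 990500
end_bp = 993580

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the final codon is the stop codon
# and does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 3081 1026
```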


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000250771
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGAGTAA TTTTAAAGCG TCCGTGGCTG AGTACTATTA TATGGGCTAT TTTAACAGTA 
ATATTTGTAG TTACTATGCC AGATATAAAT AAAATAGTGC GACTCAAAGG TGAACCTAAA
ATTGGTCAGG AATACCCATT TCAAGTTGCA GCAGCGTTAG AAAAAGAATA TCAGGGAGTG
CCAAAGGATA AAAAGCTTTC AACAGTAGCA ATTGTATTTT ATGACAAAAA CGGTCTTTCA
GATGAAGACA TTGTGGCAAT TAAAAGGATA ATTTATGGAC TTAACAGCAA CAGAGAAAAA
TACAACATTG AAAATATTTC AACTCATTTC AAGAATCCTG AGCTAAAAAA CTACTACGTA
TCAGGCAACA ACAAGGTTAT CCTTGCAAGC ATTCAGGCAG ACAAGTCAAA GAGAGAAATT
ATTGAGATAA GAGACGATAT TTATAAGTTT ATAGATTCTG TTAGAAAACC AAAGAGTTTA
GAAGTGTATC TGACAGGTGG AGATTTAATT ACCCAAGACT TTGTAAAGGC TTCAGAAGAT
GGTGTTAAAA AGACCGATGT CATCACAATT GTATTTATAA TCATTGTGTT GGTCCTTGTT
TTCCGCTCAC CTGTCACTCC AATTGTTTCA CTTTTAACAG TTGGTGTTTC GTTTTTAATC
TCACTTAGCA TAGTTGGGCA CATAGCAGAT AAGTTTAATT TTCCAATCAA TACTTTCACA
AGAACATTTA TTGTTGTATC CTTGTTTGGA ATTGGAACGG ATTATAATCT ATTGATTATT
TCAAGATTTA GAGAAGAGCT TGCAAATGGA AAAGATGTCG ACAATGCAAT AATAACCACA
TTTAAAACCG CAGGAAAGAC AGTGGTATTT AGCTGTCTTG CAGTTTTCAC AGGTTTTGCA
GCTTTTGCAG CAGCATCTTT TAACTTTTTC AGAGCAATGT CAGCAATTTC AGTAAGTGTA
CTTGTACTTT TACTTGTGCT TTTAACCTTG GTTCCAGCAG TGCTCAAAAT TTTTGGTAAA
AATCTTCTCT GGCCATTTAA TAAAGAAATA CAGCACACCC ATAGCAGACT CTGGGAAGCT
GCTGCGAGAC TTTCGACTAA ATATCCTTAT GTTGTTTTGA CAACAGTTAT TTTAATTTTA
ATACCTTTTC TTTTATCAGT CAGAGGCGAT TTGTCATTCA ACACACTAAA CGAGCTTTCT
GAAAAGTACA AATCCGTAAA AGGATTTAAC ATAATTTCAA AAAACTTTAG TAGTGGAAAG
ACATTTCCTG TTACGATTTA TTTAAAATCT AATAAGAGCT TAGCAACTTC GGATGCACTA
TCTGACATTG AAAAAATAAC TGAGGTATTG AGCAAGCAAA AAGGTATAAA AGATGTATAT
TCAGTAACAA GACCTCAAGG GGAGATTATA AAACAATTTA CTTTGTCAAA CCAAGCAGAT
ACTGTTATAA AAGGTATTGA TAGTTTGAGA GATGGTCTTT TGAAAGTTAA CAATGCAATT
GAGACTATAA ACAAAAATTT AAATATGTCA ACAAAAGAAT TTGATGTTCA AAAACTTATA
ACTGGCATTT CACAGCTTGA GAATGGTGTA TATGAATTGA AAATTTCTTT AGAAAAACTA
AATGGTGGAT TTGAACAGGG GTTAAATGGT TCTAAGAAGA TATATGATGG TCTTAATAAG
CTTTCAATTG GAAGCAGCAA GCTGTCTGAA GGCTTTAAAG AATTCTATTC TGAGTATACA
AGGTCAATAA ATAAGGTAAA AAGGGAATTA CAGGGATTTG ATGTAAATCA AATTGAACTT
TTATTGACAG GTATTAATAC TGCTAATAAC AATTTAAAAG CACTTTCAAA TAAGTATCCG
GAAATCTCAA AGGATATCAA TTATATTATG GCATCTGAGA TTTTATCAAA AATTGAAACT
GAAGCAAATA AGTTTGCTCC AAAGTTAAAA GAGTTTTCTT CTGAATATGA AAAGATTTCA
GCCCAGATAA ATGAAGCTGA CAAAGCTCAG AAGCTTATTT CAAATTCTAT TGATAGCATA
GCTTCAGCTT CAAAACAGCT TGCTGATGCT CATGGGAAGG TTCTAAGTGG CTACTATCAA
ATTGACAAAG GTCAAAAGCA AATAATCTCA GGTGTAAACC TTATGTATTC AAAACTTCAG
GAGTTTGACA AACAAAAAGA ACAGATTCTA AAAAAAGTGG ATGAACTTCA GACAGGGTTT
ACAAGTTTAA AATCAGCGCT TTTGCAGATT TCTGATGCAC TTAACAAGAT GGCAAATTCT
ATGCAGAATA TGAAGAGTTA TTTTGAAGGA TACAAAACAA CAAACATCTT TTACGTCCCA
CCAGAGGCTA TAAAAAGCAG TAGCTTCAAA AAGGCTTTAG ATTCATATAT GAACAAAAAT
AGAACAATAA CAAAGATAAT CTTGATTCTT GACACAAATC CATATACAAA CAAGGCAATT
GACATAGTTG ACAATGTGGA AAAGGTTTTA AATAACTCTT TAGAGTTTAT AAACACAAAG
TTTGTGTCGG TAGGAGTTGG TGGGATATCA TCTTCAAACC ACGATTTGAG AAGCATTTAT
TTTAAGGATT TCAAGACTTT GAGACTTATA ATGATAATAA GCATCTTTAT TCTCATGTTC
CTAATTTCAA GGTCAATCTT TAATGCTGCA ATAATGGTAA TAATAGTTTT TGCCGACTAC
TACCTTGCAC TCTCTATAAC AGAGATGATT TTCAAAGGAA TATTCAAGTA CGAAGCACTC
AACTGGGCAG TGCCATTTTT CACCTTTGTT GTGTTCTTAG CTCTGGGTAT AGACTACAGT
GTATTTTTAC TGATAAGGTT TTACGAATAT AGAAATTTAG AACTTTCAGA AGCACTGAGA
CTTACCTCGG CAAACATTGG GCATGTTGTA ACATCAGCGG TGATAATTTT AGCCGGCACA
TTTGCGGCAA TGCTGCCATC TGGGATTTTG ACATTAATGC AGGTTTCCAT CTGTGTTGTG
ATAGGCCTTG TTCTGCTTGC ATTTTTTCTG CTTCCATTTT TGTACAATAC TTTGATGAGA
ATAAAAAGGG ATATAATATA A
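
The gene sequence is printed in space-separated blocks of 10 bases, so spaces and line breaks have to be removed before anything is computed from it. The following is a rough sketch of how a GC content value could be recomputed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, copied with its block spacing.
first_line = "GTGAGAGTAA TTTTAAAGCG TCCGTGGCTG AGTACTATTA TATGGGCTAT TTTAACAGTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full 3081 bp sequence, the result can be compared with the GC content of 33% listed under Gene Information.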
 
Protein sequence
MRVILKRPWL STIIWAILTV IFVVTMPDIN KIVRLKGEPK IGQEYPFQVA AALEKEYQGV 
PKDKKLSTVA IVFYDKNGLS DEDIVAIKRI IYGLNSNREK YNIENISTHF KNPELKNYYV
SGNNKVILAS IQADKSKREI IEIRDDIYKF IDSVRKPKSL EVYLTGGDLI TQDFVKASED
GVKKTDVITI VFIIIVLVLV FRSPVTPIVS LLTVGVSFLI SLSIVGHIAD KFNFPINTFT
RTFIVVSLFG IGTDYNLLII SRFREELANG KDVDNAIITT FKTAGKTVVF SCLAVFTGFA
AFAAASFNFF RAMSAISVSV LVLLLVLLTL VPAVLKIFGK NLLWPFNKEI QHTHSRLWEA
AARLSTKYPY VVLTTVILIL IPFLLSVRGD LSFNTLNELS EKYKSVKGFN IISKNFSSGK
TFPVTIYLKS NKSLATSDAL SDIEKITEVL SKQKGIKDVY SVTRPQGEII KQFTLSNQAD
TVIKGIDSLR DGLLKVNNAI ETINKNLNMS TKEFDVQKLI TGISQLENGV YELKISLEKL
NGGFEQGLNG SKKIYDGLNK LSIGSSKLSE GFKEFYSEYT RSINKVKREL QGFDVNQIEL
LLTGINTANN NLKALSNKYP EISKDINYIM ASEILSKIET EANKFAPKLK EFSSEYEKIS
AQINEADKAQ KLISNSIDSI ASASKQLADA HGKVLSGYYQ IDKGQKQIIS GVNLMYSKLQ
EFDKQKEQIL KKVDELQTGF TSLKSALLQI SDALNKMANS MQNMKSYFEG YKTTNIFYVP
PEAIKSSSFK KALDSYMNKN RTITKIILIL DTNPYTNKAI DIVDNVEKVL NNSLEFINTK
FVSVGVGGIS SSNHDLRSIY FKDFKTLRLI MIISIFILMF LISRSIFNAA IMVIIVFADY
YLALSITEMI FKGIFKYEAL NWAVPFFTFV VFLALGIDYS VFLLIRFYEY RNLELSEALR
LTSANIGHVV TSAVIILAGT FAAMLPSGIL TLMQVSICVV IGLVLLAFFL LPFLYNTLMR
IKRDII
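
The gene sequence begins with GTG, yet the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine at the start position. The sketch below illustrates this on the first 30 and last 21 bases of the gene. The helper `translate_table11` is hypothetical, and its start-codon set is simplified to ATG, GTG and TTG.

```python
# Codon table in the conventional T, C, A, G ordering. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code; it differs in which
# codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Simplified assumption: only the most common table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_table11(seq: str, is_gene_start: bool = True) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # An initiating codon is translated as methionine regardless of its
        # usual meaning (GTG would otherwise be valine).
        if i == 0 and is_gene_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


head = "GTGAGAGTAATTTTAAAGCGTCCGTGGCTG"  # first 30 bases of the gene
tail = "ATAAAAAGGGATATAATATAA"           # last 21 bases, ending in the TAA stop

print(translate_table11(head))                       # MRVILKRPWL
print(translate_table11(tail, is_gene_start=False))  # IKRDII
```

Both outputs match the first ten and last six residues of the protein sequence above.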