Gene Moth_2355 details


Gene Information

Locus tag: Moth_2355
Symbol:
ID: 3832535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2475320
End bp: 2476849
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 61%
IMG OID: 637830277
Product: secretion protein HlyD
Protein accession: YP_431183
Protein GI: 83591174
COG category: [V] Defense mechanisms
COG ID: [COG1566] Multidrug resistance efflux pump
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
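
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record; 1-based inclusive coordinates are an assumption.
start_bp = 2475320
end_bp = 2476849

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be the stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1530 509
```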


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0694634
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACAG CGACGGTAAC GGCTAAAGGA GGATTCCTTA GCGGCAAAAG AATAATATGG 
CTGGTGGCGG TCTTTCTAAC TTTGATTATC GCCGCCGGGT GGTACTGGTG GGCCCGGGGG
CATCAGAAGG CTCCTAATTT CCTGACCGTT ACTGCCGGTA AAGGGAGCAT TGTGGAGACT
GTAAGCACCT CCGGGAAAAT TGAAGCATTA CATAGCATCG GCCTCAGCTT TAAGAATCCG
GGGACCATCA AGGCGATATA CGTCAAGGAA GGGCAGAGCG TCAAAGCCGG CCAGCTCCTG
GCCCTCCAGG ACCCAACCGA TCTGGAGCTT CAGGTCAAGG AGGCCCAGGC GAACCTGGAC
AATGCCATGG CCAAGCTAAA GAGCCTGCAG GCCGGGCCCC TGGCTACCGA CGTGGCCCAG
GCCGAGGCGG GTGTGGAGCA GGCTCAGGTT GAATATGACA ACGCCCAGGA TACTTTAAAA
CGCGATCAGG CCCTTTATGA TGCCGGCGCC CTGTCGGAAG TCGACCTGAA CAATGCCCGC
AAGGCGGAGG CGACGGCGGC AGCCAACCTG AAAAAAGCCC AGGCAGCCCT GGAGGCCCTG
AAAAACGGCA GCCGGCCGGA GGATATAGCC GCCGCCCAGG CCCAGGTAGA CGCGGCCCGG
GCCCAGCTGC AGCTGGCCCA GAATAACCTG GCGGCCACGG AGATCCGGGC ACCGTGGGAC
GGTATTGTCA CCAACGTCAA CGGGCAGGTG GGCCAGCGCG TGGGCAGCAA TACCAGCGCC
ACCGACGCTT CAAATAGTTT TATTTTTTTA ATCTCCCCCG AACTCCAGCT GCGGGTGCAA
GTCAATGAAG CCGACATCAA CAAAGTCAAG GTGGGGCAGG ATGTAGAGTT TACGGTCAAC
GCCCAGCCTG ACCGCACCTT TAAAGGCAAG GTGACGGCCA TTGCGCCTCA AGCCCAAACC
GTTTCCAACA TCCAACTTTA TGACGTCCTG GTAAACGTGG GAGACGCGGG AGCGTCCTTA
AAGGCCGGCG AGTCGGCCAG CGTGACCATA ATTATCAGCC GCAAAGATAA CGTCATTACC
ATCCCGCGGG CGGCCATCGC CTATGCCGAA GGCTACCTCT CCCAGGCCGG CAAGACCGCC
GGCGGCCGGG CAACCTCGTC AGGGGGGACC TCCGGCGGGG GTAACCGGAG TAGAAGTACC
CAGGGAACGG GAGGTTCGGC AGGCAGTTCG GTGACGGGCG CTTCTACCGC CGGAACTCCC
AGCGCCGGGG AAAACCGGGC CGTAATCCTG GTGCTGGAGG GCGGACGGCC GGTGCAGCGC
CAGGTGGTAA CCGGCGCCAG CGACGAGCGC AATATCGAAG TAGTCAGCGG CTTAAAGGCA
GGGGAACAGG TCGTAGTCGG CACAGGCAGC GCGAGTACAG GGAGTACCGG CAATACGGGA
AGCACAAGTA GCTCGGGCCG GTCGAGCAGC GGCAGTAACC GTAACACCGG CGGTCCCATG
CCGCCGGGCG GCTTTCTCGG GGGGAGATAA
 
Protein sequence
MATATVTAKG GFLSGKRIIW LVAVFLTLII AAGWYWWARG HQKAPNFLTV TAGKGSIVET 
VSTSGKIEAL HSIGLSFKNP GTIKAIYVKE GQSVKAGQLL ALQDPTDLEL QVKEAQANLD
NAMAKLKSLQ AGPLATDVAQ AEAGVEQAQV EYDNAQDTLK RDQALYDAGA LSEVDLNNAR
KAEATAAANL KKAQAALEAL KNGSRPEDIA AAQAQVDAAR AQLQLAQNNL AATEIRAPWD
GIVTNVNGQV GQRVGSNTSA TDASNSFIFL ISPELQLRVQ VNEADINKVK VGQDVEFTVN
AQPDRTFKGK VTAIAPQAQT VSNIQLYDVL VNVGDAGASL KAGESASVTI IISRKDNVIT
IPRAAIAYAE GYLSQAGKTA GGRATSSGGT SGGGNRSRST QGTGGSAGSS VTGASTAGTP
SAGENRAVIL VLEGGRPVQR QVVTGASDER NIEVVSGLKA GEQVVVGTGS ASTGSTGNTG
STSSSGRSSS GSNRNTGGPM PPGGFLGGR
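
The sequence blocks above are printed in space-separated groups of ten, so they need light cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC fraction, and translates a nucleotide string. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; it does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene begins with ATG. The function and constant names (`clean`, `gc_content`, `translate`, `CODON_TABLE`) are hypothetical.

```python
# Codon table built from the NCBI base order T, C, A, G.
# Amino-acid assignments are those shared by the standard code and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_content(block: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(block)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(block: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(block)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGGCGACAG CGACGGTAAC GGCTAAAGGA GGATTCCTTA GCGGCAAAAG AATAATATGG"
print(translate(first_line))  # MATATVTAKGGFLSGKRIIW
```

The printed peptide matches the first twenty residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_content` gives a value that can be compared with the reported GC content.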