Gene Moth_1902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1902 
Symbol 
ID3831175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1966875 
End bp1968062 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content56% 
IMG OID637829835 
Productsecretion protein HlyD 
Protein accessionYP_430745 
Protein GI83590736 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0432629 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAGTCA AAAAGTGGCC CCCCTTATTG AAAAGATTGT TAATAGCCGG CGCCATCATT 
TTTGCCGGCG TCGCCTTTGC CCTTATTAGC CGTAGTCCTA AGCCTGCCAC CGTCGGCATA
TACGAAGGGC ATCCGGTTCT CTACGGTTCC GGGGTGATCG AGACAACCGA AATTGGTGTT
GGCGCTGAAA TTCCCGGCAG GATCCGTCAG GTGCTGGTGC AGGAGGGCCA GTTTGTAACG
GCAGGGAGCA TTATTGCTGT TTTAGATGAC GCCGAGCTGC AAGGACAGGT GGCCCAGGCC
AAAGCCGCGG TTGCCGGGGC CGAGGCCCAG CTGGCCCAGG TCAGAGCTGT GTATACAGCC
GAACAGGCCG GGGTGGAAGG GAACATTCAA ATGGCCCTGG CCGCCTTGCA GAAGCTGACG
GCAGGTGCCC GCCAGCAAGA GATTGATGCC GCCCAAAAAA AAGTCGACCA GGCCAGGGCT
AAATTACAAA GCGCCCAGGA ACAGCTCCGC CGCATGGAAA CGCTGCACCA GCAGGGGGTT
ATCTCCGACG AGCAGTACCA GCAGGCCAAA ACCAATTATG AAGTTGCCCG GGCCGATTTG
GGAGCGGCCG AAGACAATTT GTCTTTGCTT GTCTCCGGTT CCCGGCCGGA AGATATAGCC
GCCGCCAGGG CCAATTATGA AGTCGCTCTT GCCGGTCGCT CCCAGGTAGA GGCCCGCCGG
AAAGATGTAG ATGCCGCAGC CGCAGCCCTT GATAAAGCAA AGGCGGCGTT AAAGACTGCC
GAGGAGCAAC TGGCCAAGGC AACTATCCGG GCTAAAACCA GCGGTGTTGT TCTAAGGTGT
AATTTTAGTG CCGGCGAGGT TGTGAATCCC GGTATTCCCA TCGTTACCCT AAGCGATCCC
GCGGACCTCT GGCTGGCAAT CTACGTTCCC GAGACGGAAA TCGGTAAGGT AAAGGTGGGC
CAGCAGGCAG TCGTAACGGT GGATTCCTTC CCGGGTAAAC GCTTTAATGG CAGGGTGAAG
GAGATCGCCG GCCAGGCCGA ATTTACGCCC AAAAACATCC AGACCAAGGA AGAGCGGGTA
GACCTGGTGT TTAAGGTGAA AATTTCCCTG GCTAACGAAG AACAACTGCT AAAACCGGGT
ATGCCGGCGG ATGCCATGGT TTACCTGGAC AGCCAGGAGG CAAATTAA
 
Protein sequence
MLVKKWPPLL KRLLIAGAII FAGVAFALIS RSPKPATVGI YEGHPVLYGS GVIETTEIGV 
GAEIPGRIRQ VLVQEGQFVT AGSIIAVLDD AELQGQVAQA KAAVAGAEAQ LAQVRAVYTA
EQAGVEGNIQ MALAALQKLT AGARQQEIDA AQKKVDQARA KLQSAQEQLR RMETLHQQGV
ISDEQYQQAK TNYEVARADL GAAEDNLSLL VSGSRPEDIA AARANYEVAL AGRSQVEARR
KDVDAAAAAL DKAKAALKTA EEQLAKATIR AKTSGVVLRC NFSAGEVVNP GIPIVTLSDP
ADLWLAIYVP ETEIGKVKVG QQAVVTVDSF PGKRFNGRVK EIAGQAEFTP KNIQTKEERV
DLVFKVKISL ANEEQLLKPG MPADAMVYLD SQEAN