Gene Moth_2328 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2328 
Symbol 
ID3831080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2447916 
End bp2449112 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content57% 
IMG OID637830252 
Productsecretion protein HlyD 
Protein accessionYP_431158 
Protein GI83591149 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTT TAAACCAAAT CAAATTCAAA AAAACGGCGG TAACCGTCTT GCTGGCAGGC 
ATTATCGCCG CAACCGTCTT TATCTTTCAT AATTACTTTT ACAAACAGCA AGCAGCAATC
GCCGAAGAGC GCGACAGCCT CACAGCCACC GGGACAATCG AAGCCAGGAC CGCCATGGCC
GCTTTTAAAG TCCCCGGCAA AATCGAAACC CTGCTTGTCG ATGAGGGCGC CAGGGTAGAG
CAGGGTCAGG AACTGGCCCG CCTGGACAGC AGCGAGCTAA ATGCCAAGCT GACCCAGGCC
GAAGGGGCTT ATGCCGCAGC CCAGGGGCAG GAGAGCCAGG CCAGTAACAA CGTCACCTAC
CAGAGCCAGC AAATCGAAGC CAAAATCAAA CAGGCCGAAG CAGGAGTAGC CGAGGCCCAG
GTGGGCGTTA AGGACGCCCA AGATCAGGTG AACGCAGCCG AGGTGGGCGT TAAAGATGCC
AAAGATCAGC TAAACAACGC CAAAGACCTC TACGACCGCC TCCGCCTCCT GCACGACCAG
GGCGCCATAG ATGACCGCAA ACTAGAAGAA GCCAAAAACG GCTACGAGCG GGCCCAAAAC
GCCTACAACG CCGCCCAGAT CAGCTACGAG CGGGCCCAGA ACGCCTACAA CGCCGCCCAA
AAGAAACTCC AGGAAGCCCA GGCTCTCCTC GACCAGGCAA TATCAGCCCG CACCGGGGTG
GCGGTAGCCC AGGCCCAGCA AGAAGCCGCC GCCGGCCAGG TCAAGCAAGC CGGCGGGGCG
GTAGAGGAAG CAAAAACCTA CCTGGCTGAC GCCATTTTAA AGGCCCCCAT AGCCGGCTTC
ATCACCCAGA AACTCCTCGA GCAGGGCGAA ATGGTCAACG CTGGAACGCC GGTCTTTGAA
ATAACCGACC TCCTCCATAC CTACGTCAAG GTTTACATAA GCGAAAAGAA GATCGCCCGC
GTCCACCTCG GCCAGGAGGC GGAGATAACG GTAGATGCTT TACCGGGCAA AGTCTTTAAA
GGCAAGGTCG TGTGGATCAA CGACGCCGGC GAGTTTGCCG TCAAAAAGGC GATTAACGAT
CAGTACGAGC ATGACATCCG CAGCTTCGAG GTTAAAATCG ATGTCCCCAA CCCGGACCTG
GTCTTAAAAA CCGGTATGAC GGCCAGGGTA AAAATCCTGG AAGGGAAGCA GCAATAA
 
Protein sequence
MNALNQIKFK KTAVTVLLAG IIAATVFIFH NYFYKQQAAI AEERDSLTAT GTIEARTAMA 
AFKVPGKIET LLVDEGARVE QGQELARLDS SELNAKLTQA EGAYAAAQGQ ESQASNNVTY
QSQQIEAKIK QAEAGVAEAQ VGVKDAQDQV NAAEVGVKDA KDQLNNAKDL YDRLRLLHDQ
GAIDDRKLEE AKNGYERAQN AYNAAQISYE RAQNAYNAAQ KKLQEAQALL DQAISARTGV
AVAQAQQEAA AGQVKQAGGA VEEAKTYLAD AILKAPIAGF ITQKLLEQGE MVNAGTPVFE
ITDLLHTYVK VYISEKKIAR VHLGQEAEIT VDALPGKVFK GKVVWINDAG EFAVKKAIND
QYEHDIRSFE VKIDVPNPDL VLKTGMTARV KILEGKQQ