Gene Moth_2297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2297 
Symbol 
ID3831329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2411422 
End bp2412399 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content62% 
IMG OID637830217 
Productdaunorubicin resistance ABC transporter ATP-binding subunit 
Protein accessionYP_431127 
Protein GI83591118 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID[TIGR01188] daunorubicin resistance ABC transporter ATP-binding subunit 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.422878 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGTAA TCGAAGTCCG GAACCTGGTC AAGCGCTTTA ATGATCTGGA AGCCGTAGCC 
GGGGTCACCT TCAATGTTGA AGAGGGGGAA ATCTTCGGCT TCCTGGGGCC CAACGGCGCC
GGTAAGTCGA CTACCATCAA GATGCTCTGC ACCCTCCTCA AACCCACGGC GGGCCGCCTT
ACCCTGGCCG GGTTTGACGT AGCCCGTGAG CCCGATGCCG TCCGGCGCTC CATCGGCCTG
GTTTTCCAGG ACAACTCCCT GGACGACCGC CTGACGGCAG AAGAAAACCT TTACCTGCAC
GGCCTCCTCT ATGGCCTCAG CCGGGCGGCC ATCAAAGAAA GGATAGTAGA AGTCCTGGCC
ATGGTCGACC TGGCTGACCG CCGTCGGGAC ATCGTCCGCA CCTTCTCCGG CGGCATGCGC
CGGCGGCTGG AGATCGCCCG CGGCCTGTTG CACCACCCCA GGGTACTCTT CCTGGACGAG
CCGACGGTAG GCCTGGACCC CCAGACCCGC AGCGCCATCT GGCAGCACAT CCACCGGCTG
CGCCGGGAAA AAAACATTAC CATCTTTATG ACCACCCACT ATATGGATGA AGCAGAAAAC
TGCGACCGCA TCGCCATCAT CGACCACGGC CGTATCCAGG CCCTGGACAC CCCGGACAAC
CTTAAACGCC AGCTGGGGGG CGACGTGGTC ACCCTGACGA CCATCGACGA TTCCCGGCTG
CAGCAGGAGA TCGCCGGCCG TTACGGGGTC AGGGTAATCA AAGACGAGGA GGGCCTGCGC
CTCCAGGTTA GTGACGGAGC CACCTTCATC CCCCGGGTGG CCGCCGATTT CGGCGGACAA
ATTAACAGTA TATCCTTGCG GCGCCCCACC CTGGACGACG TTTTCTTAAA CCTTACGGGC
CGGGCCATCC GCGAAGAAAA ACCCTCGGCT GCCAGCCTCA TGCGTCTGAA CCGCCGCCAC
GGCCGCCGGC GCCATTAG
 
Protein sequence
MAVIEVRNLV KRFNDLEAVA GVTFNVEEGE IFGFLGPNGA GKSTTIKMLC TLLKPTAGRL 
TLAGFDVARE PDAVRRSIGL VFQDNSLDDR LTAEENLYLH GLLYGLSRAA IKERIVEVLA
MVDLADRRRD IVRTFSGGMR RRLEIARGLL HHPRVLFLDE PTVGLDPQTR SAIWQHIHRL
RREKNITIFM TTHYMDEAEN CDRIAIIDHG RIQALDTPDN LKRQLGGDVV TLTTIDDSRL
QQEIAGRYGV RVIKDEEGLR LQVSDGATFI PRVAADFGGQ INSISLRRPT LDDVFLNLTG
RAIREEKPSA ASLMRLNRRH GRRRH