Gene Moth_0707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0707 
Symbol 
ID3832708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp738369 
End bp740075 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content58% 
IMG OID637828638 
ProductABC transporter related 
Protein accessionYP_429568 
Protein GI83589559 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR01166] cobalt transport protein ATP-binding subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0149651 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTTA TTGAAATCCG CAACCTGGAA TTTACCTATA AAGGCGGTAC CCAACCGGCC 
CTTAAAGGGA TCGACCTGGA TATTAACCCC GGCGAATTTA TCGTCATTAT GGGTCACAGC
GGGGCCGGTA AGTCTACCCT GTGCCTGACT TTAAACGGCC TGGTTCCCAA TTTGAAAAAG
GGTGCTTTTA AAGGCGAGGT AAAGATTAAA GGTACACCAA CGGCGGGGCA TAAGGTCAGC
CACTTTGCTC GGACGGTGGT CCTGGTATTC CAGGATTTTG AAACCCAGCT CTTCTCCACC
AGCGTAGAAC TGGAGGTCGC CTTCGGCCCG GAGAATTTCA ATGTTCCTCC GGAGGAAATC
AAGGAACGGG TGCGGGAGGC CCTGCAAAAG GTCCGCCTGA CGGGATTTGA AAACCGGCAG
CCGGCCAACC TTTCCGGAGG CCAGAAACAG CGCCTGGCTA TAGCCTCGGT GCTTTCCATC
CAACCGGAGA TTATTTGCAT GGATGAGCCG ACGACCGATC TCGATCCTGT AGGCAAGTAT
GAGGTTTTCA GCATCGCCAG CGCCCTGCGG CAGGAAAAAC ATATGTCCAT GATTATTGTC
GAGCATGAGG TGGAGGAGGC CCTGGAGGCC GACCGCATTA TCCTCATGAA GGAAGGGACT
ATCCTGGCCC AGGGCACGCC AAGGGAGATA CTGAGCCAGA GCGATCTCCT GGCCGGCTGC
GCGGTGAAAC CCCTGGACAT GGCCGAACTC TTCGGCCGGC TGGGTTTCAA AGAACTGCCT
TTGACGGTAG AGGAAGGCCT GGAGGCGTGG CGGCGGGCGG GGTTAGCCTT AAACCAGGAG
CGTTACCGGT CCATGGTTAA AGCCCAGCAG GCAGCCCGGG AGGGGAAATA CGGCGAACCC
ATTATTGAAG TCAAGGGCTT ACGCCATAGC TATGATAAAG ATTTTGAAGC CCTGAAGGGC
ATAGATCTCA CTATCCGCCA GGGTGAGTTT GTAGCCATCC TGGGGCAGAA CGGCAGCGGC
AAGACTACCC TGGTCAAGCA CTTCAACGGC CTGCTCCGGC CGACCGCCGG GGAAGTGGTG
GTAGCCGGGA TGGATACGCG GGCGGAAAGC ATCGATAAAT TCGGCCGCGT GGTGGGGTAT
GTCTTCCAGA ACCCGGATCA CCAGATCTTT GCCGGGACTG TAAAGGAAGA AGTGGCCTAC
GGTCCCAAAC TCTACGGCGT GCCCCCGGCC GAGATTGAGG AGCGGGTCAG GGACGCCCTG
CAGGCCGTCG AACTGGCTGG CTTGGAAGAG GAGGACCCCT TTTCCTTGAC CAAAGGCCAG
CGGCAGCGGG TGGCCGTAGC CTCGATCCTG GCGGCCAAAC CCCGGGTGAT CATCCTGGAT
GAACCGACGA CGGGCCTGGA TTATAAAGAG CAGCGGGGTA TGATGGAGTT AGTCCGGCGG
CTCAATGACA TGGGTCATAC TATTATCATG GTCACCCACA GCATGTGGGT CACGGCCGAA
TATGCCCACC GGGTAATCGT CGTTAAGGAT GGGCGGGTGG TAATGGACGG CCCGACGCGG
GAGGTCTTTG CCCGGGAAGA AGAACTGGAA GCCGCCTTCT TAAAGCCGCC CCAGATCGTT
CGTTTCAGCA ACCGCCTGGG TGCTACCTGC CTCAGCGTGG AGGAAGTAAT GGCCTGCCTG
GAAGGCCGCG AGGGGGGAGC AGAGTGA
 
Protein sequence
MSLIEIRNLE FTYKGGTQPA LKGIDLDINP GEFIVIMGHS GAGKSTLCLT LNGLVPNLKK 
GAFKGEVKIK GTPTAGHKVS HFARTVVLVF QDFETQLFST SVELEVAFGP ENFNVPPEEI
KERVREALQK VRLTGFENRQ PANLSGGQKQ RLAIASVLSI QPEIICMDEP TTDLDPVGKY
EVFSIASALR QEKHMSMIIV EHEVEEALEA DRIILMKEGT ILAQGTPREI LSQSDLLAGC
AVKPLDMAEL FGRLGFKELP LTVEEGLEAW RRAGLALNQE RYRSMVKAQQ AAREGKYGEP
IIEVKGLRHS YDKDFEALKG IDLTIRQGEF VAILGQNGSG KTTLVKHFNG LLRPTAGEVV
VAGMDTRAES IDKFGRVVGY VFQNPDHQIF AGTVKEEVAY GPKLYGVPPA EIEERVRDAL
QAVELAGLEE EDPFSLTKGQ RQRVAVASIL AAKPRVIILD EPTTGLDYKE QRGMMELVRR
LNDMGHTIIM VTHSMWVTAE YAHRVIVVKD GRVVMDGPTR EVFAREEELE AAFLKPPQIV
RFSNRLGATC LSVEEVMACL EGREGGAE