Gene Moth_1259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1259 
Symbol 
ID3833054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1301959 
End bp1303542 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content50% 
IMG OID637829195 
Productpropionate CoA-transferase 
Protein accessionYP_430116 
Protein GI83590107 
COG category[I] Lipid transport and metabolism 
COG ID[COG4670] Acyl CoA:acetate/3-ketoacid CoA transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00406341 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGACGTC CCAGTTTTAT GACAGGTGAA GAAGCTGCCA AATTAATCTT TGACGGGGCA 
GTAATTGCTT CGGTGGGAAT GACCCTGGTA AGTGCCAGCG AAGAAATCCT TAAAGGCATT
GAGAAGCGTT TCTTAGAAAC CGGCCATCCC CGCAATTTAA CCCTGATCCA CTCAGCCGGC
CAGAGTAACC GCGACCGGGG CATCCAGCAC TATGCCCACG AAGGACTGGT CACGCGTATC
ATCGGTTCCC ACTGGGGCCT GCAACCGCGC TGGATGGAGA TGATTAGCAA CAACAAAGTA
GAAGCATATT GTATGCCCCA GGGCCAGCTG GCGCAGATAT ACCGTAGTGT AGCCTGCGGG
CAACCGGGTA AATTAACTAA AATCGGCCTG GGCACTTTCG TCGATCCACG TATCGAGGGC
GGTAAGATGA ACGAGCGAAC TAAACCCCTG GAGGACTTGG TCGAAGTTGT CACCCTGGAC
GGCGAAGAGT ACCTGCTCTA TAAAATTCCA CCCATTGATT TCGTAATTAT CCGGGGTACG
ACGGCTGATG AAAACGGTAA CATTACCACG GAAGAAGAGG GAATGAAACT TGAGGTTTTA
CCAGCCGTCC TGGCAGCCAA GCGTTTTGGC GGTAAGGTCC TCTGCCAGGT TAAGCGGGTA
GCCCAGGCCC ATACCCTGCA TCCCAAGCAG GTGGTTGTAC CCGGTGTCTT TGTAGATGCC
ATTGTCGTTT GCCAGAATCC TGACGAAGAC CACCGCCAGA CCTCCAGCTG GGTTTTCGAT
CCTGCCTACT GTGGCGATTT ACGGGTGCCG GTTTCCAGGA TAGAACCCCT GCCGTTGAAT
ATCCGCAAGG TAATCGGCCG GCGAGCCATG ATGGAATTGG AGCCTGATGC CATCATCAAC
CTGGGTACAG GCATTCCTAA CGACGTTGTC GGCCCCATTG CCGCCGAAGA AGGTATCCTG
GATGATATTA CAATCACCGT AGAGTCTGGG GTCTACGGTG GTATTCCTGC CGGTGGCATT
GACTTTGGTA TTGCCAAAAA TACCGATGCT TTAATCCGGC ACGACGATCA ATTTGATTTT
TATACCGGTG CAGGAGTTGA TTTCACCTTC ATGGGTGCCG GGGAAATGGA TGCTAACGGA
AATGTCAATG CCACTAAAAT GGGGTCTCGT GCAGCCGGTG CCGGCGGGTT TATCGATATA
ACCCAGGGTG CCAAGCATGT AATCTTCTGT TCAACTTTTA CTACCGGCGG CTTAAAAGTC
GATTTTGTTG ATGGGAAAGT GAAGATCTTG CAGGAAGGAT CAATCAAAAA ACTGGTAAAG
CAAGTTCAGC AAATTTCCTT CAGCGGTCTC CTGGCTCGTC AGAAAAAACA AAAGGTGCAC
TTTATAACCG AGAGGGCTGT TTTTGAGCTT CAGGAAGACG GACCCGTACT GGTAGAAATT
GCACCCGGAA TTGACCTGGA AAAGGATGTC CTCAACCAAA TGGAATTTCG ACCGCGTATT
GCTGAACCTC TACGGGTTGG CGATCCCTGT ATTTATAACC AGGGTAAATA TGGGCTCAAA
GAGATAATTA GCGCTAAAAA ATAA
 
Protein sequence
MRRPSFMTGE EAAKLIFDGA VIASVGMTLV SASEEILKGI EKRFLETGHP RNLTLIHSAG 
QSNRDRGIQH YAHEGLVTRI IGSHWGLQPR WMEMISNNKV EAYCMPQGQL AQIYRSVACG
QPGKLTKIGL GTFVDPRIEG GKMNERTKPL EDLVEVVTLD GEEYLLYKIP PIDFVIIRGT
TADENGNITT EEEGMKLEVL PAVLAAKRFG GKVLCQVKRV AQAHTLHPKQ VVVPGVFVDA
IVVCQNPDED HRQTSSWVFD PAYCGDLRVP VSRIEPLPLN IRKVIGRRAM MELEPDAIIN
LGTGIPNDVV GPIAAEEGIL DDITITVESG VYGGIPAGGI DFGIAKNTDA LIRHDDQFDF
YTGAGVDFTF MGAGEMDANG NVNATKMGSR AAGAGGFIDI TQGAKHVIFC STFTTGGLKV
DFVDGKVKIL QEGSIKKLVK QVQQISFSGL LARQKKQKVH FITERAVFEL QEDGPVLVEI
APGIDLEKDV LNQMEFRPRI AEPLRVGDPC IYNQGKYGLK EIISAKK