Gene Moth_1156 details


Gene Information

Locus tag: Moth_1156
Symbol:
ID: 3833124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1187963
End bp: 1189516
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 58%
IMG OID: 637829087
Product: carboxyl transferase
Protein accession: YP_430013
Protein GI: 83590004
COG category: [I] Lipid transport and metabolism
COG ID: [COG4799] Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta)
TIGRFAM ID:
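
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The Python sketch below is only an illustration: it assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1187963, 1189516)
print(length, protein_length(length))  # prints: 1554 517
```

Both values agree with the Gene Length and Protein Length fields of this record.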


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000234749
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGATA TAACGGCCAG GCTGGCCGAC CTGGAAGTCA GGCGCGCCAG GGTTATGGCC 
GGCGGTGGTG AAGAACGGGT AGCCCGGCAG CACGCTGCCG GCAAGTTAAC GGCCCGGGAA
CGGCTGGAGC TATTGCTGGA TCCCGGCAGT TTTCTGGAAC TGGATCAATT TGTGCGCCAC
CGGGCCACTG ATTTCGGCAT GCAGGACATA GAAACACCTG GTGAGGGTGT AGTTACCGGT
TCCGGGACCA TAAATGGTCG GCCAGTCTAC GTCTATGCCC AGGATTTTAC CGTTATGGGC
GGTTCCCTGG GGGAGATGCA TGCTGCCAAA ATCTGTAAGG TCATGGATCT AGCCCTGAAA
ACCGGTGTGC CGGTCATAGG TTTGAACGAT TCCGGCGGGG CCCGTATCCA GGAGGGGGTA
GCGGCCCTCA ATGGTTATGG GGAAATCTTC CGCCGCAACA CCCATGCTTC AGGGGTTATT
CCCCAAATCG CCGCTATTAT GGGCCCCTGT GCCGGGGGCG CCGTCTATTC GCCGGGGCTA
ATGGACTTCA TTTTTATGGT TGATAATACA GCCCAGATGT TTATTACCGG CCCCCAGGTA
ATCAAAGCCG TCACGGGGGA AGAAGTCAGC GCGGAAGAGC TGGGAGGTGC CGTCACCCAT
GCCACCAGGA GCGGGGTGGC CCACTTCCGG ACGCCAACGG AAGAAGATTG CCTGCACTTG
ATTCGCACCC TGCTGGATTA TTTACCTGCC AATAACTTGG AGGACCCGCC CTTCAGGCCC
AGTTCTGACC CGGCTGAGCG CCAGAATCCC AACCTGGCGG CACTGGTGCC TGTCGATCCT
AATAAACCGT ATAACGTGAA GGAAATCATC TACGGAGTGG TTGATGACGG CCTGTTCCTG
GAGATTCAAG GTGAATATGC CGCCAATATG GTTATTGGCC TGGCCCGCTT GGCGGGGTAT
ACTATAGGTA TTGTGGCCAA CCAGCCCCAG TACCTGGCCG GTTGCCTGGA TATCAACGCC
GCCGATAAAG CAGCACGTTT CGTGCGTTTT TGTGACGCCT TCAATATACC CCTCCTGACC
CTGGTGGATA CCCCCGGCTA CCTTCCCGGG GTGGAACAGG AACAGGGAGG CATTATCCGT
CATGGGGCCA AACTGTTATA TGCCTTCGCC GAGGCCACGG TACCCAAACT CACCCTCGTC
CTGCGCAAGG CGTACGGCGG TGCCTACCTG GCCATGTGCT CCCGCTCCCT GGGAGCCGAT
CATGTCGTTG CCTGGCCTAC AGCGGAAATA GCCGTCATGG GTCCCGAGGG GGCTGCTAAT
ATAATCTTCC GCCAGGAAAT CAGCCAGGCA GATGACCCGG CCCGGGTACG GCAGGAAAAA
GTCGCCGCTT ACCGCGATAA GTTTGCCAAC CCCTATGTGG CCGCGGGCCT GGGGCTGGTT
GATGCTGTTA TCGACCCGGC TCTGACCCGC CCCCATTTGA TCCGGAACCT GCTGACCCTG
CTCAGTAAAC GGGAAAGCCG CCCGGGTAAA AAACACGGCA ACTTCCCTGT CTAG
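
A GC content figure such as the one in the Gene Information section can be recomputed from a gene sequence. The following is a minimal Python sketch, not the database's own method: `gc_percent` is a hypothetical helper that skips whitespace and any character other than A, C, G, or T. The example call uses only the first 10-base block of the sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 3 G, 0 C out of 10 bases.
print(gc_percent("ATGGAAGATA"))  # prints: 30.0
```

Passing the full gene sequence (spaces and line breaks included) returns the value for the whole gene.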
 
Protein sequence
MEDITARLAD LEVRRARVMA GGGEERVARQ HAAGKLTARE RLELLLDPGS FLELDQFVRH 
RATDFGMQDI ETPGEGVVTG SGTINGRPVY VYAQDFTVMG GSLGEMHAAK ICKVMDLALK
TGVPVIGLND SGGARIQEGV AALNGYGEIF RRNTHASGVI PQIAAIMGPC AGGAVYSPGL
MDFIFMVDNT AQMFITGPQV IKAVTGEEVS AEELGGAVTH ATRSGVAHFR TPTEEDCLHL
IRTLLDYLPA NNLEDPPFRP SSDPAERQNP NLAALVPVDP NKPYNVKEII YGVVDDGLFL
EIQGEYAANM VIGLARLAGY TIGIVANQPQ YLAGCLDINA ADKAARFVRF CDAFNIPLLT
LVDTPGYLPG VEQEQGGIIR HGAKLLYAFA EATVPKLTLV LRKAYGGAYL AMCSRSLGAD
HVVAWPTAEI AVMGPEGAAN IIFRQEISQA DDPARVRQEK VAAYRDKFAN PYVAAGLGLV
DAVIDPALTR PHLIRNLLTL LSKRESRPGK KHGNFPV
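
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions. Translation table 11 assigns amino acids to codons in the same way as the standard code and differs in the start codons it permits; since this gene starts with ATG, the sketch does not handle alternative start codons. `CODON_TABLE` and `translate` are hypothetical names introduced for this example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGAAGATA TAACGGCCAG GCTGGCCGAC CTGGAAGTCA GGCGCGCCAG GGTTATGGCC"
last_line = "CTCAGTAAAC GGGAAAGCCG CCCGGGTAAA AAACACGGCA ACTTCCCTGT CTAG"
print(translate(first_line))  # prints: MEDITARLADLEVRRARVMA
print(translate(last_line))   # prints: LSKRESRPGKKHGNFPV
```

The two results match the first 20 and last 17 residues of the protein sequence. The last line of the gene sequence can be translated on its own because the 1500 bases before it are a multiple of three.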