Gene Moth_1525 details

Gene Information

Locus tag: Moth_1525
Symbol:
ID: 3831990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1570431
End bp: 1572362
Gene Length: 1932 bp
Protein Length: 643 aa
Translation table: 11
GC content: 60%
IMG OID: 637829457
Product: pyruvate carboxylase subunit B
Protein accession: YP_430377
Protein GI: 83590368
COG category: [C] Energy production and conversion
              [I] Lipid transport and metabolism
COG ID: [COG0511] Biotin carboxyl carrier protein
        [COG5016] Pyruvate/oxaloacetate carboxyltransferase
TIGRFAM ID: [TIGR00531] acetyl-CoA carboxylase, biotin carboxyl carrier protein
            [TIGR01108] oxaloacetate decarboxylase alpha subunit
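The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete with a single terminating stop codon; the helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1570431
end_bp = 1572362
gene_length_bp = 1932
protein_length_aa = 643


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one for the stop codon (complete CDS assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1932
print(expected_protein_length(gene_length_bp))  # 643
```

Both results agree with the listed Gene Length (1932 bp) and Protein Length (643 aa).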


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000118935
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0137179
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCGGTTG CAGAGCCAGC CATGGGGAAT GTCGCCCTGG CTTTAGCCAG GGTTTTTCTT 
TACGGGGGAG GAGCAGTTTT GAGCCAGTTG AAGATTACAG ACACCACCTT GCGCGATGGC
CACCAGAGCC TTTGGGCCAC CAGGATGACT ACGGCCGATA TGCTGCCCAT TATCGAAAAG
ATCGATAGTG TCGGGTACCA CTCCCTGGAG GTCTGGGGTG GGGCCACCTT CGATGTCTGC
ATGCGCTTCC TGGATGAGGA TCCCTGGGAG CGCCTCCGTA CCTTGAAAAA ATATGCCCGG
CGGACGCCCC TGCAGATGCT TCTCCGGGCC CAGTCCCTGG TGGGCTACCA GCTGTACCCG
GACGATGTAG TACGGGCCTT TATCGCCAGG GCCGCGGCCA ACGGCATTGA TATTATCCGC
ATTTTCGACG CCCTGAACGA CCTGCGCAAC ATGGAAGTCC CGGTGGAAGC CGCCAAAAAG
GAAGGTGTCC ACGTCCAGGG AACGGTGGTC TATACCATCA GCCCGGTGCA TACCACCGAG
CACTACCTGA AAACAGCCCT GGAACTGGAG AGTATGGGGG TCGATTCCAT TTGCATTAAA
GATATGGCCG GCCTCCTGGC TCCCTTTGAA GCCTATAAAC TGGTCAAGCT CTTTAAAGAA
AAACTCCACG TCCCCGTTCA GCTCCATAGC CATTACATCG GCGGTCTGGC GGTTGGGGCC
TACCTGGAAG CGGCCCGGGC CGGGGTGGAT GTCGTCGACA CAGCCTCTGT TCCCCTGGCC
TTCGGCGCCT CCCAGCCGCC GGTGGAGACG GTGGTCCGGG CCCTGGAGGG AACGCCCTAT
GACACCGGCC TGGATCTCAA CCTCCTCTTT GAAATCGCCC GCTATTTCGA CGACCTGCGC
CGGGAACTGG GTTACGAGCG CGGCGTCACC CGGATTACCG ATATGTGGGT TTTCCAGCAC
CAGGTCCCCG GGGGCATGAT CTCCAACCTG GTCAGCCAGT TAAAGGAACA AAAGGCCGCC
GACCGGATCA ACGAGGTCCT GGCCGAGATA CCCCGGGTAC GGGCCGACCT GGGTTACCCG
CCCCTGGTCA CGCCCACCAG CCAGATTGTC GGTACCCAGG CGGTTTTAAA CGTCCTCCTG
GGCGAACGTT ACAAGATGGT ACCCGGCGAG GTGAAGAATT ACGTCCGGGG CCTGTACGGC
CGGCCGCCGG CACCCATTTC CGAGGAGATC CGGCGCCTGA TAATCGGCGA TGAAGAGCCC
ATCCAGGGGC GACCGGCCGA CATCCTGGAG CCCCGCCTGG AGGAAGCGCG GCGGGAGATC
GGCGATCTGG CCCGTAATGA GGACGACGTG GTTGCCTACG CCATGTTTCC CCAGATTGCC
CGGAAATTCT TTGAGAAGCG CCAGCAGGGC CAGCTCCGGC CGGGCCGGCA GTCTTTACCA
TCCAGGGGCC AGGATAAGGC AGAGGCAGGA GGTACAGCAA AGATGGATCT CAAGGATATC
ACCCAGCTCA TTAAAGCCCT GGAAGAGACG GGGATTACCG AACTGAACCT GGAGAGCGAA
GGGGTAAAGG TCATGATCCG CCGCGGTAGC GGCCAGGGGG CGGCGGAGAT CCCCGCACCG
GAGATAAAAA CTGCTGCAGA GGTACTGGCC ACAGACGGCG GCGCTCAGGA AACACCTGTC
CCGGCCGGAG ATATCATTGA AGTCCGGGCG CCCATGGTGG GTACCTTTTA CCGGGCTCCC
TCTCCCGACG CGCCCCCTTT TGTCGAGGTG GGGACCAGGG TTAAGGCAGG ACAAACCCTG
TGCATAATAG AGGCGATGAA GCTAATGAAC GAGCTCACGG CTGAAACCGG CGGCCAGGTG
GTGGCCATCC TGGCCGAAAA CGGCCAGCCG GTGGAATACG GCCAGGTCTT GTTCCAGATC
AAGAAGGATT AA
 
Protein sequence
MAVAEPAMGN VALALARVFL YGGGAVLSQL KITDTTLRDG HQSLWATRMT TADMLPIIEK 
IDSVGYHSLE VWGGATFDVC MRFLDEDPWE RLRTLKKYAR RTPLQMLLRA QSLVGYQLYP
DDVVRAFIAR AAANGIDIIR IFDALNDLRN MEVPVEAAKK EGVHVQGTVV YTISPVHTTE
HYLKTALELE SMGVDSICIK DMAGLLAPFE AYKLVKLFKE KLHVPVQLHS HYIGGLAVGA
YLEAARAGVD VVDTASVPLA FGASQPPVET VVRALEGTPY DTGLDLNLLF EIARYFDDLR
RELGYERGVT RITDMWVFQH QVPGGMISNL VSQLKEQKAA DRINEVLAEI PRVRADLGYP
PLVTPTSQIV GTQAVLNVLL GERYKMVPGE VKNYVRGLYG RPPAPISEEI RRLIIGDEEP
IQGRPADILE PRLEEARREI GDLARNEDDV VAYAMFPQIA RKFFEKRQQG QLRPGRQSLP
SRGQDKAEAG GTAKMDLKDI TQLIKALEET GITELNLESE GVKVMIRRGS GQGAAEIPAP
EIKTAAEVLA TDGGAQETPV PAGDIIEVRA PMVGTFYRAP SPDAPPFVEV GTRVKAGQTL
CIIEAMKLMN ELTAETGGQV VAILAENGQP VEYGQVLFQI KKD
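The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it builds a codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The function names `clean` and `translate` are hypothetical, and only the first and last blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame nucleotide string codon by codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks and the final line of the gene sequence in this record.
first_30_nt = "ATGGCGGTTG CAGAGCCAGC CATGGGGAAT"
last_12_nt = "AAGAAGGATT AA"

print(translate(first_30_nt))  # MAVAEPAMGN
print(translate(last_12_nt))   # KKD*
```

The first ten residues match the start of the listed protein sequence, and the final codons give the terminal KKD followed by a stop. Passing the full gene sequence to `translate` should, under the same assumptions, reproduce the 643 residues plus the stop.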