Gene Moth_1524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1524 
Symbol 
ID3831989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1569058 
End bp1570404 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content59% 
IMG OID637829456 
Productacetyl-coenzyme A carboxylase carboxyl transferase subunit alpha / biotin carboxylase 
Protein accessionYP_430376 
Protein GI83590367 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000243805 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0117003 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAGAA GGGTTTTAGT TGCCAACCGC GGCGAGATTG CCGTACGGAT TATCCGGGCC 
TGCCGGGAAC TGGATATTGA AACGGTGGCC GTTTATTCAG AGGCTGACCG GGATTCCCTG
CATACCCGCC TGGCCGATAA GGCCGTCTGT ATCGGGCCGG CCCCGGCTAA CCGCAGCTAC
CTGCATATTC CCAGCATCAT TACCGCCGCC AGGATGAGCG GAGCCGACGC CATTCACCCC
GGTTACGGTT TCCTGGCGGA GAATCCCTAC TTCGCCGAGA TGTGCGAAAC GTCGGGGATT
ACCTTCATCG GCCCCTCGCC TCGTTCCATG CAGCTTATGG GGGATAAGGC CACGGCCCGG
GCAACCATGA TCGCCGCCGG GGTGCCGGTA GTCCCCGGCT CCGAGGGTGT AATCAAAGAC
CTGGACGCCG CCCTGGCGGT AGCCAAAGAG ATAGGATACC CGGTGTTGAT TAAAGCTGCG
GCCGGCGGTG GCGGCCGGGG GATCCGCGTC GCCCAGGGGC CCAGGGAGCT ACGCCAGGCC
GTTTTTACCG CCCAGCGGGA AGCCGAGGCC GCCTTTGGAA ACTCCCAGGT TTACCTGGAG
AAATATATTG AAGAACCGCG CCATATAGAG TTTCAAATAA TCGGCGACAG GGAAGGAAAT
ATCATCCACC TTGGGGAGCG CGACTGCTCC TTGCAGCGGC GCAACCAGAA AATCCTGGAG
GAGGCTCCTT CAGGAGCCCT TACCCCCGAA CTGCGCCAGG AAATGGGCGC CCTGGCCCTG
AAGGCCGCCA GGGCCGCCAA TTACTACAGC ACCGGCACGG TAGAGTTTTT ACTGGATAAA
TACGGCCATT ACTATTTTAT AGAAATGAAT ACCCGCATCC AGGTGGAACA CCCGGTTACC
GAGGCCGTCA CCGGCATCGA CCTGGTTCAG GAACAGATTA AAATTGCCGC CGGCGAGCCG
CTGCGCCTGG CCCAGGAGGA TGTCCAGATC CGTGGCCATG CCCTGGAGTG CCGGATCAAT
GCCGAGGACC CAGCCCATAA CTTCCGGCCG GCCCCGGGCC GTATTGAACG CTATCACGCG
CCAGGGGGAT TCGGCATCCG GGTGGAGAGC GCTGTTTACA GCGGTTACAC CATCCCGCCC
TTTTATGACT CCTTGATTGC CAAGGTTATT GCCTGGGCCC CGGACAGGGA AGCAGCCATC
AACCGCATGA GCGGGGCTTT GAAAGAAATG GTGATTGAAG GGGTGCCTAC TACCATTCCC
TTTCACCAGC AGATTATGGC CAATGCCTTT TTCCGGCGCG GGGAGATCTA CACCAACTTC
ATCCAGCGCC GCTTAATGGC CGGTTAA
 
Protein sequence
MFRRVLVANR GEIAVRIIRA CRELDIETVA VYSEADRDSL HTRLADKAVC IGPAPANRSY 
LHIPSIITAA RMSGADAIHP GYGFLAENPY FAEMCETSGI TFIGPSPRSM QLMGDKATAR
ATMIAAGVPV VPGSEGVIKD LDAALAVAKE IGYPVLIKAA AGGGGRGIRV AQGPRELRQA
VFTAQREAEA AFGNSQVYLE KYIEEPRHIE FQIIGDREGN IIHLGERDCS LQRRNQKILE
EAPSGALTPE LRQEMGALAL KAARAANYYS TGTVEFLLDK YGHYYFIEMN TRIQVEHPVT
EAVTGIDLVQ EQIKIAAGEP LRLAQEDVQI RGHALECRIN AEDPAHNFRP APGRIERYHA
PGGFGIRVES AVYSGYTIPP FYDSLIAKVI AWAPDREAAI NRMSGALKEM VIEGVPTTIP
FHQQIMANAF FRRGEIYTNF IQRRLMAG