Gene Haur_0749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0749 
Symbol 
ID5732472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp853715 
End bp855082 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content52% 
IMG OID641277879 
Productacetyl-CoA carboxylase, biotin carboxylase 
Protein accessionYP_001543525 
Protein GI159897278 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000629358 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACGCA AAATTTTAAT TGCCAATCGT GGTGAAATTG CGGTGCGAAT TATTCGTGCT 
TGCCACGAGC TAGGCATCAA AGCAGTTGCC GCCTATTCCG AGGCCGATCG CGATTCGCTG
GCGGTGCGTA TGGCCGATGA GGCGATTTGT ATTGGCCCGC CACCACCTGC CAAATCCTAT
TTGAATGCGC CAGCCTTGAT TAGCGCTGCG CTGATTAGCG ATTGCGATGG GATTCACCCA
GGTTATGGCT TTTTGTCGGA AAACCCCTAT TTTGCTGAAA GCTGCCGTGA GTGTGGTCTG
ACTTTTATTG GCCCTTCAGC CGATTCGATT CAGCGCATGG GCGATAAAGC GCTGGCCAAG
CAAGCCATGA AGTTGGCTGG CCTGCCGCTT GTGCCTGGCA CCGAAAACCC CTTGACCAGC
GTTGAAGAAG CTCAAAGCCT TGCTGATGGT ATTGGCTACC CGGTTTTGCT CAAAGCTGTG
GCTGGCGGTG GCGGGCGGGG CATGCGCGTG GTCAATCAGC CTGATGAATT GGCCCGAGCT
TTTAATACTG CCCGCGCTGA GGCCGAAGCT GCCTTTGGCC GTGGCGATTT GTATATGGAA
AAATACTTGC CAGTGGTGCG CCACGTTGAA ATTCAGATTT TGGCTGATCA ACATGGCCAT
GCAATTCACC TTGGCGAGCG TGATTGCTCG TTGCAACGTC GCCACCAAAA AGTGGTGGAA
GAAGGCCCAT CGCCTGCCTT GACCCCAGAA TTACGCCAGA AAATGGGCGA AGCCGCCTTG
CATGGCGTGC GCGAAATTGG CTACTACAAC GCTGGCACAA TGGAATTTTT ACTCGATCAT
CAGGGAAATT TCTATTTTAT GGAAATGAAC ACCCGTTTGC AGGTTGAGCA CCCTGTGACT
GAATGGCTGA CCGGACTTGA TCTGGTTAAG TGGCAAATTC GGATTGCTTC CGGCGAACGC
TTGACGCTCA CTCAGGATGA CATTAAAATA CGCGGGCATG CGATTGAATG TCGGATTAAT
GCCGAAGATG CCGACCGTGA TTTTATGCCT GCTGGCGGGA CTGTCGATCT CTACTTGCCG
CCAGGTGGCC CAGGGGTACG GGTCGATTCG CATCTTTATT CAGGTTATCG CACTCCTACC
AACTACGATT CGATGCTTGC CAAAGTGATC GTCTGGGGGG AAACGCGGCT TGAGGCAATT
GAACGTATGC GGCGAGCATT AAGCGAATGT GTGATCAATG GCATTACGAC CACCTTGCCA
TTTCAACTGC GCATGATGAA CGAGCCAGCT TTTGTGAGCG GCGATGTTGC AACGCACACC
TTGGCTGATA TTTTAAATCA ACAGGCTGCC AAAGAAGCGA CAGCGTAG
 
Protein sequence
MLRKILIANR GEIAVRIIRA CHELGIKAVA AYSEADRDSL AVRMADEAIC IGPPPPAKSY 
LNAPALISAA LISDCDGIHP GYGFLSENPY FAESCRECGL TFIGPSADSI QRMGDKALAK
QAMKLAGLPL VPGTENPLTS VEEAQSLADG IGYPVLLKAV AGGGGRGMRV VNQPDELARA
FNTARAEAEA AFGRGDLYME KYLPVVRHVE IQILADQHGH AIHLGERDCS LQRRHQKVVE
EGPSPALTPE LRQKMGEAAL HGVREIGYYN AGTMEFLLDH QGNFYFMEMN TRLQVEHPVT
EWLTGLDLVK WQIRIASGER LTLTQDDIKI RGHAIECRIN AEDADRDFMP AGGTVDLYLP
PGGPGVRVDS HLYSGYRTPT NYDSMLAKVI VWGETRLEAI ERMRRALSEC VINGITTTLP
FQLRMMNEPA FVSGDVATHT LADILNQQAA KEATA