Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0749 |
Symbol | |
ID | 5732472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 853715 |
End bp | 855082 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277879 |
Product | acetyl-CoA carboxylase, biotin carboxylase |
Protein accession | YP_001543525 |
Protein GI | 159897278 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000629358 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACGCA AAATTTTAAT TGCCAATCGT GGTGAAATTG CGGTGCGAAT TATTCGTGCT TGCCACGAGC TAGGCATCAA AGCAGTTGCC GCCTATTCCG AGGCCGATCG CGATTCGCTG GCGGTGCGTA TGGCCGATGA GGCGATTTGT ATTGGCCCGC CACCACCTGC CAAATCCTAT TTGAATGCGC CAGCCTTGAT TAGCGCTGCG CTGATTAGCG ATTGCGATGG GATTCACCCA GGTTATGGCT TTTTGTCGGA AAACCCCTAT TTTGCTGAAA GCTGCCGTGA GTGTGGTCTG ACTTTTATTG GCCCTTCAGC CGATTCGATT CAGCGCATGG GCGATAAAGC GCTGGCCAAG CAAGCCATGA AGTTGGCTGG CCTGCCGCTT GTGCCTGGCA CCGAAAACCC CTTGACCAGC GTTGAAGAAG CTCAAAGCCT TGCTGATGGT ATTGGCTACC CGGTTTTGCT CAAAGCTGTG GCTGGCGGTG GCGGGCGGGG CATGCGCGTG GTCAATCAGC CTGATGAATT GGCCCGAGCT TTTAATACTG CCCGCGCTGA GGCCGAAGCT GCCTTTGGCC GTGGCGATTT GTATATGGAA AAATACTTGC CAGTGGTGCG CCACGTTGAA ATTCAGATTT TGGCTGATCA ACATGGCCAT GCAATTCACC TTGGCGAGCG TGATTGCTCG TTGCAACGTC GCCACCAAAA AGTGGTGGAA GAAGGCCCAT CGCCTGCCTT GACCCCAGAA TTACGCCAGA AAATGGGCGA AGCCGCCTTG CATGGCGTGC GCGAAATTGG CTACTACAAC GCTGGCACAA TGGAATTTTT ACTCGATCAT CAGGGAAATT TCTATTTTAT GGAAATGAAC ACCCGTTTGC AGGTTGAGCA CCCTGTGACT GAATGGCTGA CCGGACTTGA TCTGGTTAAG TGGCAAATTC GGATTGCTTC CGGCGAACGC TTGACGCTCA CTCAGGATGA CATTAAAATA CGCGGGCATG CGATTGAATG TCGGATTAAT GCCGAAGATG CCGACCGTGA TTTTATGCCT GCTGGCGGGA CTGTCGATCT CTACTTGCCG CCAGGTGGCC CAGGGGTACG GGTCGATTCG CATCTTTATT CAGGTTATCG CACTCCTACC AACTACGATT CGATGCTTGC CAAAGTGATC GTCTGGGGGG AAACGCGGCT TGAGGCAATT GAACGTATGC GGCGAGCATT AAGCGAATGT GTGATCAATG GCATTACGAC CACCTTGCCA TTTCAACTGC GCATGATGAA CGAGCCAGCT TTTGTGAGCG GCGATGTTGC AACGCACACC TTGGCTGATA TTTTAAATCA ACAGGCTGCC AAAGAAGCGA CAGCGTAG
|
Protein sequence | MLRKILIANR GEIAVRIIRA CHELGIKAVA AYSEADRDSL AVRMADEAIC IGPPPPAKSY LNAPALISAA LISDCDGIHP GYGFLSENPY FAESCRECGL TFIGPSADSI QRMGDKALAK QAMKLAGLPL VPGTENPLTS VEEAQSLADG IGYPVLLKAV AGGGGRGMRV VNQPDELARA FNTARAEAEA AFGRGDLYME KYLPVVRHVE IQILADQHGH AIHLGERDCS LQRRHQKVVE EGPSPALTPE LRQKMGEAAL HGVREIGYYN AGTMEFLLDH QGNFYFMEMN TRLQVEHPVT EWLTGLDLVK WQIRIASGER LTLTQDDIKI RGHAIECRIN AEDADRDFMP AGGTVDLYLP PGGPGVRVDS HLYSGYRTPT NYDSMLAKVI VWGETRLEAI ERMRRALSEC VINGITTTLP FQLRMMNEPA FVSGDVATHT LADILNQQAA KEATA
|
| |