Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4207 |
Symbol | |
ID | 5736919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5360845 |
End bp | 5362611 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281362 |
Product | acetyl-CoA carboxylase, biotin carboxylase |
Protein accession | YP_001546967 |
Protein GI | 159900720 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTCG ATACAGTGCT GATCGCCAAT CGCGGCGAGA TTGCCCTGCG CGTGATGCGG GCATGCAAGG AATTAGGTTT ACGCACCGTC GCCGTCTACT CAGAAGCGGA TCGCGATTCG CTCCACGTCC GTTATGCCGA CGATGCCTTT TTGATCGGTC CCCCTCCAGC AGTCCAAAGT TATTTGCAAA CTGAAACAAT CCTCGATGTT GCGCGGCGCA GTGGCGCTGG CGCGATTCAC CCTGGCTACG GCTTCCTCTC GGAAAATACC GATTTTGTGC GCACTTGTGA TGCGGCGGGC ATTGCCTTTA TTGGGCCAAC CGCTGATGCC ATGGATTTGA TGGGCGGTAA AATTCACGCT CGTCAAGTGG CTTTACGTGC TCATGTCCCG CTGGTTCCAG GCACAACCGA GGCGGTTGAA AGTGTCGCCG AAGCCCTTGA ACTTGGCGAA CAATATGGCT ACCCAATTGC CATCAAAGCT AGTGCTGGCG GCGGTGGCCG TGGCTTGAAG GTGGCCTACC AGCCTGAAGA AGTTGAATTT GCCTTCGAAA GTGCACGTCG CGAGGCCGAA GCGGCTTTCA AAAACGGCGA ATTGTTTGTT GAAAAATATG TGCTCGATCC ACGCCACATT GAAATTCAGA TTCTGGCTGA TCAATATGGC AATGTGGTGT ATTTGGGCGA GCGCGATTGT TCGGTGCAGC GTCGCCACCA AAAATTGATT GAAGAAACTC CATCGCCAGC AGTTTCGCCT GAATTGCGCC GCACCATGGG CGAATGTGCT TTGCGCTTAT GCCGCGAAAC GAGTTATGTT GGCGCTGGCA CTTTGGAGTT TTTGCTAGCC CCCGATGGTC AATTCTATTT CCTCGAAATG AACACGCGGA TTCAAGTTGA GCATACTGTG ACCGAAATGG TTACGGGCAT CGATTTGGTG CAAGCCCAAC TCCGCATCGC CCAAGGCGAA AAGCTTTGGT TCACCCAAGA GGAAATTCAG CTACGCGGCC ATGCGATTCA ATGCCGGATC AATGCTGAGG ATGCGGCGGC AGGTTTTCGC CCAGCGCTTG GCACGATCAG CGCTTACAAC GAACCTAAAG GCTATGGCGT GCGGGTCGAT GCTGGGGTTG AGCAAGGCAC AACCATTCCA CCATACTACG ATTCGATGCT TGCCAAATTG GTGACTTGGG GTGCAACTCG TGGCGAAGCC TTGCAACGCA TGCGCCGTGC CCTCAACGAT TACACGATCG AAGGCATTAC CACGGTGATT CCTTTCCATA AATTGGCTTT GGCTGAGCCA GCTTTTGAGC ATGGCGATGT GACGGTGAGC TTTATTCCAC GCTATTTGGA AGCAAAACTC AAGCAATTGC CAAGCGCCAC GCCGAGCACT ACCGAGCCAG CCGCCGAGCA ACCTAGCCGC GAATTAATGG TTGAGGTCAA TGGGCGACGC TTTGCTGTGC GCGTTGCTGG CGAAGGTTTG AATACACCAA TTGCTAGCAA TAAAGCGGCT GCGCCAACCC GCCATCGCGC TGCCAACAAA AAGCGTGAAG TTGCGACAGA TCCCAATGCA GTGATTTGTC CAATTCAAGG TACAATTGTG GCGATTAAAA CCAGCGTTGG CGCGGCGGTT GAGGCTGGCC AAGTGGTGTT TGTCGTCGAA GCAATGAAGA TGGAGAACGA AATCGCAACT CCACGGGCTG GTACAATTGC TACAATCAAT GCCGAGGTTG GCAAAAGCAT CGAGGCTGGC AGCGTGCTAG CAACCCTCGA AGCATAA
|
Protein sequence | MSFDTVLIAN RGEIALRVMR ACKELGLRTV AVYSEADRDS LHVRYADDAF LIGPPPAVQS YLQTETILDV ARRSGAGAIH PGYGFLSENT DFVRTCDAAG IAFIGPTADA MDLMGGKIHA RQVALRAHVP LVPGTTEAVE SVAEALELGE QYGYPIAIKA SAGGGGRGLK VAYQPEEVEF AFESARREAE AAFKNGELFV EKYVLDPRHI EIQILADQYG NVVYLGERDC SVQRRHQKLI EETPSPAVSP ELRRTMGECA LRLCRETSYV GAGTLEFLLA PDGQFYFLEM NTRIQVEHTV TEMVTGIDLV QAQLRIAQGE KLWFTQEEIQ LRGHAIQCRI NAEDAAAGFR PALGTISAYN EPKGYGVRVD AGVEQGTTIP PYYDSMLAKL VTWGATRGEA LQRMRRALND YTIEGITTVI PFHKLALAEP AFEHGDVTVS FIPRYLEAKL KQLPSATPST TEPAAEQPSR ELMVEVNGRR FAVRVAGEGL NTPIASNKAA APTRHRAANK KREVATDPNA VICPIQGTIV AIKTSVGAAV EAGQVVFVVE AMKMENEIAT PRAGTIATIN AEVGKSIEAG SVLATLEA
|
| |