Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1000 |
Symbol | |
ID | 5732903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1143673 |
End bp | 1145634 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278134 |
Product | acetate--CoA ligase |
Protein accession | YP_001543776 |
Protein GI | 159897529 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | [TIGR02188] acetate--CoA ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0778524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAGC ATGCTGCACC AACCACTGTT GGTCAATTTT CGGCAAGCAA CGACGCTGAA TCAGCATTCT ACTACCCACC TGCTGCCTTG GTTGAAAACT CGAACGTGCT GGCCTACGCT CGTGAAAAAG GTTTCGCCGA TGTCGAAGCC ATGCGTCAAT GGTCGATTGA GCATTATCAG GATTTCTGGG CCGACATGGC AACTCGGATG GTCGATTGGT ATATGCCATT TAGCAAGGTG CTCGACGATA GCAAAGCACC GTTCTACCAA TGGTTTAACG ATGGCAAGAT CAACATTGTG CACAATGCGC TTGATCGCCA TGTCAAAACT TGGCGCAAGA ATAAGCTGGC CTTGATTTGG GAAAGTGAAA AGGGCGATAA CAAGACCTAT AGCTATTGGC AATTGTTCAA ACGAGTCAAT AAATTTGCCA ATGTGCTCAA ATCGATGGGG GTCAAAAAGG GCGATACCGT GACGATCTAT TTGCCACGGG TTCCCGAAAT TGTGATTGCG ATGTTGGCTT GTGCCAAAAT TGGCGCGATG CACAGCGTGG TCTATGGTGG TTTTAGTGTC GAAGCGCTGC AAACTCGCAT CCAAGATGCC CAATCGCGGG TAGTGATTAC GGCTGACGGC GGTTATATGA ATGGCAAAGT CGTTGAGCTG AAAAAGATCA CCGACGATGC GATTAAGCAT TCACCAGTGG TCGAAATTGT GATTGTAGTC AAGCGCACAG GCCACGAAGT CGAAATGCAA CAAGGCCGCG ACTTGTGGTA TGAAGAATTG ATGGCCTTGC CAATCGCTTC GACCAAGTGC GAAACCGAGC AACTTGATGC TGAACATCCT TTATATATGC TCTACACTTC GGGCACGACT GGCGCTCCTA AAGGTTTAGT GCATACCCAT GGTGGCTATC AAGTTGGGGT TGCGACGACC TTGCACTTTA ATCTTGATAT CAAAGAAGAT GATGTGTATT GGTGTGCCGC CGATCCAGGC TGGGTCACGG GCCACTCCTA CATTGTCTAT GGTCCCTTGA TGCTCGGCGC AACCCAAGTG ATGTATGAAG GCGCACCAAC CTTCCCCTTC CCCAACCGCT GGTGGAATAT TGTCGAGCGC TATGGCGTAA CCGTGCTCTA CACTGCTCCC ACCGCAATTC GCGGCTTGAT GCGCTTTGGC GAAGCCTGGC CTAATCGCCA TGATCTTGGC TCGTTGCGCT TGCTTGGCTC AGTCGGCGAA CCAATTAACC CCGAAGCATG GCGCTGGTAT CATCGGGTGA TTGGCCGCAA CAATTGCCCA ATTATGGATA CTTGGTGGCA AACTGAAACC GGCAGCATGA TGATCACGCC TAATCCAACC ACGCCACTCA AGCCAGGCTC AGGCACTCGT GCCTCGTTTG GCATCGATGC CGATGTGGTC AACGATCAGG GTGAGCATGC CAGCGATGAT GAAGATGGTT TGTTGATTAT CCGCAATCCA TGGCCTTCGA TGTTGCGCAC GATTTATAAC AACCCTGAGC GCTATATCGA ACAATATTGG AGCCGAATTC CAGGTGTGTA CACCGCTGGC GATTCAGCCC GCAAAGACGA AGACGGCTAT TTCTGGGTAA TTGGTCGGAT CGACGATGTA ATCAAGGTTT CAGGCTATCG GTTGGGCACG GCTGAAGTTG AGTCGGCTTT GGTTTCGCAT CCATCAGTTG CCGAAGCTGC CGCAATCGGC TTGCCTCACG AAGTCAAAGG CAATGCGATT CATACCTTTG TGATTTTGAA GAATGGCTAC GAAGCCAACC AAGATCTTGA GGATGCGCTG ATTGCCCACG TTGGCAAAGT GATGGGGCCA ATTGCCCGCC CTGAGGCAGT ACAATTTGTG CCAAGTTTGC CTAAAACCCG CTCAGGCAAA ATTATGCGCC GCGTCCTCAA AGCCCGTGCC CTAGGCTTGC CCGAAGGCGA TTTGAGCACT TTGGAACAAT AA
|
Protein sequence | MTEHAAPTTV GQFSASNDAE SAFYYPPAAL VENSNVLAYA REKGFADVEA MRQWSIEHYQ DFWADMATRM VDWYMPFSKV LDDSKAPFYQ WFNDGKINIV HNALDRHVKT WRKNKLALIW ESEKGDNKTY SYWQLFKRVN KFANVLKSMG VKKGDTVTIY LPRVPEIVIA MLACAKIGAM HSVVYGGFSV EALQTRIQDA QSRVVITADG GYMNGKVVEL KKITDDAIKH SPVVEIVIVV KRTGHEVEMQ QGRDLWYEEL MALPIASTKC ETEQLDAEHP LYMLYTSGTT GAPKGLVHTH GGYQVGVATT LHFNLDIKED DVYWCAADPG WVTGHSYIVY GPLMLGATQV MYEGAPTFPF PNRWWNIVER YGVTVLYTAP TAIRGLMRFG EAWPNRHDLG SLRLLGSVGE PINPEAWRWY HRVIGRNNCP IMDTWWQTET GSMMITPNPT TPLKPGSGTR ASFGIDADVV NDQGEHASDD EDGLLIIRNP WPSMLRTIYN NPERYIEQYW SRIPGVYTAG DSARKDEDGY FWVIGRIDDV IKVSGYRLGT AEVESALVSH PSVAEAAAIG LPHEVKGNAI HTFVILKNGY EANQDLEDAL IAHVGKVMGP IARPEAVQFV PSLPKTRSGK IMRRVLKARA LGLPEGDLST LEQ
|
| |