Gene Information |
Locus tag | Haur_3798 |
Symbol | |
ID | 5735662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4768124 |
End bp | 4769623 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280950 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001546562 |
Protein GI | 159900315 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | [TIGR01923] O-succinylbenzoate-CoA ligase |
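The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the terminal stop codon; the record does not state either convention explicitly. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Haur_3798.
length = gene_length(4768124, 4769623)
print(length)                           # 1500
print(expected_protein_length(length))  # 499
```

Under these assumptions both results agree with the Gene Length (1500 bp) and Protein Length (499 aa) fields.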
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTACATTG GTGACTGGCT TGGCAAGCGC GAGTTGCTTA CGCCAGAGCG ACTAGCGCTG GTTGACGACC GTGATGGCGA GCGCTATAGC TACCGCCAAT TGAATAGCCG TGCCAATCGT TTGGCTGCGA GTTTGCGCCA ACGTTTTGGT GTAGGCAAAG GCGATCGAGT AGCAATTTTG GCCAAAAATC AAATTGGCTA CCTCGATGCC TTATTTGCCA CTGGCAAGCT TGGGGCGATT TTAGTGCCAC TCAATTGGCG CTTAACCGAG CATGAATTAA TTTATATGCT CAAAGATAGT GCATCGAGCA TATTGCTCTA CGATAGCCAA TTTGCGCCGC TGCTGCCAAC CTTGCGCAGT CAAACCCCAA TCAAGCAGTG TGTCCAGTTT GGGCCTGAAT ACGATCAACT GCTGACCCAA GCTAGCGATT TGCCGATCAG CGAATCAGTT GATCTTGATG ATCCGCACTT GATTTTGTAT ACCTCAGGCA CAACTGGCGC ACCCAAAGGC GCAGTACTTT CGCATCGGGT GCTGGTGTGG AATTCGCTCA ATACCAATGT TGGCTGGGAT TTACACGCCG ATGACGTGAG CATTATTCAT ACGCCACTAT TTCATACTGG CGGCCTGAAT GTGCTAACCC TGCCGATTTT GCATGCTGGT GGCACAATGG TTTTGATGCA AGAATGGAAT CCCGAGCGCT GTTTGCAATT GATTGAGCAA GAGCATGTGA CGATCTTTTT TGCCGTGCCA ACCATGTTTG AGATGCTGCT GCAAGCGCCC AATTTTGTCC AAACCAACCT CAGCAGTCTG CGTTTTTGCA TCGCTGGCGG CTCGCCCTGC CCAATTCCCT TGATCGAAGC CTATCAGCAG CGCAATATTC CGTTTCGCCA AGGCTATGGC CTGACCGAAG TTTCGGTTAA TTGCTTCACG CTCAACCCAG AGGATGCAAT TCGTAAGGCT GGATCGGTGG GCAAGCCGAT TTTTCACCTT GATGCCCGCA TTGTCGATGA GGCGGGCCGC GATGTGCCGA CCAACAGCAT TGGCGAATTG ATTTTATATG GGCCGACGGT GTGCAATGGC TACTGGCGCA ATCCGGTCGC AACCGCCCAA GCCCTGCAAA AAGGTTGGTT CTACACTGGC GATCTAGCAC GGGTCGATGC TGAAGGTTAT TTCTACATCG TTGATCGCAA GAAGGATATG TATATTTCTG GCGGCGAGAA TGTTTACCCT GCTGAGGTTG AAAACGTGCT CTATCAGCAC CCTGCGGTAC AAGAATGCGC CGTGATTGGC ATACCCGATA GTCGCTGGGG CGAGGTTGGG CGGGCTTTAG TGGTGTTGCG GCCAAGCACG CAGCTTGATG AGCCAACCCT GATCGCTTTT TGTCGCGAAC GCCTGGCTAG CTACAAAACC CCAAAATCGA TTTATTTCTT GCCTGAGTTG CCGCATAACG CCAGTGGCAA GGTCGTCAAG CCTGAGCTAC GCAAATTGTT TGGCTATTAG
Protein sequence | MYIGDWLGKR ELLTPERLAL VDDRDGERYS YRQLNSRANR LAASLRQRFG VGKGDRVAIL AKNQIGYLDA LFATGKLGAI LVPLNWRLTE HELIYMLKDS ASSILLYDSQ FAPLLPTLRS QTPIKQCVQF GPEYDQLLTQ ASDLPISESV DLDDPHLILY TSGTTGAPKG AVLSHRVLVW NSLNTNVGWD LHADDVSIIH TPLFHTGGLN VLTLPILHAG GTMVLMQEWN PERCLQLIEQ EHVTIFFAVP TMFEMLLQAP NFVQTNLSSL RFCIAGGSPC PIPLIEAYQQ RNIPFRQGYG LTEVSVNCFT LNPEDAIRKA GSVGKPIFHL DARIVDEAGR DVPTNSIGEL ILYGPTVCNG YWRNPVATAQ ALQKGWFYTG DLARVDAEGY FYIVDRKKDM YISGGENVYP AEVENVLYQH PAVQECAVIG IPDSRWGEVG RALVVLRPST QLDEPTLIAF CRERLASYKT PKSIYFLPEL PHNASGKVVK PELRKLFGY
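The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction and reverse-complement helper. All function names are hypothetical. The gene sequence starts with ATG and ends with TAG, which suggests it is shown in coding orientation even though the gene lies on the minus strand; treating the reverse complement as the forward-strand view of the replicon is an assumption based on that observation.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


# First two blocks of the gene sequence shown in the record.
fragment = clean_sequence("ATGTACATTG GTGACTGGCT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.45
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.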