Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3958 |
Symbol | |
ID | 5735819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4965375 |
End bp | 4967498 |
Gene Length | 2124 bp |
Protein Length | 707 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641281108 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001546718 |
Protein GI | 159900471 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAGG CTTTTATCCA AACGACATCG ACGAAATATC CCCAGAAAAC CGCAGTTATT GATGGACCCA GACGTATAAC CTACGAGCAA CTTGCCGCTT CCATCGGTTC ATTCGCCAAT GAACTGACTG CGGCCGGGGT TACTGAAGGC GAAAGCATCG CCTTGGTGCT ACCAAATTGC GCGGAGTTTG TCATCGGCTT TTACTCCACC CTGCATATCG GCGCGGTTGT TTTGGCACTC AATCCTCTGT TGAAGCACAA CGAGATCAAC TACTATCTTG CCGATGCCCA GGCCAGGGTT ATTCTCACTA CCAAGCTGTA CATGGGCATG TGCCGCGAGA TCGTCGCCGC AGCTGGCCGC TCAATCGAAA TCATTGCCCT GGATGGTGTG CTGGAGGGTT CCCGCGCTGC CGCATCGGAA CGCGCCGCGC CCGCGGCTGC CGACCCGCAT CGGCCAGCGT TGTTTCAGTA TTCCTCTGGT TCGACCGGGC GCCCGAAAAA AGTGATGCGC ACCTATGGCA ATCTGTGTGC CGAGGGTGAT AATTTTACCG CCACTGTCGG TATGACCCAC GACGATGTGA TTTTGTGCCT GGTTCCGCTT TTTCATGCCC ACGGGCTGGG CAATTGCTTG CTGGCTGCCA CCATGGTCGG AGCAACCCTA GTGATTTTGG AGCAGCCGAT GGACGGAAAT GCTGTTGTCG ACATGCCCTT TATCGCCCGT TGTGCCAGAG TCTTCGAGCT GATCGAAATC GAGCGGGTTA GCGTTTTGCC CGGCGTCCCC TATGTGTTTA GCGCACTCAG CAGTGCACAG GTGGGCTTCG AGCCTGCGCT GGGATCGCTG CGTTTGTGTT TTTCCGCCGG CAATTTCTTG ACCAGGGATG TGTTCGACGC CTTCCTGGAC CGCTTTGGCA TTGCCATCAA GCAACTCTAC GGCTGTACCG AGGCGGGATC GGTCACCATC AACCTTGAGG ATGATCCGAG CCTCGCCGCC AGCGTGGGGC TGCCGATACG CAATGTTGAA CTCCATATCT GCGATGAGCA AAAAAACCGG CTCGCGCCCG ACGCCATCGG CGAAATAGCT TTCAAAAGCC CCATGCTGAC CAGCGGCTAT GTCGGCCTGG AAGACATCAA CCGGGACATG TTTCGCGATG GCTTTTTCTT CACCGGGGAT CTCGGCAGGC TTGATGAGGC CGGGCGCTTG ACCATCACCG GTCGCAAAAA AATTTTTATC GATGTCGGCG GCAGGAAGGT CGATCCCCTG GAGATCGAGG ATGTCCTGCT AACGCATCCC CGGGTCAAGG AAGCGGTCGT CGTCGGCATC AAGGCGCCCT ATGGCGGCGA GTTTGCCAAG GCTGTGGCCG TGCTGGACGG CGAATGCACC CAGACGGAGC TCCTCCAGTA TTGCAAGGAC CGCTTGGCCG ACTTCAAAGT CCCGCGCATG ATTGAATTCC GCAACGAGAT TCCAAAAAGC CCCCTCGGCA AAATTTTACG CAAAAATCTG GTTGATGACT CAGCCGTGGC GGAGGTTGAA GCCCTTGGAT CAACCCTGAG CCAGCACATG CGATCCACTT CGTCTAGGGA ACAGCGCCTT TCGCTGGCCA AGCAATGCGT GCGCCAGCAG ATTGCCCGCA TCTCGGGTCT TGATGTCGCC CAGATCGGCC TTTCGAACGC CCTCAGCGAC TTCGGGCTTG ATTCGGCGCG GGCGATTGAA TTGCAGATGT CCCTGGAGAA TCTGATGGGT GCGGGTTTAT CGGCCACGAT GGTGTGGCAA TATCCCGATC TGGACTCGTT GAGCGGGTAT CTGGTGGATA TTGTTGACGC GCAGACGGCG GGCGCCGATC CCGCAGCCGT GCCGGATCGG GCGGCCGCGC CGCCGGCCGC TCGCCCCTCC GCTATCCAGG CGATCGACGA CCTTTCAGAT GATGCCATTG AGGCGCTTTT GCGTTCACAG GTCGATGGCA TCCTCCAGCC GCAGAACACG ACAAACGCCC ATACCATCCC CGGTTTGAAT GAGGGTGACT CGGCGGGTAT TGATCGGCTT GCCCAACTTT CCGACGAGGA TGTCACCGAT CTGCTGCTCA AGGAATTCGC ACGACTAAGC CGAACCGGCC AACCTGAAGC ATAG
|
Protein sequence | MSEAFIQTTS TKYPQKTAVI DGPRRITYEQ LAASIGSFAN ELTAAGVTEG ESIALVLPNC AEFVIGFYST LHIGAVVLAL NPLLKHNEIN YYLADAQARV ILTTKLYMGM CREIVAAAGR SIEIIALDGV LEGSRAAASE RAAPAAADPH RPALFQYSSG STGRPKKVMR TYGNLCAEGD NFTATVGMTH DDVILCLVPL FHAHGLGNCL LAATMVGATL VILEQPMDGN AVVDMPFIAR CARVFELIEI ERVSVLPGVP YVFSALSSAQ VGFEPALGSL RLCFSAGNFL TRDVFDAFLD RFGIAIKQLY GCTEAGSVTI NLEDDPSLAA SVGLPIRNVE LHICDEQKNR LAPDAIGEIA FKSPMLTSGY VGLEDINRDM FRDGFFFTGD LGRLDEAGRL TITGRKKIFI DVGGRKVDPL EIEDVLLTHP RVKEAVVVGI KAPYGGEFAK AVAVLDGECT QTELLQYCKD RLADFKVPRM IEFRNEIPKS PLGKILRKNL VDDSAVAEVE ALGSTLSQHM RSTSSREQRL SLAKQCVRQQ IARISGLDVA QIGLSNALSD FGLDSARAIE LQMSLENLMG AGLSATMVWQ YPDLDSLSGY LVDIVDAQTA GADPAAVPDR AAAPPAARPS AIQAIDDLSD DAIEALLRSQ VDGILQPQNT TNAHTIPGLN EGDSAGIDRL AQLSDEDVTD LLLKEFARLS RTGQPEA
|
| |