Gene Information |
Locus tag | Haur_1856 |
Symbol | |
ID | 5733745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2159191 |
End bp | 2160930 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279000 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001544627 |
Protein GI | 159898380 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
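The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon (assumed present) encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Haur_1856.
length = gene_length_bp(2159191, 2160930)
protein = protein_length_aa(length)
print(length, protein)
```

With the values in this record, the result should match the listed Gene Length (1740 bp) and Protein Length (579 aa).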
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTTGATC ATGATCTGAT GTTGGAGGCG GGTGGCAGTT CTGCGACGAT TCAATCCAAC CCTGCAAGTT TTGTTGAAAT TATTCTTCAG CGGGCCAAAA CCACTCCTGA AGCGATTGCG CTTAATTTTT ATACAGCTGA TGGCTATCGT ACCAGCCTAT CATATGGCTT ATTGGCGCAG CGTTGCCAAG CAATTGCTGC CGTTTTGCAA CGGGTGACCA AACCAGGCGA TCGCGCTTTG CTATTATACC CGCCTGGCTT GGGTTATGTG GAAGCATTTT TTGGCTGTTT ATTCGCTGGC GTGATTGCCG TTCCTGCTTA TCCGCCGCGA CCCAATCGCC GCGATCAACG TTTGGAAGCA ATTATTAGTA ATGCTCAAGC ACGGGTGATG CTGGCAGCCA GCGAGGTCGT GGCTCAGCAA CGCGCTTTAT CGCAACAATA TCCTGGCCTT GCTCAATTGC AATGGATCGC TAGTGATCTG GTCAATAGCC AATTGGCCAG TATGTGGCAA GCTCCCAGCA TCAATACCCA TGATTTGGCC TTTTTGCAAT ATACCTCTGG CTCAACTTCG CAACCACGCG GTGTAATGGT CAGCCACGAA AATCTGGTCT ATAACTCGGG CTTGATTGCC CAAAGTTTTG GCATCACGGC TGATGATCAT GTGTTTATTT GGTTGCCGCC TTACCACGAT ATGGGCTTGA TCGGTGGGAT TATGCAGCCG TTGTTTACTG GCTGTGAATT AAGTTTGATC GATCCCCTAA CCTTTTTGCA ACAACCGCTG ATTTGGCTCC GCATGATCAG CGATTTAGGG GTGACGGTCA CTGGCGGGCC AAATTTTGCC TACGATTTAT GCGTTGCCAA AGCCAAGCCC GAAGCCTTGG CAGGCGTTGA TCTAAGCCGT TTGCGGGTGG CATTTAATGG CGCTGAGCCA ATTCGTGCAG CAACCCTTGA GCGATTTAGC CGCACTTTTG CCCCGTTAGG CTTCAAACCT CAGGCCTTCT TGCCGTGTTA TGGCTTAGCC GAAGCCACGC TCTTTGTCAG TGGTGCGCCG CATGCTGCCG AACCAACCAC ACTCACGGTC GATGCTCAAG CCTTGAGCCA ACATCAAGCG CTGCCAAGTG AGCGTGGGAC ATTGTTGGTA AGTTCGGGCA TGGTGGCCGC GCCGCAAATT GTGGCGATTG TCGATCCTGA GCAGGGCCAG GTTTGTGCCG ATGGTTGGGT TGGCGAGGTC TGCATTCACG GGCGTAGCAT TGCCCATGGC TATTGGGATA ACTCCGCCGC TAGCGAGGCT ACTTTTCAAT TGATCTTGCC TGATGGCAGC GGCCCCTTCC TGCGCACGGG CGACCTTGGT TTTATCCACG AAGGGCAGTT GTATATTACT GGGCGGCTCA AAGACCTGAT TATTATTGAT GGGCGCAATC ATTACCCCCA AGACTTGGAA TTGAGCGTTG AATTGGCGCA TCCGGCAATC CGCCAAGGTG GTTGTGCCGC TTTTGCCGTC GATGGCGCTG ATGGTGAGCA AATTGTGATT GTGGCCGAAA TTCGCCGCCC CAATCAAGCC GAAGAAGCGG CTCAGGCTGT GCGCTTGGCC CTCCAGCAAC AATACGATTT AGCAATTGCC GATTTGATGT GGGTACGGCC TGGTCAAGTG CCCAAAACCT CAAGCGGCAA GGTGCGCCGC CGCGAATGCC GCCAACGCTA TTTGAGCCAA ACCCTCAATA GCTTGGAGGG CGAGGAATGA
Protein sequence | MLDHDLMLEA GGSSATIQSN PASFVEIILQ RAKTTPEAIA LNFYTADGYR TSLSYGLLAQ RCQAIAAVLQ RVTKPGDRAL LLYPPGLGYV EAFFGCLFAG VIAVPAYPPR PNRRDQRLEA IISNAQARVM LAASEVVAQQ RALSQQYPGL AQLQWIASDL VNSQLASMWQ APSINTHDLA FLQYTSGSTS QPRGVMVSHE NLVYNSGLIA QSFGITADDH VFIWLPPYHD MGLIGGIMQP LFTGCELSLI DPLTFLQQPL IWLRMISDLG VTVTGGPNFA YDLCVAKAKP EALAGVDLSR LRVAFNGAEP IRAATLERFS RTFAPLGFKP QAFLPCYGLA EATLFVSGAP HAAEPTTLTV DAQALSQHQA LPSERGTLLV SSGMVAAPQI VAIVDPEQGQ VCADGWVGEV CIHGRSIAHG YWDNSAASEA TFQLILPDGS GPFLRTGDLG FIHEGQLYIT GRLKDLIIID GRNHYPQDLE LSVELAHPAI RQGGCAAFAV DGADGEQIVI VAEIRRPNQA EEAAQAVRLA LQQQYDLAIA DLMWVRPGQV PKTSSGKVRR RECRQRYLSQ TLNSLEGEE