Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2107 |
Symbol | |
ID | 5733995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2639277 |
End bp | 2640791 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279248 |
Product | condensation domain-containing protein |
Protein accession | YP_001544875 |
Protein GI | 159898628 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGCTC TCCAGCAACG CTTGGCGCAG CTTTCGCCCG AAAAACGCGC CTTGCTTGAA CAATTACGCT TGCAACGCCA AGCAACGCCA GCGATTACGC CGCGTGATCC TGCGCAAATG GTGCCGTTAT CGTTGGCACA GCAACGGCTT TGGCTGGTTG AGCAAATGCG GCCAGGCCAG GCTACCTACA TCATGCCCTG TTTGCTGATG ATCAACGGTT CGTTGGATCG TGCTGCTTTG GCGACGAGCC TGAGTTGGTT GCTCCAACGC CATGAAGTGC TACGCACCAG CATTCAGCCT GACAACCAAG GTGCATATCA ACAGATTCAC GAGCCTTGGC AGGTAGAATT GGGGTTGGTT GAGCTTGATC AAACCCAGCT TGAAACTCAG TTACGCCACT ATGCCCGACA ACCATTTGAT TTGCAGCACG CGCCGTTATG GCGTGCCGCG TTGTGGCGCT GTGCTCCAGA GCAGCATGTT TTAGGGTTGA TGCTGCATCA CATTATTGCT GATGGTTGGT CGTTAGGTGT GTTGATGCAC GATTTGGCGC TGGCCTATGC GCATTTTGCC ACAGGCGCAG CGTTAAACCT TGCTCCGCTG GCAATTCACT ATGGCGATTA TGCCTGTTGG CAAGCCCAAC AGCCAGCAAC GCTGACCCAA AGCAGCCGTG ATTTTTGGCT GAAGCTCTTG CACGATGTAC CAACTCTGGC CTTGCCAACC GACTATCCTC GACCTGCTGT TCAATCGTTT CAGGGGGCAC AAATCGCTTG GCGCTTGCCG AGGGAGTTGG TCGAACAACT CAAGCAACTT AGTCAAAGCC AGAGCGCAAC CTTATTTATG AGTTTGCTAA CGGCATTTTA TTGGCTTTTA CACTGGCTAA GTGAGCAAAC TGATTTAGTG GTTGGCACTG ATGTTGCAGG GCGACAACAG CCTGAGACCC ATCAATTGAT CGGCTTTTTC ATTAATCAAT TGGTGCTACG TCAACAAGTT CAGCCGCATC AATCGTTCCA AGCAAGCCTA CAACAAACCC GCCAGCTCAC ATTAGCGGCC TTTGACCACC AACATGTGCC ATTCGATCAA GTTGTCGAAT GGTTGAACGT TGCCCACGAC CCCAGCCGCA CGCCGTTGTT TCAGGTCAAA TTTGTGTTGC AAAACACGCC CTTGCCCAAC TTACAACTAG CAGGTGTGCA GCTTGAACGC ATGGATCTTG ATCCGCACAC TGCCAAATTT GATTTGCTGA TCAATCTTTG GGAAGAGCAT GCAGGCTTAG CGGGAACCCT TGATTACAAC ACCGATATTT TCGCAGCGCA GCGCATGCAG CAATTACTTG AGCGCTATCA GCTGGTGCTT CAGTTGATCG TGCTTGAACC GACGATCACG CTTGCGGTTG CGGTGCAGCG TTGCCAAAGC CACGAGCGCG AGCTTCAACA GCAACGGCTG GAGCAACGCA AGGCAGCAAA TCGTTCCAAA TTAATCCTGA ATCGGCGACG GTCGAGCACC GAGCCAACCC AATAG
|
Protein sequence | MDALQQRLAQ LSPEKRALLE QLRLQRQATP AITPRDPAQM VPLSLAQQRL WLVEQMRPGQ ATYIMPCLLM INGSLDRAAL ATSLSWLLQR HEVLRTSIQP DNQGAYQQIH EPWQVELGLV ELDQTQLETQ LRHYARQPFD LQHAPLWRAA LWRCAPEQHV LGLMLHHIIA DGWSLGVLMH DLALAYAHFA TGAALNLAPL AIHYGDYACW QAQQPATLTQ SSRDFWLKLL HDVPTLALPT DYPRPAVQSF QGAQIAWRLP RELVEQLKQL SQSQSATLFM SLLTAFYWLL HWLSEQTDLV VGTDVAGRQQ PETHQLIGFF INQLVLRQQV QPHQSFQASL QQTRQLTLAA FDHQHVPFDQ VVEWLNVAHD PSRTPLFQVK FVLQNTPLPN LQLAGVQLER MDLDPHTAKF DLLINLWEEH AGLAGTLDYN TDIFAAQRMQ QLLERYQLVL QLIVLEPTIT LAVAVQRCQS HERELQQQRL EQRKAANRSK LILNRRRSST EPTQ
|
| |