Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1878 |
Symbol | |
ID | 5733767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2237957 |
End bp | 2239396 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279022 |
Product | condensation domain-containing protein |
Protein accession | YP_001544649 |
Protein GI | 159898402 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTTG AGAATTTAGA AGATATCTAT GGTTTGTCGC CTATGCAAAA GGGGATGCTT TTTCATACCC TGCTTGTACC CAATTCGAGT GCCTACCAAA ATCAATCAGT TTGGCAACTT GAGGGCCAGC TGAATCTGGC AGCCTTTGAA TGGGCTTGGC AACAGGTTAT CCAGCGCCAT TCAGTGCTGC GGACTGCCTT CTTTTGGGAA GATCTTGAGG AGCCGCTGCA AGTGGTGTTG CGCCAAGTTA CGCTACCATT GCAGATCATG GATTGGCGGG AATACTCGGC TGAGCAGCAA GCCCAACAAT TGGAGCCTTG GCTAGCGGCA GATTTAGCCC AAGGCTTTCA ATTAACTGCC GCACCCTTGC TACGCTTGAG CTTGATTCAA ATCGGCCCGA CCAGCTATTG GTTTTGTTGG AGCCGCCATC ATCTCTTGCT TGATGGTTGG TCGCAGGCGA TTGTGCTGAA GGATTTGTTT ACCTTCTATG AGGTCTATTG CTATGGCGAG CAAGCCAGCT TTAATCAGCA GGCAGTTTTA GGCCCACGCC GCCCGTATGG CGAATACATT GCTTGGCTTC AACAGCAGGA TCAGGCCAAG GCGGAGGCTT TTTGGCAACA ACTGTTGAAT AATTGGGCGG GGCCAGCGCG ACTTAGTTTT GCTCGTCAAG GTCGTTCGCA GCACAGTTAT GCTACCCAGC TGCTCCAGCT GGCAAGCGAA TTAACTAGCC AAGTTCAAAT GGTGATGCAG CAGGCCGAAT TAACCATCAA TAGCTTAATT CAGGGTGTGT GGGTTTGTCT CTTAGGTCAA TATAGCAATC AACATGATGT ACTGTTTGGG GTGACGGTTT CAGGCCGTCC GCCGAGCCTG CCAGCAATTG AAAGCATGGT TGGTTTGTTT ATCAATACGC TGCCGTTGCG AGCTCAAATT CAGCCTGAGC AGCTGTTTCT CGATTTGCTC AAACAGGTGC AAAGCCAGCA GTTGGCCATG AGCCAATATG AGTATAGCTC GTTGGTTGAT ATTCAAGGCT GGGCCAAATT GCCGCGTGAA CAAGCCATGT TCGAGACAGT CGTCGTGTTT GAAAATTACC CGATGGATAC GGCGGCTTTT ACCCAGCACT CAAGCCTTAA GCTCGATTTG CAGCGCACCT TTGTGCAAAA TAGCATGCCA TTGACCTTAC GGGCAATCCC AGGCGATCAG CTAACCCTCG ATGTGCTCTA CGATACTGAG CGTTTTACCG TAACCCAGAT CGAACGAGTA TTGCACGATT GTCAGTTGGT GTTGCAAGCC ATTGCTGCCA CGCCAATGAT TGCCGTTGCT GAGATTATGC ACCATTTACA ACAAGCTGAA GACCAATTTC AACAATTGGA AGAGCAACGA TTAAGAGATG CTAATGCTCA GAAACTCAAA ACGATCAAGC GCCGTTCGGT CGTTTCATAA
|
Protein sequence | MKVENLEDIY GLSPMQKGML FHTLLVPNSS AYQNQSVWQL EGQLNLAAFE WAWQQVIQRH SVLRTAFFWE DLEEPLQVVL RQVTLPLQIM DWREYSAEQQ AQQLEPWLAA DLAQGFQLTA APLLRLSLIQ IGPTSYWFCW SRHHLLLDGW SQAIVLKDLF TFYEVYCYGE QASFNQQAVL GPRRPYGEYI AWLQQQDQAK AEAFWQQLLN NWAGPARLSF ARQGRSQHSY ATQLLQLASE LTSQVQMVMQ QAELTINSLI QGVWVCLLGQ YSNQHDVLFG VTVSGRPPSL PAIESMVGLF INTLPLRAQI QPEQLFLDLL KQVQSQQLAM SQYEYSSLVD IQGWAKLPRE QAMFETVVVF ENYPMDTAAF TQHSSLKLDL QRTFVQNSMP LTLRAIPGDQ LTLDVLYDTE RFTVTQIERV LHDCQLVLQA IAATPMIAVA EIMHHLQQAE DQFQQLEEQR LRDANAQKLK TIKRRSVVS
|
| |