Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_1749 |
Symbol | |
ID | 8544131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 2403826 |
End bp | 2407425 |
Gene Length | 3600 bp |
Protein Length | 1199 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646386456 |
Product | amino acid adenylation domain protein |
Protein accession | YP_003266191 |
Protein GI | 262194982 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.82306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0351239 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCGA TCGTGTCCCT GCTCAGCCAG CTCCGTGAGC GCGATATCCG CGTGTGGCTC GACGGCGAGC GCGTTCGCCT CGACGCCCCC GCGGGCGCGC TGAGCGACGA GCTGCGCATC GCGCTCAAAG CCCGACGCGA CGAGCTGATC GCGTTCTTGC GCCAGGCGGA GAAGGCCCGT ACCGGCGCCG ACGAGCGTAT CGAGCCGGCG CCGCGCACCG GCCCACTGCC GCTGTCCATC GCCCAGCAGC GGCTGTGGAT CATCGAGCAA TTCGACGACA CCCACGGCGC CTATCACCTG CCCGCGGCCC TGCGCCTCGA GGGACCTTGC GATATCGACG CGCTGCGCGC CTGCCTCGAC GCCCTGGTCG CGCGCCACGA GATCCTGCGC ACGGTGTTCC CCCGCCGCGG CGGCAAACCC GTCCAGGTGG TCAGAGATCC CCAGGGGCTG CCGCTACCCA TCATCGAGCT GGCCCACGGC GCCGAGCAGC CGGCCGAGGC CTTTGTCACC CACGCCGAGC AGATCCGCCG CCACGCCGCC GAGGAGGCCG CGCGCCCCTT CGATCTGCAA CGCGAGCCGC CGCTGCGCGC CAGCCTGCTG CGCCTGGGCG CCGAAGCCCA CGTGCTGCTG CTGACCCTGC ACCACATCGC CGGCGACGCC CACTCGGTCG ACCAGATCTC GCGCGAGCTG AGCGAGCTAT ACCGGGCACA CGTCACCGGA CAGGCGGTCG ATCTCCCGGC GCTGCCCGTG CAGTACGCCG ACTACGCCCA CTGGGAGGCC GATCGGGCCC AGCGCGGCGC CTTCGATGCC GATCTCGATT ACTGGCAGAC GCGCCTGGCC GAGGCGCCCG CGCTCATCGA GCTGCCCACC GATCACGCCC GCACCCCGGC CGCGCGCTTC CGCGGCGACG CCGTCGAGCT AGCGGTGCCG GCGCCGCTGC TGCGCTCGCT GCGCGCTCTC GCCGACGAAT CCGGGGCGAC GCTGTTCATG ACCCTGCTGG CCGGCTACGG CGTCCTGCTG GCGCGCCACA GCGGCCAGAG CGACGTGGTC ATCGGCACCC CGGTGGCCAA CCGCGACGAC CAGACCGAGC ATCTCGTCGG GCTGTTCGTC AACACCTTGC CGCTGCGCGT GAGCGCCGAC ATGGAGCGCG GATTTCGGAC ACTTCTCGCG TCCGTGCGCG CGACCACGCT CGAGGACTTC GCGCATCGCG AGGTCCCGCT CGCCGAGCTG GTCGCCCGTA TCCAGCCCGA ACGCGACCCC AGTTACAATC CGCTGTTTCA AGTGACATTC GACTTGCAGC AGCGCCCGCC GGCCGAAGCC CTCGACCTCG CCGGCCTGTC GCTGAGCCTG CTCGACAGCG CCGAGGCCAG CACACAGTTC GATCTGGTCC TGTCGCTCAC GCCGCACGCG GGCGGGCTCC AGGGCGCGTT CCACTACCAG CGCGATCTCT TCGAGCGCGC CACCATCGAG CGCCTGCGCG ACCATTTCCT CACCCTGCTC GCGGCCATCG CCGCCGAGCC CGAGCGCGCA CTCGCGACCC TGCCGCTCAT GGCCGAGGAA GAGCGCGAGG CGCTGCTGGC CTCGTGCCGG CCCCAGGCCA GCTTCGCCAA CCCGGCGTGC TTGCACGAGC GCTTCGCCGC CCAGGCGCAG CGCCGCCCCG AGGCCGTCGC CCTGGTCTGT GAGGGGACCC AGCTCAGCTA CGGCGAACTG CACCGCCGCG CCAACCGCCT CGCACACCGG CTCCAGGCCC TGGGCGTGGC TCCCGAGGTG CGCGTCGGCC TGTGCATCGA GCGCTCGCTC GACATGCTCG TCGCCATCCT CGGCACCCTG CAAGCCGGCG GCGCCTACGT GCCCCTGGAC CCCGACTATC CGCCCGAGCG CGTGGGTTTC TGCGTGGCCG ACAGCGGTAT CAAGCACCTG ATCACGCGCA CCGCCGAGCG CGGCAAGCTC GGCGACATCG ACGAACTCGA CGGTGTGCAC GAGATTATTC TCGACCGCGA GGTCGACGAA CTCGCCGCCC TGCCCGACAC AGCGCCGCCG TGCGCGGCCA CCGCAGACAG CCTGGCCTAC GTCATCTACA CATCGGGCTC CACCGGTACG CCCAAGGGCG TGCAGGTCAC GCACAAAAAC GTCACCCGCC TGCACGACGC CACCGCCGAC ACCTACGGCT TCCACGACGG CGACGTGTGG CCCCTGTTCC ACTCCTACGC CTTCGACGTG TCGGTGTGGG AGATCTGGGG CGCGCTGCTG CACGGCGGTC GCCTGGTGAT CGTCCCCTGG CTGGTCACGC GCTCGCCCGT GGACTTCTAT CGACTGCTCG CCGACACCGG CGCCACCGTG CTCAACCAGA CCCCGTCCGC GTTCCGGCAA TTCGTCCATG CGGACCAACA ACTCGGCGAC GACGCTCCGG CGCTGTCGCT GCGCTACGTG ATCTTCGCCG GCGAAGCGCT CGAGCCGGCC TCGCTGCAGC CCTGGGTGGC CCGCCATGGG CTTGAGGCGC CGCGGCTGAT CAACATGTAC GGTATCACCG AGACCACCGT GCACAGCACC ATCCGCGAGC TCCGCGCCGC CGACCTGGTC CGCACCAAGA GCCCCATCGG CCGCCCCATC CCCGACCTCG GCCTATATCT GCTCGACGAG CATGGCCAGC CCGTGCCCGC GGGCGTGAGC GGCGAGATCT ACGTCGGCGG CGCCGGCGTC GCCCGCGGCT ATCTCGAGCG CCCGGAGCTC ACGGCCGCGC GGTTTCTCGC CGACCCCTTC GTCGCCGATG CCGACGCGCG CATGTACCGC TCGGGCGACC TCGCGCGCTG GACCCACGAC GGCGATCTCG AATACCTGGG CCGCAACGAC GCCCAGGTCA AAATCCGCGG CTTCCGCATC GAACTCGGCG AAATCGAGTC GCGTCTGGGC GCACACCCCG ATATCCGCGT CGCCGCGGTG CAGCCGTGGT CGCGCGGCGC CGACGGCCAA TCCGAGCAGC TCGTCGCCTA CGTGGTGCCG AGCGCGGCCG ACATCGTCCC CGACCCTGTG GCCCTGCGCC AGCACCTGCG CGGAGCGCTG CCCGACTACA TGATCCCGGC CGCCTTCGTG GTCCTCGATG CGCTGCCGCT CACGCCCTCG GGCAAGCTCG CGCGCCGCGC CCTGCCCGCG CCCGAACAAG CCGGCCAGGT GGCCGCGCCC GAGCGCCAGG GGCCGCGCAC GCCGCTCGAA AGCGAGCTGG TGGCCATCTG GCGCGAGGTG CTGGGCCCGG TGTCCGTGGG CGTGCTCGAC AGCTTCTTCG ACCTCGGCGG TCACTCCCTG AGCGCGCTGC AGATCCTGGC CCGCATTCAG GAGCGCTACG ACGTCGAGCT GCCCATGCGC GGCTTTTTCG AACGCTCGTC CATCGAACAA GTCGCCGAGT CGCTCACCGC GGCGCTGAGC GAAGCGAGCG CGAGCGAGGC CGCCCCCGGC AGCGACGCCG CGAGCGCGCT GGCCGCTGCG CCTGCACCGT CCGAAACCAC GTCCGCCCCT GCGTCTACGC CGCCGCGCCT CACCCGGCGT TCACGCGAAG CGCGCCGAGT CCGCGCTGTG CGCCCGCCCG CAGACTCCTC CGACTCCTAG
|
Protein sequence | MTPIVSLLSQ LRERDIRVWL DGERVRLDAP AGALSDELRI ALKARRDELI AFLRQAEKAR TGADERIEPA PRTGPLPLSI AQQRLWIIEQ FDDTHGAYHL PAALRLEGPC DIDALRACLD ALVARHEILR TVFPRRGGKP VQVVRDPQGL PLPIIELAHG AEQPAEAFVT HAEQIRRHAA EEAARPFDLQ REPPLRASLL RLGAEAHVLL LTLHHIAGDA HSVDQISREL SELYRAHVTG QAVDLPALPV QYADYAHWEA DRAQRGAFDA DLDYWQTRLA EAPALIELPT DHARTPAARF RGDAVELAVP APLLRSLRAL ADESGATLFM TLLAGYGVLL ARHSGQSDVV IGTPVANRDD QTEHLVGLFV NTLPLRVSAD MERGFRTLLA SVRATTLEDF AHREVPLAEL VARIQPERDP SYNPLFQVTF DLQQRPPAEA LDLAGLSLSL LDSAEASTQF DLVLSLTPHA GGLQGAFHYQ RDLFERATIE RLRDHFLTLL AAIAAEPERA LATLPLMAEE EREALLASCR PQASFANPAC LHERFAAQAQ RRPEAVALVC EGTQLSYGEL HRRANRLAHR LQALGVAPEV RVGLCIERSL DMLVAILGTL QAGGAYVPLD PDYPPERVGF CVADSGIKHL ITRTAERGKL GDIDELDGVH EIILDREVDE LAALPDTAPP CAATADSLAY VIYTSGSTGT PKGVQVTHKN VTRLHDATAD TYGFHDGDVW PLFHSYAFDV SVWEIWGALL HGGRLVIVPW LVTRSPVDFY RLLADTGATV LNQTPSAFRQ FVHADQQLGD DAPALSLRYV IFAGEALEPA SLQPWVARHG LEAPRLINMY GITETTVHST IRELRAADLV RTKSPIGRPI PDLGLYLLDE HGQPVPAGVS GEIYVGGAGV ARGYLERPEL TAARFLADPF VADADARMYR SGDLARWTHD GDLEYLGRND AQVKIRGFRI ELGEIESRLG AHPDIRVAAV QPWSRGADGQ SEQLVAYVVP SAADIVPDPV ALRQHLRGAL PDYMIPAAFV VLDALPLTPS GKLARRALPA PEQAGQVAAP ERQGPRTPLE SELVAIWREV LGPVSVGVLD SFFDLGGHSL SALQILARIQ ERYDVELPMR GFFERSSIEQ VAESLTAALS EASASEAAPG SDAASALAAA PAPSETTSAP ASTPPRLTRR SREARRVRAV RPPADSSDS
|
| |