Gene Hoch_1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1749 
Symbol 
ID8544131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2403826 
End bp2407425 
Gene Length3600 bp 
Protein Length1199 aa 
Translation table11 
GC content72% 
IMG OID646386456 
Productamino acid adenylation domain protein 
Protein accessionYP_003266191 
Protein GI262194982 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.82306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0351239 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCGA TCGTGTCCCT GCTCAGCCAG CTCCGTGAGC GCGATATCCG CGTGTGGCTC 
GACGGCGAGC GCGTTCGCCT CGACGCCCCC GCGGGCGCGC TGAGCGACGA GCTGCGCATC
GCGCTCAAAG CCCGACGCGA CGAGCTGATC GCGTTCTTGC GCCAGGCGGA GAAGGCCCGT
ACCGGCGCCG ACGAGCGTAT CGAGCCGGCG CCGCGCACCG GCCCACTGCC GCTGTCCATC
GCCCAGCAGC GGCTGTGGAT CATCGAGCAA TTCGACGACA CCCACGGCGC CTATCACCTG
CCCGCGGCCC TGCGCCTCGA GGGACCTTGC GATATCGACG CGCTGCGCGC CTGCCTCGAC
GCCCTGGTCG CGCGCCACGA GATCCTGCGC ACGGTGTTCC CCCGCCGCGG CGGCAAACCC
GTCCAGGTGG TCAGAGATCC CCAGGGGCTG CCGCTACCCA TCATCGAGCT GGCCCACGGC
GCCGAGCAGC CGGCCGAGGC CTTTGTCACC CACGCCGAGC AGATCCGCCG CCACGCCGCC
GAGGAGGCCG CGCGCCCCTT CGATCTGCAA CGCGAGCCGC CGCTGCGCGC CAGCCTGCTG
CGCCTGGGCG CCGAAGCCCA CGTGCTGCTG CTGACCCTGC ACCACATCGC CGGCGACGCC
CACTCGGTCG ACCAGATCTC GCGCGAGCTG AGCGAGCTAT ACCGGGCACA CGTCACCGGA
CAGGCGGTCG ATCTCCCGGC GCTGCCCGTG CAGTACGCCG ACTACGCCCA CTGGGAGGCC
GATCGGGCCC AGCGCGGCGC CTTCGATGCC GATCTCGATT ACTGGCAGAC GCGCCTGGCC
GAGGCGCCCG CGCTCATCGA GCTGCCCACC GATCACGCCC GCACCCCGGC CGCGCGCTTC
CGCGGCGACG CCGTCGAGCT AGCGGTGCCG GCGCCGCTGC TGCGCTCGCT GCGCGCTCTC
GCCGACGAAT CCGGGGCGAC GCTGTTCATG ACCCTGCTGG CCGGCTACGG CGTCCTGCTG
GCGCGCCACA GCGGCCAGAG CGACGTGGTC ATCGGCACCC CGGTGGCCAA CCGCGACGAC
CAGACCGAGC ATCTCGTCGG GCTGTTCGTC AACACCTTGC CGCTGCGCGT GAGCGCCGAC
ATGGAGCGCG GATTTCGGAC ACTTCTCGCG TCCGTGCGCG CGACCACGCT CGAGGACTTC
GCGCATCGCG AGGTCCCGCT CGCCGAGCTG GTCGCCCGTA TCCAGCCCGA ACGCGACCCC
AGTTACAATC CGCTGTTTCA AGTGACATTC GACTTGCAGC AGCGCCCGCC GGCCGAAGCC
CTCGACCTCG CCGGCCTGTC GCTGAGCCTG CTCGACAGCG CCGAGGCCAG CACACAGTTC
GATCTGGTCC TGTCGCTCAC GCCGCACGCG GGCGGGCTCC AGGGCGCGTT CCACTACCAG
CGCGATCTCT TCGAGCGCGC CACCATCGAG CGCCTGCGCG ACCATTTCCT CACCCTGCTC
GCGGCCATCG CCGCCGAGCC CGAGCGCGCA CTCGCGACCC TGCCGCTCAT GGCCGAGGAA
GAGCGCGAGG CGCTGCTGGC CTCGTGCCGG CCCCAGGCCA GCTTCGCCAA CCCGGCGTGC
TTGCACGAGC GCTTCGCCGC CCAGGCGCAG CGCCGCCCCG AGGCCGTCGC CCTGGTCTGT
GAGGGGACCC AGCTCAGCTA CGGCGAACTG CACCGCCGCG CCAACCGCCT CGCACACCGG
CTCCAGGCCC TGGGCGTGGC TCCCGAGGTG CGCGTCGGCC TGTGCATCGA GCGCTCGCTC
GACATGCTCG TCGCCATCCT CGGCACCCTG CAAGCCGGCG GCGCCTACGT GCCCCTGGAC
CCCGACTATC CGCCCGAGCG CGTGGGTTTC TGCGTGGCCG ACAGCGGTAT CAAGCACCTG
ATCACGCGCA CCGCCGAGCG CGGCAAGCTC GGCGACATCG ACGAACTCGA CGGTGTGCAC
GAGATTATTC TCGACCGCGA GGTCGACGAA CTCGCCGCCC TGCCCGACAC AGCGCCGCCG
TGCGCGGCCA CCGCAGACAG CCTGGCCTAC GTCATCTACA CATCGGGCTC CACCGGTACG
CCCAAGGGCG TGCAGGTCAC GCACAAAAAC GTCACCCGCC TGCACGACGC CACCGCCGAC
ACCTACGGCT TCCACGACGG CGACGTGTGG CCCCTGTTCC ACTCCTACGC CTTCGACGTG
TCGGTGTGGG AGATCTGGGG CGCGCTGCTG CACGGCGGTC GCCTGGTGAT CGTCCCCTGG
CTGGTCACGC GCTCGCCCGT GGACTTCTAT CGACTGCTCG CCGACACCGG CGCCACCGTG
CTCAACCAGA CCCCGTCCGC GTTCCGGCAA TTCGTCCATG CGGACCAACA ACTCGGCGAC
GACGCTCCGG CGCTGTCGCT GCGCTACGTG ATCTTCGCCG GCGAAGCGCT CGAGCCGGCC
TCGCTGCAGC CCTGGGTGGC CCGCCATGGG CTTGAGGCGC CGCGGCTGAT CAACATGTAC
GGTATCACCG AGACCACCGT GCACAGCACC ATCCGCGAGC TCCGCGCCGC CGACCTGGTC
CGCACCAAGA GCCCCATCGG CCGCCCCATC CCCGACCTCG GCCTATATCT GCTCGACGAG
CATGGCCAGC CCGTGCCCGC GGGCGTGAGC GGCGAGATCT ACGTCGGCGG CGCCGGCGTC
GCCCGCGGCT ATCTCGAGCG CCCGGAGCTC ACGGCCGCGC GGTTTCTCGC CGACCCCTTC
GTCGCCGATG CCGACGCGCG CATGTACCGC TCGGGCGACC TCGCGCGCTG GACCCACGAC
GGCGATCTCG AATACCTGGG CCGCAACGAC GCCCAGGTCA AAATCCGCGG CTTCCGCATC
GAACTCGGCG AAATCGAGTC GCGTCTGGGC GCACACCCCG ATATCCGCGT CGCCGCGGTG
CAGCCGTGGT CGCGCGGCGC CGACGGCCAA TCCGAGCAGC TCGTCGCCTA CGTGGTGCCG
AGCGCGGCCG ACATCGTCCC CGACCCTGTG GCCCTGCGCC AGCACCTGCG CGGAGCGCTG
CCCGACTACA TGATCCCGGC CGCCTTCGTG GTCCTCGATG CGCTGCCGCT CACGCCCTCG
GGCAAGCTCG CGCGCCGCGC CCTGCCCGCG CCCGAACAAG CCGGCCAGGT GGCCGCGCCC
GAGCGCCAGG GGCCGCGCAC GCCGCTCGAA AGCGAGCTGG TGGCCATCTG GCGCGAGGTG
CTGGGCCCGG TGTCCGTGGG CGTGCTCGAC AGCTTCTTCG ACCTCGGCGG TCACTCCCTG
AGCGCGCTGC AGATCCTGGC CCGCATTCAG GAGCGCTACG ACGTCGAGCT GCCCATGCGC
GGCTTTTTCG AACGCTCGTC CATCGAACAA GTCGCCGAGT CGCTCACCGC GGCGCTGAGC
GAAGCGAGCG CGAGCGAGGC CGCCCCCGGC AGCGACGCCG CGAGCGCGCT GGCCGCTGCG
CCTGCACCGT CCGAAACCAC GTCCGCCCCT GCGTCTACGC CGCCGCGCCT CACCCGGCGT
TCACGCGAAG CGCGCCGAGT CCGCGCTGTG CGCCCGCCCG CAGACTCCTC CGACTCCTAG
 
Protein sequence
MTPIVSLLSQ LRERDIRVWL DGERVRLDAP AGALSDELRI ALKARRDELI AFLRQAEKAR 
TGADERIEPA PRTGPLPLSI AQQRLWIIEQ FDDTHGAYHL PAALRLEGPC DIDALRACLD
ALVARHEILR TVFPRRGGKP VQVVRDPQGL PLPIIELAHG AEQPAEAFVT HAEQIRRHAA
EEAARPFDLQ REPPLRASLL RLGAEAHVLL LTLHHIAGDA HSVDQISREL SELYRAHVTG
QAVDLPALPV QYADYAHWEA DRAQRGAFDA DLDYWQTRLA EAPALIELPT DHARTPAARF
RGDAVELAVP APLLRSLRAL ADESGATLFM TLLAGYGVLL ARHSGQSDVV IGTPVANRDD
QTEHLVGLFV NTLPLRVSAD MERGFRTLLA SVRATTLEDF AHREVPLAEL VARIQPERDP
SYNPLFQVTF DLQQRPPAEA LDLAGLSLSL LDSAEASTQF DLVLSLTPHA GGLQGAFHYQ
RDLFERATIE RLRDHFLTLL AAIAAEPERA LATLPLMAEE EREALLASCR PQASFANPAC
LHERFAAQAQ RRPEAVALVC EGTQLSYGEL HRRANRLAHR LQALGVAPEV RVGLCIERSL
DMLVAILGTL QAGGAYVPLD PDYPPERVGF CVADSGIKHL ITRTAERGKL GDIDELDGVH
EIILDREVDE LAALPDTAPP CAATADSLAY VIYTSGSTGT PKGVQVTHKN VTRLHDATAD
TYGFHDGDVW PLFHSYAFDV SVWEIWGALL HGGRLVIVPW LVTRSPVDFY RLLADTGATV
LNQTPSAFRQ FVHADQQLGD DAPALSLRYV IFAGEALEPA SLQPWVARHG LEAPRLINMY
GITETTVHST IRELRAADLV RTKSPIGRPI PDLGLYLLDE HGQPVPAGVS GEIYVGGAGV
ARGYLERPEL TAARFLADPF VADADARMYR SGDLARWTHD GDLEYLGRND AQVKIRGFRI
ELGEIESRLG AHPDIRVAAV QPWSRGADGQ SEQLVAYVVP SAADIVPDPV ALRQHLRGAL
PDYMIPAAFV VLDALPLTPS GKLARRALPA PEQAGQVAAP ERQGPRTPLE SELVAIWREV
LGPVSVGVLD SFFDLGGHSL SALQILARIQ ERYDVELPMR GFFERSSIEQ VAESLTAALS
EASASEAAPG SDAASALAAA PAPSETTSAP ASTPPRLTRR SREARRVRAV RPPADSSDS