Gene Hoch_2953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2953 
Symbol 
ID8545341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4024524 
End bp4027868 
Gene Length3345 bp 
Protein Length1114 aa 
Translation table11 
GC content66% 
IMG OID646387632 
ProductAcyl transferase 
Protein accessionYP_003267360 
Protein GI262196151 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.622829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGTC GTGAACTCAA GCTTGCCGAG TACGTCAAGC GCCTGACCCA AAAGCTGGAG 
GCAGCGGAGG CGCGCACCCA ACGCCTTGAA GCGAAGCTGA GCGAGCCCAT CGCCATCATC
GCCATGAGCT GCCGCTACCC GGGCGGTGTC TGCTCGCCAG AAGGGCTGTG GGAAGCATTG
CGCGAGGGTC GCGATCTCGT GTCGGCGTTT CCCGACGATC GGGGATGGGA TGTCGCCACC
CTGTACGACG AAGACCCGGA TGCGAAGGGC AAAACGTACA CCCGCGAGGG CGGGTTCTTG
GCCGAGCCGG GTCACTTCGA CCCTGCCTTC TTCGGCATCA GCGCCAAGGA AGCGCTGCTC
ATCGATCCGC AGCAGCGCCT GCTGCTCGAG ACCACGTGGG AAGCATTCGA ACGCGCTGGC
ATCGTCCCGT CGAAACTGCA GGGAAGCCCA ACCGGCGTGT TCATCGGCAT CATGTCTCAG
GACTACGTGA CGCGTCTTCT GACCGAGGAG GGCCTCGAGG ACGATGGCAC TCTGGGCCTC
GGTAGCTCCG CGAGCATCGC CTCGGGGCGA ATCGCCTATA AGTTCGGCCT CGAGGGCCCT
ACGGTCACGG TCGACACTGC CTGCAGTTCG TCCTTGGTCG CGTTGCACCT TGGAAGCCAG
GCCCTCCGCC AAGGCGATTG CAATCTGGCC CTCGTGGGCG GGGCAACCAT CATGGCAAGC
CCCACTACCT TCGTGGAGTT CAGCCGAATG CGTGGACTCT CGCCGGACGG TCGCTGCCGC
GCCTTTTCCC AGAACGCCAA CGGCACGGGC TTGTCGGAGG GCGTGGGGCT CGTTCTACTC
GAGCGCCTCT CCGACGCGCA GAAAAACGGC CATCCGATCC TGGCGACCGT ACGCGGCTCC
GCGGTCAACC AAGACGGCAA GAGCCAGGGC CTCACGGCGC CGAGTGGTAC CTCGCAGCAG
CGGGTGATTC GACATGCCCT GGCCGCCGCA GGGTTGTCCG CGCGCGAGGT CGACGCAGTC
GAGGCTCATG GCACAGGAAC GACGTTGGGC GACCCGATCG AAGCGCAAGC GCTCATCGCA
ATCTACGGTC AAGATCGCAC GAAGCACACG CCTCTGTGGC TCGGTTCGCT CAAGTCTACC
CTTGGACACG CCCAGGCTGC AGCCGGCATC GGGGGTGTCA TCAAGATGGT GATGGCGATG
CAGCACGGGC TTCTGCCTCG CACCCTACAC TCCGAGCCTC CCTCCTCCGA GGTCGACTGG
TCTGCCGGCA GCGTACGGCT CCTCAGCGAA GCCCGCCCGT GGAACGAAAA CGGCCATCCG
CGCCGGGCGG GCGTCTCTTC CTTTGGCGTC AGCGGCACCA ATTCCCACGC GATCCTCGAG
CAGCCGCGGG CGCAGGCTTC TATGCCCACC GACAGCTCGC CGACTCATAC CGCGCTTCCG
CTCTTGCTCT CGGGCCTCAC GGAGCCAGCG CTCCGTGCCC AGGCCGCCAA GCTGTCTGAG
TACCTCGCCC TTCACCCCCA GCTCAGTCTC GCCGACCTCG CCCATTCTCT CGCTCACACG
CGTAGTCATT TTGCCCAGCG GGCCGCCGTC GTCGCACACG ACCACGCTGC GCTGCGCAGC
GGTCTCGACG CCATCGCACG TGGCGAGTTC GCACAGCACG TGGTCTCTGG CCGACACAAG
CAAGTCCGTA AGGTTGCATT CGTGTTCCCA GGCCAAGGCT CCCAGTGGCC TGAGATGGCG
CGCTCGCTGC TCGTCCACTC GAGCGTCTTC CGCGCTCAGA TGGAGGACTG CGAGCGCGCG
CTGGCGCCCT TCCTGACCTG GTCGCCGCTC GCGGTTCTCG AGGGAGGGCT CGACATCGAC
ATGGAGCGTC TGGATGTCGT GCAACCCCTG CTTTTTGCCA TGATGGTCTC GTTGGCAGCC
GTTTGGCGCC ACTACGGTGT AGAGCCGGAC GCGGTCATCG GACACAGCCA GGGCGAGATT
GCAGCGGCCT ATGTCGCCGG CGCTCTGTCA CTCGAGGATG CGGCCAGCGT CGTCGCGCTG
CGCAGCCGTA CCCTTACCTG CCTGCGCGGC AAAGGGGCGA TGGCTGCCGT CGAGCTCTCC
GTCAAGGATC TCGAGCCGCA CCTACAGCGA TTCGGTCAGC GCGTCGCGGT CGCGGGCATC
AACGGCCCTC ACTCCACGGT GCTCTCCGGA ACGCCCGAGG CCATCGACGC CTTGCTCGCG
ACGCTCGCCG AACAAGCCGC GCCTCAAATC TTCGCACGCA AGATCCGAGT GGACTACGCC
TCCCACGGTC CCCAAGTCAC GTCGCTGCGC GGCGAACTTG CGGAACAGCT CTGCGGCATC
CGCCCCCGCC CAACCAAGGT CCCGTTCTAC TCCACGGTGA GCGGGCAGCG AATCGACGGC
GCGGCCCTCG ATGGAGACTA CTGGTACCAG AATCTCCGGC ATACCGTGCT GTTCGATGCC
GCGGTTCAAC GGCTGCTCGA CGACGAGCAT CGGGTCTTCG TCGAGATAAG CCCGCACCCG
ATACTGAAGC TCGTCCTCCA CGAGGCGCTC GAGACTCGAA AAGGTTCCGG CATCGTCGTA
GGCTCGCTGC AACGCGGGAA CGGAAGCCTG GACCGCATCG TACTCGCGCT GGCCGAACTG
CATGTCCATG GCTACGCCGT GGACTGGTCC AAGGTTCTCC CCAGAGCGAA CACCGTTCTC
CTCCCTACCT ACGCCTTTCA GCGCCAACGC TACTGGCCGG AGAGTCCACA CCGCCCGCCC
CACCAACCCG GCTCGGCCCG GGAACGTGAC TTTCAGGACG CCGCGAAGCG GGCCGACCTG
GACGCCCTCG CGACGATCCT GCGCATCGAG GACGAGGCCG AGCGTGCTTC TCTCGCAGCC
GTCCTGCCCG CGCTCACTCG CCTCTATCGC GAACGTCCGG TGCCTCTGCA CTCAGGCAAC
CTGGAGCAGG CGGAGCAGAC CGACGAGTTG GAGCGGGCCG AGTCCGCTGC GCAGCGTCAC
CCGCGCCCCC CGGTCTCGGC CGCCTACGTC GCGCCATCTA CGGACCTCGA GTACCTGCTC
GCAGACGCCT GGCAACAGGT CCTCGGCGTG CGCGAGGTCG GTATTCACGA CGACTTCTTC
GAGCTAGGCG GAAACTCGCT CAACGCCAGT CAGATCTTGA CGCGGCTCAA GCGCTCCTTT
CCGGTGCCGA TTCGATTCGA CCTCTTCTTT GGAAACCCAA CGATCGCCAT GCTGGCTCCG
CGCATCGAGG AGCTACTCGT CGCCGCGCTC GAGTCCCTGC CAGATGAAGA AGTGCAGCGG
CTCCTGGCGA CCCACCCACC CGAACCCCGA GGAACTGATG CCTGA
 
Protein sequence
MSSRELKLAE YVKRLTQKLE AAEARTQRLE AKLSEPIAII AMSCRYPGGV CSPEGLWEAL 
REGRDLVSAF PDDRGWDVAT LYDEDPDAKG KTYTREGGFL AEPGHFDPAF FGISAKEALL
IDPQQRLLLE TTWEAFERAG IVPSKLQGSP TGVFIGIMSQ DYVTRLLTEE GLEDDGTLGL
GSSASIASGR IAYKFGLEGP TVTVDTACSS SLVALHLGSQ ALRQGDCNLA LVGGATIMAS
PTTFVEFSRM RGLSPDGRCR AFSQNANGTG LSEGVGLVLL ERLSDAQKNG HPILATVRGS
AVNQDGKSQG LTAPSGTSQQ RVIRHALAAA GLSAREVDAV EAHGTGTTLG DPIEAQALIA
IYGQDRTKHT PLWLGSLKST LGHAQAAAGI GGVIKMVMAM QHGLLPRTLH SEPPSSEVDW
SAGSVRLLSE ARPWNENGHP RRAGVSSFGV SGTNSHAILE QPRAQASMPT DSSPTHTALP
LLLSGLTEPA LRAQAAKLSE YLALHPQLSL ADLAHSLAHT RSHFAQRAAV VAHDHAALRS
GLDAIARGEF AQHVVSGRHK QVRKVAFVFP GQGSQWPEMA RSLLVHSSVF RAQMEDCERA
LAPFLTWSPL AVLEGGLDID MERLDVVQPL LFAMMVSLAA VWRHYGVEPD AVIGHSQGEI
AAAYVAGALS LEDAASVVAL RSRTLTCLRG KGAMAAVELS VKDLEPHLQR FGQRVAVAGI
NGPHSTVLSG TPEAIDALLA TLAEQAAPQI FARKIRVDYA SHGPQVTSLR GELAEQLCGI
RPRPTKVPFY STVSGQRIDG AALDGDYWYQ NLRHTVLFDA AVQRLLDDEH RVFVEISPHP
ILKLVLHEAL ETRKGSGIVV GSLQRGNGSL DRIVLALAEL HVHGYAVDWS KVLPRANTVL
LPTYAFQRQR YWPESPHRPP HQPGSARERD FQDAAKRADL DALATILRIE DEAERASLAA
VLPALTRLYR ERPVPLHSGN LEQAEQTDEL ERAESAAQRH PRPPVSAAYV APSTDLEYLL
ADAWQQVLGV REVGIHDDFF ELGGNSLNAS QILTRLKRSF PVPIRFDLFF GNPTIAMLAP
RIEELLVAAL ESLPDEEVQR LLATHPPEPR GTDA