Gene Hoch_1366 details

Gene Information

Locus tag: Hoch_1366
Symbol:
ID: 8543748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 1811125
End bp: 1813413
Gene Length: 2289 bp
Protein Length: 762 aa
Translation table: 11
GC content: 69%
IMG OID: 646386078
Product: para-aminobenzoate synthase, subunit I
Protein accession: YP_003265813
Protein GI: 262194604
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
            [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase
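
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers agree with that reading) and that the final codon of the CDS is a stop codon. The function and constant names are illustrative only and are not part of the source database.

```python
# Values copied from the record above; all names here are illustrative.
START_BP = 1811125
END_BP = 1813413


def gene_length_bp(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 2289, matching "Gene Length"
print(protein_length_aa(length))  # 762, matching "Protein Length"
```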


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0518359
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
GTGAAGACGC TACTCGTGGA TAACTACGAT TCGTTCACGT TCAATCTCTA CCAAATGATC 
GCAGAGGTGA ACGGAGAGGA ACCGATCGTC ATTCACAACG ATCAGCTCGC GTGGACGGAG
GTCGACGAGA GCGTCTACGA CAACATCGTG ATCTCCCCGG GACCGGGCAG ACCGGAGCGC
GCGGATGACT TCGGCCTGTC CCGCGACGCC ATCGAGCACG CGCGGATTCC GCTTCTGGGT
ATCTGTCTCG GCCATCAGGG CATCGGCCAC GTCTGCGGCG GGCGGGTGGT CCACGCGCCG
ACGGTGATGC ACGGCCGGCG CAGCTCTGTA TTCCACAACG GCGCGCCGCT GTTTCGCGAT
ATCCCGCAGG GGGTCGCGGT GGTGCGCTAT CACTCGCTGA CGCTGGAGGA GCCGCTGCCC
CCGGAGCTGG AGCGGCTGGC GTGGACGGCC GACGGTGTGC TCATGGGTCT GCGGCATCGC
CAGCGGCCGC TCTGGGGCGT GCAGTTCCAT CCGGAATCGA TCTGCACCGA GCTGGGCGAC
CAGCTCCTGA AGAATTTTCG CGACCTGAGC GCGGCCCACG CGGCGTCGAG CGGTCGGCCC
CGGGTGGGCG GACGCGGCGC GCGTCAGCCG CAGTCGGTGA TCCCGAAGGG CACCCGGCGC
GAGCGCACGA GCTCGGCGCT GACGCTGCAC ACGCGCCAGC TCGCGCACAT GCCGGACCCG
GAGCAGGTGT TCGTACAGCT CTACGGCGCC GCGCGCTATG CGTTCTGGCT GGACAACGCG
GCCCAGGCGA CCGGGGCCGA GCGCTTCTCG TTCATGGGCG CGGCGGACGG TCCCCACGCG
CGCGCGCTGC GCTACGCGGC CACGAGCGGG GAGCTGGAGA TCCGGCGCAC GGGCGAGCCG
ACAGAGACAG GGACAGGGAC AGGGACAGGC GAGCGCGTCG AGCGGCGAAC GCAGAGCATC
TTCGATTATC TGCATCGCGA GCTCGACGAG ATGTATGTGA GCTCGCCCGA GCTGCCCTTC
GACTTCAACA GCGGCTTCGT GGGCTATTTC GGCTACGAGC TGAAGCAGGA GTGCACGGGC
GTGCGCGGCC ACGACTCGCC CCACCCGGAT GCGCAGTTTC TGTTCGCCGA TCGCTTCATC
GTTTTCGACC ACGTGGAGAA CACGGGGTTC CTGGTCTGTC TGGCGCACGC GCGGGAAGCC
GAGGCGGCGC GGGCCTGGCT CGATGCCACG GAGCGGGCGC TGGGCGAGCT GGGGCCGGTG
GCGCCGCCCC AGCCCTGCGG CGGGCCCAGG CCGGTCCGCT TTGGCTTGCG GCGCCCGCCC
GAGGACTACT GCGACGACAT CCGCGCCTGC CTGCGCGAGA TCCACGAGGG CGAGACCTAC
GAGGTCTGCC TCACCAACAT GCTGAGCGCC GAGCTCGCGC TCGACCCGCT CGACTTCTAC
CGCGGGCTGC GCCGGAGCAA CCCCGCGCCC AAGGCCATGT ATCTGCGCTT TGGCGACATG
GCCGTGGCGT CGTCTTCGCC CGAGAGTTTT TTGCGCATCA CGGCGGATGG CTGGGTGGAG
TCCAAGCCCA TCAAGGGCAC GCTGCCGCGC GGCAGCACGC CCGAGGACGA CGCCATGCTG
CGCACGCAAC TGCGCGACAG CGAGAAGAAT CGCTCCGAGA ACCTGATGAT CGTCGACCTT
CTGCGCAACG ATCTCGGCAT CGTCTGCGAC ATCGGCACGG TGAGCGTGCC CGGGCTCATG
CTGGTCGAGA GCTACCGCAG CGTGCATCAG CTCGTGAGCA CCATCCGCGG CCACCTGCGC
GAGGACATGC GGGCCATCGA CTGCGTGCGC GCGACCTTTC CCGGCGGCTC GATGACCGGG
GCGCCCAAGG TGCGGACCAT GCACATCATC GACGATCTGG AGAGAGGCGC GCGCGGCGTC
TACTCGGGCT CGGCCGGCTA CCTGGCGCTG GGCGGCGCGG CGGATCTGAA CATCGTGATC
CGCACCGCCG TGATCGAGTC GGAGCGCGTG TCCATCGGCG TGGGCGGCGC GATCGTGGCG
CTCTCCGATC CCGAGGAGGA GATGGGCGAG ACCGTGATCA AGGGCGACGC TTTGATGCGC
GCGGTCGCAG TCGCCGCGCA CGGGAGCGAG GGCGCGCTCG ACTTCGCCGT CGACGGGGTC
GGCGGTCCGA GGGATCTGAG CTGGCCGGTG CGCAAGCAGG CCACGCCGGT GCGCGCGGAG
CCGGAGCCGG CGACCGAGCC GGCGACCGAG GGCGACGGGG CGACGACACA AAAGGCCGCG
TCGATCTAG
 
Protein sequence
MKTLLVDNYD SFTFNLYQMI AEVNGEEPIV IHNDQLAWTE VDESVYDNIV ISPGPGRPER 
ADDFGLSRDA IEHARIPLLG ICLGHQGIGH VCGGRVVHAP TVMHGRRSSV FHNGAPLFRD
IPQGVAVVRY HSLTLEEPLP PELERLAWTA DGVLMGLRHR QRPLWGVQFH PESICTELGD
QLLKNFRDLS AAHAASSGRP RVGGRGARQP QSVIPKGTRR ERTSSALTLH TRQLAHMPDP
EQVFVQLYGA ARYAFWLDNA AQATGAERFS FMGAADGPHA RALRYAATSG ELEIRRTGEP
TETGTGTGTG ERVERRTQSI FDYLHRELDE MYVSSPELPF DFNSGFVGYF GYELKQECTG
VRGHDSPHPD AQFLFADRFI VFDHVENTGF LVCLAHAREA EAARAWLDAT ERALGELGPV
APPQPCGGPR PVRFGLRRPP EDYCDDIRAC LREIHEGETY EVCLTNMLSA ELALDPLDFY
RGLRRSNPAP KAMYLRFGDM AVASSSPESF LRITADGWVE SKPIKGTLPR GSTPEDDAML
RTQLRDSEKN RSENLMIVDL LRNDLGIVCD IGTVSVPGLM LVESYRSVHQ LVSTIRGHLR
EDMRAIDCVR ATFPGGSMTG APKVRTMHII DDLERGARGV YSGSAGYLAL GGAADLNIVI
RTAVIESERV SIGVGGAIVA LSDPEEEMGE TVIKGDALMR AVAVAAHGSE GALDFAVDGV
GGPRDLSWPV RKQATPVRAE PEPATEPATE GDGATTQKAA SI
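
The gene sequence begins with GTG, yet the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as a start codon and is then read as methionine. The sketch below illustrates that relationship on the first and last few codons of the record. It is a minimal example, not the database's own translation procedure: the codon table is built from the standard amino-acid string that table 11 shares with table 1, the set of start codons is an illustrative subset, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the standard
# amino-acid string (identical for translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Illustrative subset of the start codons allowed by table 11 (assumption:
# only these three are handled in this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna, first_codon_is_start=True):
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating GTG or TTG is read as methionine, not V or L.
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAGACGC TACTCGTGGA TAACTACGAT"))  # MKTLLVDNYD
# Last 9 bases of the gene sequence above (not a start position).
print(translate_cds("TCGATCTAG", first_codon_is_start=False))  # SI
```

Pasting the full gene sequence into `translate_cds` should reproduce the protein sequence listed above under the same assumptions.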