Gene Hoch_3579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3579 
Symbol 
ID8545969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4931853 
End bp4933577 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content73% 
IMG OID646388248 
Productanthranilate synthase component I 
Protein accessionYP_003267974 
Protein GI262196765 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.736027 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0487715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCATC CATCATCCGA GGCCTTCGTC GCCGCAGCCG AGCGCGGCAA TCTGATCCCC 
GTGTACCGCG AGATCGTGGC CGACGGCGAC ACGCCGGTGT CGGCCTACGC CAAGCTCGGC
CGCGGTCCCT ACAGCTTCCT GCTCGAGTCC GTGGTCGGCG GCTCCACCTG GGCGGCGTAC
TCGTTCATCG GCGTGGCGCC GCACGCGATC CTGCGCTGCA GCGACGGACG CGCCGAGCTG
GTGCACTGCG GCGCGGGCGA GAAGCGGCGC ACCGAGATGT GGGACGCCCC CGACCCGAGC
GCGGCGCTGG CCCAGGTCAT GAGCCGCTAC CGGCCGGTGC CGGTGGCCGG TCTGCCGCGC
TTCTTCGGCG GCGCCGTGGG CTGGATGGGC TACGAGGTGG TGCGCGCCTT CGAGCGCCTG
CCCACGAACG CGCCGCCGGG CGTGGACGTG CCCGACCTGT GCATGGTGCT CACCGACACC
CTGGTGATCT TCGACAATCT GCGCCAAACC GTGAAGGTGG TCTCGTGCGC CCACGTGCCC
GCGCTCGAGC GCGCCGAGGA GGCCTACCGC GCGGCCCAGG CGCGCATCGA CGAGATCGTC
GAGCGATTGT CCGAGCGCGG CCCCGGGCTG CCCTTCTTGC AGGCGCCGCC GGTGGGCGAG
GACGGCTCGA GCGCGCTGCG CTGGGGCGGG GCCGAGGAGC CCGAGTCCTC GTTTTCGCGC
GAGGCCTATC AGGAGGCGGT CGAGCGCATC CGCTCGTACA TCCTGGCCGG CGACATCTTC
CAGGCGGTGC TGTCGCAGCG GCTGCGCCTG CCGCGCGCGG GCCTCGACCT GTTCGACGTC
TACCGCGCGC TGCGCATCAT CAACCCCTCG CCCTACATGT TCCACCTGGC CTTCCCCGAG
GCCACGGTCA CCGGCGCCTC GCCCGAGACC CTGGTGCGCT GCGCCGAGGG CAAGGTCGAT
GTGCGGCCCA TCGCCGGCAC CCGGCCACGC GGCGTCGACG AGCGCCAGGA TCGCGCGCTG
GCCGACGAGC TGCGGGCCGA TCCCAAGGAG TGCGCCGAGC ACCTCATGCT GGTCGACCTG
GGCCGCAACG ATGTCGGTCG CGTGGCCGAA ATCGGCTCGG TCGAGGTGTC CGAGTACATG
AGCATCGAGC GCTACTCGCA CGTCATGCAC ATGGTCTCGC ACGTCCAGGG CACGCTGGCC
GAGGGGCTGA GCTGGCACGA CGTGCTGCGG GCCGCGTTCC CGGCCGGCAC GCTCAGCGGC
GCGCCCAAGA TCCGGGCCAT GGAGATCATC GACGAGCTCG AGCCGCACCG CCGCGGCATC
TACGGCGGCG CGGTCGGCTA CGTGTCGTAC TCGGGCAACA TGGACTCGGC CATCGCCATC
CGCACGCTGG TGGCCACCGA GCACGATATC TACGTGCAGG CGGGCGCCGG CATCGTCCAC
GACTCCGACC CCAAGGCGGA ATACGAAGAG ACGCTCAACA AGGCGCGCGC GCTGCTGCGC
GCGGTGGCGC TGGCGCGCGG CGAGGCCGCC GTGGTGGCGG AGCCCGAGGA GCGCGCCGAG
GCCGGTGATG AGGCCGATGC CCACGCCGGT CGCGCTGGCG CAGCGAGCGC GGCCGCCCAG
CCCGCGACCA CGCCCCGGCA GGCGGCGGCC GCGACGACCG AGGCCGCGAT TCCCCTGGGA
ACCCTGGACG TGGGCGAGAC CGTGCCCACC GAGCCGAAAG GCTGA
 
Protein sequence
MYHPSSEAFV AAAERGNLIP VYREIVADGD TPVSAYAKLG RGPYSFLLES VVGGSTWAAY 
SFIGVAPHAI LRCSDGRAEL VHCGAGEKRR TEMWDAPDPS AALAQVMSRY RPVPVAGLPR
FFGGAVGWMG YEVVRAFERL PTNAPPGVDV PDLCMVLTDT LVIFDNLRQT VKVVSCAHVP
ALERAEEAYR AAQARIDEIV ERLSERGPGL PFLQAPPVGE DGSSALRWGG AEEPESSFSR
EAYQEAVERI RSYILAGDIF QAVLSQRLRL PRAGLDLFDV YRALRIINPS PYMFHLAFPE
ATVTGASPET LVRCAEGKVD VRPIAGTRPR GVDERQDRAL ADELRADPKE CAEHLMLVDL
GRNDVGRVAE IGSVEVSEYM SIERYSHVMH MVSHVQGTLA EGLSWHDVLR AAFPAGTLSG
APKIRAMEII DELEPHRRGI YGGAVGYVSY SGNMDSAIAI RTLVATEHDI YVQAGAGIVH
DSDPKAEYEE TLNKARALLR AVALARGEAA VVAEPEERAE AGDEADAHAG RAGAASAAAQ
PATTPRQAAA ATTEAAIPLG TLDVGETVPT EPKG