Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3579 |
Symbol | |
ID | 8545969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 4931853 |
End bp | 4933577 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646388248 |
Product | anthranilate synthase component I |
Protein accession | YP_003267974 |
Protein GI | 262196765 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.736027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0487715 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCATC CATCATCCGA GGCCTTCGTC GCCGCAGCCG AGCGCGGCAA TCTGATCCCC GTGTACCGCG AGATCGTGGC CGACGGCGAC ACGCCGGTGT CGGCCTACGC CAAGCTCGGC CGCGGTCCCT ACAGCTTCCT GCTCGAGTCC GTGGTCGGCG GCTCCACCTG GGCGGCGTAC TCGTTCATCG GCGTGGCGCC GCACGCGATC CTGCGCTGCA GCGACGGACG CGCCGAGCTG GTGCACTGCG GCGCGGGCGA GAAGCGGCGC ACCGAGATGT GGGACGCCCC CGACCCGAGC GCGGCGCTGG CCCAGGTCAT GAGCCGCTAC CGGCCGGTGC CGGTGGCCGG TCTGCCGCGC TTCTTCGGCG GCGCCGTGGG CTGGATGGGC TACGAGGTGG TGCGCGCCTT CGAGCGCCTG CCCACGAACG CGCCGCCGGG CGTGGACGTG CCCGACCTGT GCATGGTGCT CACCGACACC CTGGTGATCT TCGACAATCT GCGCCAAACC GTGAAGGTGG TCTCGTGCGC CCACGTGCCC GCGCTCGAGC GCGCCGAGGA GGCCTACCGC GCGGCCCAGG CGCGCATCGA CGAGATCGTC GAGCGATTGT CCGAGCGCGG CCCCGGGCTG CCCTTCTTGC AGGCGCCGCC GGTGGGCGAG GACGGCTCGA GCGCGCTGCG CTGGGGCGGG GCCGAGGAGC CCGAGTCCTC GTTTTCGCGC GAGGCCTATC AGGAGGCGGT CGAGCGCATC CGCTCGTACA TCCTGGCCGG CGACATCTTC CAGGCGGTGC TGTCGCAGCG GCTGCGCCTG CCGCGCGCGG GCCTCGACCT GTTCGACGTC TACCGCGCGC TGCGCATCAT CAACCCCTCG CCCTACATGT TCCACCTGGC CTTCCCCGAG GCCACGGTCA CCGGCGCCTC GCCCGAGACC CTGGTGCGCT GCGCCGAGGG CAAGGTCGAT GTGCGGCCCA TCGCCGGCAC CCGGCCACGC GGCGTCGACG AGCGCCAGGA TCGCGCGCTG GCCGACGAGC TGCGGGCCGA TCCCAAGGAG TGCGCCGAGC ACCTCATGCT GGTCGACCTG GGCCGCAACG ATGTCGGTCG CGTGGCCGAA ATCGGCTCGG TCGAGGTGTC CGAGTACATG AGCATCGAGC GCTACTCGCA CGTCATGCAC ATGGTCTCGC ACGTCCAGGG CACGCTGGCC GAGGGGCTGA GCTGGCACGA CGTGCTGCGG GCCGCGTTCC CGGCCGGCAC GCTCAGCGGC GCGCCCAAGA TCCGGGCCAT GGAGATCATC GACGAGCTCG AGCCGCACCG CCGCGGCATC TACGGCGGCG CGGTCGGCTA CGTGTCGTAC TCGGGCAACA TGGACTCGGC CATCGCCATC CGCACGCTGG TGGCCACCGA GCACGATATC TACGTGCAGG CGGGCGCCGG CATCGTCCAC GACTCCGACC CCAAGGCGGA ATACGAAGAG ACGCTCAACA AGGCGCGCGC GCTGCTGCGC GCGGTGGCGC TGGCGCGCGG CGAGGCCGCC GTGGTGGCGG AGCCCGAGGA GCGCGCCGAG GCCGGTGATG AGGCCGATGC CCACGCCGGT CGCGCTGGCG CAGCGAGCGC GGCCGCCCAG CCCGCGACCA CGCCCCGGCA GGCGGCGGCC GCGACGACCG AGGCCGCGAT TCCCCTGGGA ACCCTGGACG TGGGCGAGAC CGTGCCCACC GAGCCGAAAG GCTGA
|
Protein sequence | MYHPSSEAFV AAAERGNLIP VYREIVADGD TPVSAYAKLG RGPYSFLLES VVGGSTWAAY SFIGVAPHAI LRCSDGRAEL VHCGAGEKRR TEMWDAPDPS AALAQVMSRY RPVPVAGLPR FFGGAVGWMG YEVVRAFERL PTNAPPGVDV PDLCMVLTDT LVIFDNLRQT VKVVSCAHVP ALERAEEAYR AAQARIDEIV ERLSERGPGL PFLQAPPVGE DGSSALRWGG AEEPESSFSR EAYQEAVERI RSYILAGDIF QAVLSQRLRL PRAGLDLFDV YRALRIINPS PYMFHLAFPE ATVTGASPET LVRCAEGKVD VRPIAGTRPR GVDERQDRAL ADELRADPKE CAEHLMLVDL GRNDVGRVAE IGSVEVSEYM SIERYSHVMH MVSHVQGTLA EGLSWHDVLR AAFPAGTLSG APKIRAMEII DELEPHRRGI YGGAVGYVSY SGNMDSAIAI RTLVATEHDI YVQAGAGIVH DSDPKAEYEE TLNKARALLR AVALARGEAA VVAEPEERAE AGDEADAHAG RAGAASAAAQ PATTPRQAAA ATTEAAIPLG TLDVGETVPT EPKG
|
| |