Gene Information |
Locus tag | Gdia_0156 |
Symbol | |
ID | 6973548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 170440 |
End bp | 172008 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643389690 |
Product | anthranilate synthase component I |
Protein accession | YP_002274571 |
Protein GI | 209542342 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
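The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, although both agree with the listed values. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(170440, 172008)
print(length)                     # 1569
print(protein_length_aa(length))  # 522
```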
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.970752 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0258031 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCATCC CCTCCGCCTC CGCCGCCCCG GTTCCGGCCG GGCGCGACGA CGTGCTGGCC ACGCTCCGGC AGGGACAGGG CGCGGTGGTC TGGAGCATCG AGGCCGCGGA CCTGCTGACC CCGGTCGCCG CCTATATGCG CCTGTCGCGC CTGGCCGGGG CCAGCGACAC GGCGCCCCCG CGCAACGCGT TCCTGCTGGA AAGCGTCGAG GGCGGGGTGG CGCGCGGCCG GTATTCGGTG ATCGGCCTGC TGCCCGACCT GATCTGGCGC TGCCATGGCG GCGCGGCCAC GATCAACACC GACGCGGCGC GGGACCCGGC CGCGTTCGTG CCGGCCGGGG TGCCGCCGCT GGATTCGCTG CGCGCCGTGA TCCGCGCCAG CCAGATGACG TTGCCGTCCG GCCTGCCGCC CATGGTGGCC GGGCTGTTCG GCTATCTGGG CTATGACATG GTCCGGCAGA TGGAGCATCT GCCGGACATG CCGGCTGACG ACCTGGACCT GCCCGAAGGG GTGATGATCC GCCCCGGGCT GTTCGCGATC TTCGATACGG TGCGCGACGA ACTGATCCTG GCGGCGCCCG TGCGCCCCCG AAGCGACCGC ACGCCCGAAG CGGCATGGCA GGCGGCGCAG GACCTGCTGG CCACGGCGCG GCGCACCCTG TCCGAGCCGC TGCAACTGCA CGAGATCACG CCGGATTATA CCGGGCCGCT CGAGGCGCCG CGCTCGACCT TCACGCGTGA GGGTTTCTGC GCCATGGTCC GGCGCATTCA GGACTACATC GCGGCGGGCG ATGCCTTCCA GGTCGTGCCC AGCCAGCGTT TCTCGACCGC CTTCACGCTG CCGCCGCTGG CGCTGTACCG GGCGCTGCGC CGCATCAATC CGGCGCCGTT CCTGTTCAAC CTGGCGTTCG ACGGATTCAG CCTGGTGGGC TCGTCGCCCG AAATCCTGGT CCGGCTGCGC GACGGGCAGA TGACGGTGCG CCCGCTGGCC GGCACCCGCC CGCGCGGCCG GACGGACGAG GAGGATCTGG CGCTGGAGCG GGACCTGCTG GCCGACCCCA AGGAACTGGC CGAGCACCTG ATGCTGATCG ATCTGGGGCG CAACGATATC GGGCGGGCCT GTACCGTGGG TTCGGTCCAG GTGACCGAGA AATTCGTCAT CGAGCGCTTC AGCCACGTCA TGCACATTTC CTCGAACGTC GAGGGGCAGT TGCGGCCGGG GCTGGAGGCG CTGGATGCCC TGATCGCGGG CTTTCCCGCC GGGACCCTGA CCGGCGCGCC GAAGATCCGT GCGATGGAGA TCATCGACGA GGTCGAGCCG ACCCGCCGCG CCACCTACGC CGGATGCATC GGCTATTTCG GGGCGAACGG CGCCATGGAT ACCTGCATCG GCCTGCGCAT GGCCGTGGTC AAGGACGGGC AGATGCACGT GCAGGCCGGC TGCGGCGTGG TGGCCGACAG CGTGCCCGAC CTGGAATACG AGGAAACCCG GCACAAGGCG CGTGCCCTGT TCCGCGCGGC CGAGGACGCT GTGCAGTTCG CCCGCGGGCA GAACACGGCG GGATCATAA
|
Protein sequence | MSIPSASAAP VPAGRDDVLA TLRQGQGAVV WSIEAADLLT PVAAYMRLSR LAGASDTAPP RNAFLLESVE GGVARGRYSV IGLLPDLIWR CHGGAATINT DAARDPAAFV PAGVPPLDSL RAVIRASQMT LPSGLPPMVA GLFGYLGYDM VRQMEHLPDM PADDLDLPEG VMIRPGLFAI FDTVRDELIL AAPVRPRSDR TPEAAWQAAQ DLLATARRTL SEPLQLHEIT PDYTGPLEAP RSTFTREGFC AMVRRIQDYI AAGDAFQVVP SQRFSTAFTL PPLALYRALR RINPAPFLFN LAFDGFSLVG SSPEILVRLR DGQMTVRPLA GTRPRGRTDE EDLALERDLL ADPKELAEHL MLIDLGRNDI GRACTVGSVQ VTEKFVIERF SHVMHISSNV EGQLRPGLEA LDALIAGFPA GTLTGAPKIR AMEIIDEVEP TRRATYAGCI GYFGANGAMD TCIGLRMAVV KDGQMHVQAG CGVVADSVPD LEYEETRHKA RALFRAAEDA VQFARGQNTA GS
|
| |
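The gene and protein sequences can be related to each other with a minimal sketch like the one below. It relies on two facts: translation table 11 uses the same amino-acid assignments as the standard genetic code, and an initiator codon such as GTG is read as methionine. The set of start codons here is an assumption limited to the three most common bacterial ones (table 11 permits a few more), and the helper names are hypothetical. Whitespace is stripped first because the sequences are displayed in blocks of ten.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only the most common bacterial initiator codons are handled.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in COMMON_STARTS:
            aa = "M"  # an initiator codon is read as methionine
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence above (20 bases, 6 complete codons).
print(translate_cds("GTGAGCATCC CCTCCGCCTC"))  # MSIPSA
# Last nine bases of the gene sequence: two codons followed by the TAA stop.
print(translate_cds("GGATCATAA"))              # GS
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow a comparison with the listed protein sequence and GC content; no result for the full sequence is claimed here.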