Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnuc_0745 |
Symbol | |
ID | 5053228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 |
Kingdom | Bacteria |
Replicon accession | NC_009379 |
Strand | - |
Start bp | 747471 |
End bp | 749294 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640470902 |
Product | anthranilate synthase component I/chorismate-binding protein |
Protein accession | YP_001155527 |
Protein GI | 145588930 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.766935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTTGC TCGATGATGC GCAAAGTACA GCGGCTAATC CAACCAGTCG CCTATACCAA AATCCGCTTC ACTACTGGTG CGTCAGTCCA AGTGGCGATA ACGCCATCGA TCATGAAGCA ACTCAAACTT GCCTTATAGA AATCACCACT GCACTGAGTA AGGGGCAATT TGTGGTTACG GCATTTGCCT ATGAGCTTGG AAGATCAATC CATCATTTGC CACAAAAACC AAATTCCTCT AATCATCACC CCTTGATCGA AGCTTGGTCA TTTCAAGATT ACAAGAAGCT CTCAAAATCA GAGGTAGATA CTCTGATTGA GGATGAATTA GGAAAAGAAG GTGTATCTCA AGGTATTGCA GGAGTATTGG AGGTTGAGGA AACCTTAACA GAAGCCCAGT TTGCATCAGA CATTGCCGTA ATACAGGAAT ATATTCGTAG TGGCGATAGC TATCAAATCA ATCACACCTA TCGGATCAAA GGGAAAACCT ACGGATCACC CCTAGGACTA TACCAACGCC TTCGCAATCG TCAGCCAGGT AGATTTGGTG CTTACATCAA ATCAAATACC CATCACCTCC TCTCTCAATC GCCTGAATTA TTCATTGAGC GCAATGGAGA TATTCTGAAA GCAATGCCAA TGAAGGGGAC TGCAAGCGCT CTATCGGATA CACCAGGTGC ATTATCTGCG GACCCCAAGA ATCAAGCTGA GAACGTGATG ATTGTCGATT TATTGCGCAA CGATTTAAGC CGCATCTCCC TACCTGGTAG CGTGACAGTA CCTAACTTAT TTGCAGTTGC AAGACATGGT GATGTCTTGC AAATGACCTC AACTGTAGAG GGTCAGATCA AACCAAGCAT CTCCCTTGTC AATATATTGG AGGCAGTCTT TCCCTGTGGG TCAGTCACTG GCGCCCCCAA AAAACGCAGC ATGGAAATTA TTCAAGAACT AGAACCCGAG GATCGCGGAT ATTACTGTGG CGCCTTGGGG TGGCTCGATC CAAGCGGTGA TTTTGCGCTG AGTGTCCCCA TCAGAACAGT AGAAATATCT GAAGATGCCA ACAGTCATGC TGCCCATTTC ACATTAGGAA TCGGCGCTGG AATAACAATT GATTCAGATG CGCAACAAGA ATGGCAAGAG TGCCATATTA AGTCTGCTTT TTTAAAGAAT TTACCTAGCC ATACTGGTTT ATTCGAAACC ATTGCCATTA AACAAGGCAA AACTCAGAAT ATTGAAGCTC ATCTCAATAG AATGCAAGCA TCCGCATTGG CTCTAGGCAT TCAGTTTGAT CGAGTAAATG CTCAAACAAC GATAAGTGAA GCATGCAAAG CGCTTAATTC AGATCATGAC TTTAGGCTCC GCTTAGATCT AAGTCCTACT GGAATCATTA GCGCGACTAC CGGACAACTT GAAGCCCTTA AACCGTTAGT AAAGATTTTT TGGGCTGCCG ATATTCTGCC TTTTGATTGC ACGATGTTTT CAGGGAATGC ATTGCTGCGC CACAAAGTTA CCCATCGAGG TCTCTATGAT CAAGCCTGGC AAGCTGCTGT TCAGCAAGGA GGGTTTGATG CGCTCTTTAT AAATGAACAA GGATTTGTGA CTGAGGGTGG GCGTTCAAGC ATCTTTATTA AACCATCCAA CAGCACTTCC TGGCTAACGC CCCCTGTATC TACAGGTCTA TTACCAGGTG TTATGCGTGC AAGCTTACTC ACAGACCCTA AGATGAATGC CCGTGAAGCG AACCTGACTA TTCAGGATGT ATTAGAAGCA GATGAGATTA TTCTTAGTAA TGCGCTTCGA GGCGCTATCA AAGCCCATTT TTAG
|
Protein sequence | MILLDDAQST AANPTSRLYQ NPLHYWCVSP SGDNAIDHEA TQTCLIEITT ALSKGQFVVT AFAYELGRSI HHLPQKPNSS NHHPLIEAWS FQDYKKLSKS EVDTLIEDEL GKEGVSQGIA GVLEVEETLT EAQFASDIAV IQEYIRSGDS YQINHTYRIK GKTYGSPLGL YQRLRNRQPG RFGAYIKSNT HHLLSQSPEL FIERNGDILK AMPMKGTASA LSDTPGALSA DPKNQAENVM IVDLLRNDLS RISLPGSVTV PNLFAVARHG DVLQMTSTVE GQIKPSISLV NILEAVFPCG SVTGAPKKRS MEIIQELEPE DRGYYCGALG WLDPSGDFAL SVPIRTVEIS EDANSHAAHF TLGIGAGITI DSDAQQEWQE CHIKSAFLKN LPSHTGLFET IAIKQGKTQN IEAHLNRMQA SALALGIQFD RVNAQTTISE ACKALNSDHD FRLRLDLSPT GIISATTGQL EALKPLVKIF WAADILPFDC TMFSGNALLR HKVTHRGLYD QAWQAAVQQG GFDALFINEQ GFVTEGGRSS IFIKPSNSTS WLTPPVSTGL LPGVMRASLL TDPKMNAREA NLTIQDVLEA DEIILSNALR GAIKAHF
|
| |