Gene Information |
Locus tag | Cwoe_2853 |
Symbol | |
ID | 8733297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 3045542 |
End bp | 3047044 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646503466 |
Product | anthranilate synthase component I |
Protein accession | YP_003394647 |
Protein GI | 284044307 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
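The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `feature_length` is illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 3045542
end_bp = 3047044


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


gene_length = feature_length(start_bp, end_bp)

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1503 bp) and Protein Length (500 aa) fields.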
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGCCC CGCACGCCCA GCCCGCCGCT GCCGCCGCGA ACGCGAACGC GCTGCGCGTC ACGCCGTCGC TCGACGAGGT GCGCGAGCTC GCGCGCGAGC ACACGCTCGT GCCGCTGCGC CACACGTTCG TCGACGACAT CGAGACGCCC GTCTCCGCTT TCCTCAAGCT GCGCGGGGAC GGCCCGTCGT TCCTGCTGGA GTCGGCCGAG CAGGGCCGCA TGGGACGCTG GTCGTTCATC GGCTTCCGGC CGCGCAGCGT GCTGCGCTGG TCGCTCGGCG ACGGCGGCGA CCCGTACGCG CTCGCCGCCG CCGAGGTCGC CCGCTCCAGA CAGGCGCAGG TCCCTGGCCT GCCGCCGTTC TCCGGCGGCG CGGTCGGGAT CTTCGGCTAC GACCTCGTCC GCACGGTCGA GCCGCTCGGC GAGCCGAACC CCGACCCGGT CGGCCTGCCG GACATGGCGC TGATGTTGAC CGACGTGATC GTCGCGTTCG ACCACCAGCG CCACGAGCTG TCGATCCTCG CGAACGTCGA TGCCGCCGCC GGCGACCTGG AGCAGGCGTA CGCCGCGGCG GTCGCGACGA TCGAGGAGGT CCGCTGGAAG CTCTCCGGGC CGGTCCCGCG GCCGGCGCGG CCGCCCGCCG CGCGCGACCC TGAGCAGCCC GTCGACTTCC AGAGCAACAT GCCGCGCGAG CAGTTCGAGG GCATGGTCGA GCGGATCGTC GAGTACATCC ACGCCGGCGA CGCCTACCAG GTGGTGCCCT CCCAGCGCTG GTCGGCGGAG GTCCCGATCG AGGCGTTCTC GATCTACCGC GGGCTGCGCG CCGTCAACCC CAGCCCGTAC ATGTACTTCC TCGACTTCGG CGACTTCGAG ATCGCCGGCG CGAGCCCCGA GCCGCTGCTG ACGGTGCAGT CCGGCGTCGT CCGCACGCGG CCGATCGCTG GCACGCGCCC GCGCGGCACC GACGCCGCGG ACGACGCGCG CCTCGCCGCC GACCTGCTCG CCGACGAGAA GGAGCGCTCC GAGCACGTGA TGCTCGTCGA CCTCGCGCGC AACGACGTCG GCCGTGTCAG CGAGTACGGC AGCGTCAACG TCGACGGCTA CATGGAGATC GAGAACTACA GCCACGTGAT GCACATCGTC TCGCGCGTCT CGGGCCGTCT GCGCGAGGGG ATCGGCCCGC TCGACGCGCT GCGCTCGATC CTGCCGGCCG GAACGCTCTC GGGTGCGCCG AAGGTCCGCG CGATGCAGAT CATCGACGAG CTGGAACCGG TCAAGCGGGG CGGCTACGGT GGGGCGATCG GCTACCTCTC GTACACCGGC GACCTCGACA CGTGCATCCA CATCCGTACG GTCGTCGTCA AGGACGGCGT CGCCCACGTG CAGGCGGGCG GCGGCACGGT CGCCGACGCG AAGCCCGACT ACGAGTTCCG CGAGTCCGAG GCGAAGGCGC GCGCGGTGCG CCAGGCGATC GCGCTGGCGG TGGCGCAGCC GGAGTGGCCC TGA
Protein sequence | MGAPHAQPAA AAANANALRV TPSLDEVREL AREHTLVPLR HTFVDDIETP VSAFLKLRGD GPSFLLESAE QGRMGRWSFI GFRPRSVLRW SLGDGGDPYA LAAAEVARSR QAQVPGLPPF SGGAVGIFGY DLVRTVEPLG EPNPDPVGLP DMALMLTDVI VAFDHQRHEL SILANVDAAA GDLEQAYAAA VATIEEVRWK LSGPVPRPAR PPAARDPEQP VDFQSNMPRE QFEGMVERIV EYIHAGDAYQ VVPSQRWSAE VPIEAFSIYR GLRAVNPSPY MYFLDFGDFE IAGASPEPLL TVQSGVVRTR PIAGTRPRGT DAADDARLAA DLLADEKERS EHVMLVDLAR NDVGRVSEYG SVNVDGYMEI ENYSHVMHIV SRVSGRLREG IGPLDALRSI LPAGTLSGAP KVRAMQIIDE LEPVKRGGYG GAIGYLSYTG DLDTCIHIRT VVVKDGVAHV QAGGGTVADA KPDYEFRESE AKARAVRQAI ALAVAQPEWP
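The gene and protein sequences above can be related with a small translation routine. This is a minimal sketch using only the Python standard library, not the database's own tooling. It assumes the space-separated 10-base blocks are purely cosmetic, and it builds the codon table from the standard amino acid assignments, which translation table 11 shares for internal codons (table 11 differs in the start codons it permits; this gene starts with ATG, so that does not matter here). The helper names `clean`, `gc_fraction` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for internal codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_30 = "ATGGGAGCCC CGCACGCCCA GCCCGCCGCT"
print(translate(first_30))  # MGAPHAQPAA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.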