Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1171 |
Symbol | |
ID | 8731606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 1231272 |
End bp | 1233470 |
Gene Length | 2199 bp |
Protein Length | 732 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 646501787 |
Product | Chorismate binding-like protein |
Protein accession | YP_003392977 |
Protein GI | 284042637 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.416278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0752567 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCG TGCGGCTCCT CCGCGTGCCG CTCCCCGTCG ACTGCTCCCC CGAAGCGGCG CTGCGCGCGC TCGGCGGCGA CGCCTGGCCG TTCGCGCTGA CCGGTCGCTG GGCCGGCGGC GGGGCGATCG TCGGCTCCGA GCCGCTGCAC GTCGCCGAGC CCGGCGAGGA CCCGTTCGCG CTGCTCGATG CGCTGCCAGA CGTCGCGCCA GCGGACGGCG CCGGCAAGGA CGCAGCCGGC GCCGCGGATG CGGTCGGCGG CGGCTGGTTC GGCTGGCTCG GCTACGGCCT CGGCGCGCGC GTCGAGCGGC TGCCGCCGCC CCCGCCCCGG CCCGCGGCGC TGCCGCCCTT CCAGCTCGCC TACTACGACC ACGTCCTCCA CCTCGACGCG GCCGGCCGCT GGTGGTTCGA AGCGCTCGCC ACCTCCGACC GCCGCCCCCA CCTCGACGCC CGGCTCGCCC GCCTGCGTGA CCTCCTCAGC CGTCCTGCCC TGGTGTCGCA TAGCGACACC AACCCAGGAC AACCTCCAGA GTTCAGGGTC GCGGGCGGCG GGGCCGGCGC GCATGTCGCG GCGGTGGCGG AGTGCCGCGA GCGGATCGCG GCCGGCGAGA TCTTCCAGGC GAACGTCTGC CTGCGGCTGG AGAGCGCGTG GGAGGGCGAC GTCGCGGAGC TGTTCGCACG CGCGAGCGCG CGCCTGCGGC CGGCGTACGG AGCGGCGTTC CCAGCGCCGT GGGGCGGCAT CGCGAGCCTG TCGCCCGAGC TGTTCCTGCG CCGCCGCGGA CGCGCGGTCG AGACCGCGCC GATCAAGGGG ACGATCGCGC GCGCCGACGG AGCAGACGCC GCGCGCGACA GCGACGCCGC ACGCGCCAGC CTCAATGCCT CCGTCAAGGA CCACGCCGAG CACGTGATGA TCGTCGACCT GATGCGCAAC GACCTCGGCC GCGTCGGCGT CTACGGCTCG ATCGAGGCCG CTCCCACGCC GGAGGCGCAG GCGCACCCGG GCGTCTGGCA CCTCGTCTCG CGCGTGCAGG GGACGCTGCG GCCGGACGTC GGCGACGGCG ATCTGCTGCG CGCGACGTTC CCGCCCGGTT CGGTCACCGG CGCACCGAAG GTGCAGGCGA TGCACGTGAT CGCGGCGCTG GAGGGCACCG GCCGCGAGGT CTACACCGGC GCGATCGGCT ACGCGAGCCC GCTCGCCGGG CTGGAGCTGA ACGTCGCGAT CCGTACCTTC GAGGCGCGCG ACGGCCGCCT CTGGCTCGGC GCGGGCGGCG GGATCGTCGC CGACTCCGAC CCGGCGCGCG AGCTGGAGGA GTGCCTCGTG AAGGCGCGCC CGCTGGTCGC CGCGATCGGC GGAGCGATCG AGACGACGCC GCTCGGCGTG CTTCGCACTG CGAGTTCCAC GACGGCGCCC GCCCTCGACC GCAGGCGCGC CCGTCCCGAT CCGGCAGCGG GCGTCTTCGA GACGCTGCTC GTCCGCGACG GCGTCGCGCT GCACGCCGCC GAGCACGTCG CGCGGCTCGC GCACAGCGCC GAGCGGCTGT ACGGGCTGCC GCTGCCCGAC GACCTGGCCG AGCGCGTCAC GGCCGCGAGC GAAGGCGCGG GCGGGGTCCG CCTGAGAGTC GTCGCGGTTC CGTCGCGGGG CCGCCTCGAC GTCACGCTCG CGCTGGCGCC GAACCCGCGG CGCCAGCTCC CGGTCGTGCT CGCGCCGTTC ACGCTGTCCG GCGGCCTCGG CGCGCACAAG TGGCTCGACC GCCAGCTGCT GGACACGCTC GCGCAGCAGG CGGGCGGGGC GACGCCGCTG CTGCTCGACG GCGACGGGTC GGTGCTCGAA GCGGCGTGGG CGAACGTCTT CGCGATCGAA GAGGAGCGGC TGCTGACGCC GCCGGCCGAC GGCCGGATCC TGCCCGGCGT GACGCGCGCG CTGCTGCTGG ACGCCGCCGA GGAGGCCGGC CTCACGGCGG TCGAGCAGCC GCTGCGGCTC GACGACCTCG CCGCCGCCGA CGCGGTCCTG CTCAGCTCGT CCGCGGCACT CGTGGTGCCG GCGCGGCTCG CGCGCGAGCC GGGTGCCGAC GAGCGCGCCG CCGGTGAACG GGGCGCCGAC GAGCCGGGCA CCGACGAGCG TGCCGCGCGG CTCACCCGCC GGCTGCTCCG GGCCGTGGAG GCCCGGAGCG CGCCGGTGAG CGGAGCGCGT ACCGACTAG
|
Protein sequence | MTVVRLLRVP LPVDCSPEAA LRALGGDAWP FALTGRWAGG GAIVGSEPLH VAEPGEDPFA LLDALPDVAP ADGAGKDAAG AADAVGGGWF GWLGYGLGAR VERLPPPPPR PAALPPFQLA YYDHVLHLDA AGRWWFEALA TSDRRPHLDA RLARLRDLLS RPALVSHSDT NPGQPPEFRV AGGGAGAHVA AVAECRERIA AGEIFQANVC LRLESAWEGD VAELFARASA RLRPAYGAAF PAPWGGIASL SPELFLRRRG RAVETAPIKG TIARADGADA ARDSDAARAS LNASVKDHAE HVMIVDLMRN DLGRVGVYGS IEAAPTPEAQ AHPGVWHLVS RVQGTLRPDV GDGDLLRATF PPGSVTGAPK VQAMHVIAAL EGTGREVYTG AIGYASPLAG LELNVAIRTF EARDGRLWLG AGGGIVADSD PARELEECLV KARPLVAAIG GAIETTPLGV LRTASSTTAP ALDRRRARPD PAAGVFETLL VRDGVALHAA EHVARLAHSA ERLYGLPLPD DLAERVTAAS EGAGGVRLRV VAVPSRGRLD VTLALAPNPR RQLPVVLAPF TLSGGLGAHK WLDRQLLDTL AQQAGGATPL LLDGDGSVLE AAWANVFAIE EERLLTPPAD GRILPGVTRA LLLDAAEEAG LTAVEQPLRL DDLAAADAVL LSSSAALVVP ARLAREPGAD ERAAGERGAD EPGTDERAAR LTRRLLRAVE ARSAPVSGAR TD
|
| |