Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2363 |
Symbol | |
ID | 6065139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2603681 |
End bp | 2605243 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641601766 |
Product | anthranilate synthase component I |
Protein accession | YP_001725325 |
Protein GI | 170020371 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00565] anthranilate synthase component I, proteobacterial subset |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000438966 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAACAC AAAAACCGAC TCTCGAACTG CTAACCTGCG AAGGCGCTTA TCGCGACAAT CCCACCGCGC TTTTTCACCA ATTGTGTGAG GATCGTCCGG CAACGCTGCT GCTGGAATCC GCAGATATCG ACAGCAAAGA TGATTTAAAA AGCCTGCTGC TGGTAGACAG TGCGCTACGC ATTACAGCTT TAGGTGACAC TGTCACAATC CAGTCCCTTT CCGGCAACGG CGAAGCCCTG CTGACACTAC TGGATAACGC CCTGCCTGCG GGTGTGGAAA ATGAACAATT ACCAAACTGC CGTGTGCTGC GCTTCCCCCC TGTTAGTCCA CTGCTGGATG AAGACGCTCG CTTATGCTCC CTTTCGGTTT TTGACGCTTT CCGCTTATTG CAGAATCTGT TGAATGTACC GAAGGAAGAA CGAGAAGCCA TGTTCTTCGG CGGCCTGTTC TCTTATGACC TTGTGGCGGG GTTTGAAGAT TTACCGCAAC TGTCAGCGGA AAATAACTGC CCTGATTTCT GTTTTTATCT CGCTGAAACG CTGATGGTGA TTGACCATCA GAAAAAAAGC ACCCGCATTC AGGCCAGCCT GTTTGCTCCG AATGAAGAAG AAAAACAACG TCTCACTGCT CGCCTGAACG ATCTTCGCCA GCAACTGACC GAAACCGCGC CACCGCTGCC GGTGGTTTCC GTGCCGCATA TGCGTTGTGA ATGTAACCAG AGCGATGAAG AGTTCGGTGG CGTAGTGCGT TTGTTGCAAA AAGCGATTCG CGCTGGAGAA ATTTTCCAGG TGGTGCCATC TCGCCGTTTC TCTCTGCCCT GCCCGTCACC GCTGGCGGCC TATTACGTGC TGAAAAAGAG TAATCCCAGC CCGTACATGT TTTTTATGCA GGATAATGAT TTCACCCTGT TTGGCGCGTC GCCGGAAAGT TCGCTCAAAT ATGACGCCAC CAGCCGCCAG ATTGAGATCT ACCCGATTGC CGGGACACGC CCACGCGGTC GTCGTGCCGA TGGTTCACTG GACAGAGACC TCGACAGCCG CATCGAACTG GAAATGCGTA CCGATCATAA AGAGCTTTCT GAACATCTGA TGCTGGTGGA TCTCGCCCGT AATGATCTGG CACGCATTTG CACCCCCGGC AGCCGCTACG TCGCCGATCT TACCAAAGTT GACCGTTACT CTTACGTGAT GCACCTGGTC TCCCGCGTGG TCGGTGAGCT GCGCCACGAT CTCGACGCCC TGCACGCTTA CCGCGCCTGT ATGAATATGG GAACGTTAAG CGGTGCGCCG AAAGTACGCG CTATGCAGTT AATTGCCGAG GCTGAAGGTC GTCGCCGCGG CAGCTACGGC GGCGCGGTAG GTTATTTCAC CGCGCATGGC GATCTCGACA CCTGCATTGT GATCCGCTCG GCGCTGGTGG AAAACAGTAT CGCCACCGTG CAAGCCGGTG CTGGCGTAGT CCTTGATTCT GTTCCGCAGT CGGAAGCCGA CGAAACCCGT AATAAAGCCC GCGCTGTACT GCGCGCTATT GCCACCGCGC ATCATGCACA GGAGACTTTC TGA
|
Protein sequence | MQTQKPTLEL LTCEGAYRDN PTALFHQLCE DRPATLLLES ADIDSKDDLK SLLLVDSALR ITALGDTVTI QSLSGNGEAL LTLLDNALPA GVENEQLPNC RVLRFPPVSP LLDEDARLCS LSVFDAFRLL QNLLNVPKEE REAMFFGGLF SYDLVAGFED LPQLSAENNC PDFCFYLAET LMVIDHQKKS TRIQASLFAP NEEEKQRLTA RLNDLRQQLT ETAPPLPVVS VPHMRCECNQ SDEEFGGVVR LLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC MNMGTLSGAP KVRAMQLIAE AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENSIATV QAGAGVVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF
|
| |