Gene EcolC_2363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2363 
Symbol 
ID6065139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2603681 
End bp2605243 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content54% 
IMG OID641601766 
Productanthranilate synthase component I 
Protein accessionYP_001725325 
Protein GI170020371 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00565] anthranilate synthase component I, proteobacterial subset 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000438966 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAACAC AAAAACCGAC TCTCGAACTG CTAACCTGCG AAGGCGCTTA TCGCGACAAT 
CCCACCGCGC TTTTTCACCA ATTGTGTGAG GATCGTCCGG CAACGCTGCT GCTGGAATCC
GCAGATATCG ACAGCAAAGA TGATTTAAAA AGCCTGCTGC TGGTAGACAG TGCGCTACGC
ATTACAGCTT TAGGTGACAC TGTCACAATC CAGTCCCTTT CCGGCAACGG CGAAGCCCTG
CTGACACTAC TGGATAACGC CCTGCCTGCG GGTGTGGAAA ATGAACAATT ACCAAACTGC
CGTGTGCTGC GCTTCCCCCC TGTTAGTCCA CTGCTGGATG AAGACGCTCG CTTATGCTCC
CTTTCGGTTT TTGACGCTTT CCGCTTATTG CAGAATCTGT TGAATGTACC GAAGGAAGAA
CGAGAAGCCA TGTTCTTCGG CGGCCTGTTC TCTTATGACC TTGTGGCGGG GTTTGAAGAT
TTACCGCAAC TGTCAGCGGA AAATAACTGC CCTGATTTCT GTTTTTATCT CGCTGAAACG
CTGATGGTGA TTGACCATCA GAAAAAAAGC ACCCGCATTC AGGCCAGCCT GTTTGCTCCG
AATGAAGAAG AAAAACAACG TCTCACTGCT CGCCTGAACG ATCTTCGCCA GCAACTGACC
GAAACCGCGC CACCGCTGCC GGTGGTTTCC GTGCCGCATA TGCGTTGTGA ATGTAACCAG
AGCGATGAAG AGTTCGGTGG CGTAGTGCGT TTGTTGCAAA AAGCGATTCG CGCTGGAGAA
ATTTTCCAGG TGGTGCCATC TCGCCGTTTC TCTCTGCCCT GCCCGTCACC GCTGGCGGCC
TATTACGTGC TGAAAAAGAG TAATCCCAGC CCGTACATGT TTTTTATGCA GGATAATGAT
TTCACCCTGT TTGGCGCGTC GCCGGAAAGT TCGCTCAAAT ATGACGCCAC CAGCCGCCAG
ATTGAGATCT ACCCGATTGC CGGGACACGC CCACGCGGTC GTCGTGCCGA TGGTTCACTG
GACAGAGACC TCGACAGCCG CATCGAACTG GAAATGCGTA CCGATCATAA AGAGCTTTCT
GAACATCTGA TGCTGGTGGA TCTCGCCCGT AATGATCTGG CACGCATTTG CACCCCCGGC
AGCCGCTACG TCGCCGATCT TACCAAAGTT GACCGTTACT CTTACGTGAT GCACCTGGTC
TCCCGCGTGG TCGGTGAGCT GCGCCACGAT CTCGACGCCC TGCACGCTTA CCGCGCCTGT
ATGAATATGG GAACGTTAAG CGGTGCGCCG AAAGTACGCG CTATGCAGTT AATTGCCGAG
GCTGAAGGTC GTCGCCGCGG CAGCTACGGC GGCGCGGTAG GTTATTTCAC CGCGCATGGC
GATCTCGACA CCTGCATTGT GATCCGCTCG GCGCTGGTGG AAAACAGTAT CGCCACCGTG
CAAGCCGGTG CTGGCGTAGT CCTTGATTCT GTTCCGCAGT CGGAAGCCGA CGAAACCCGT
AATAAAGCCC GCGCTGTACT GCGCGCTATT GCCACCGCGC ATCATGCACA GGAGACTTTC
TGA
 
Protein sequence
MQTQKPTLEL LTCEGAYRDN PTALFHQLCE DRPATLLLES ADIDSKDDLK SLLLVDSALR 
ITALGDTVTI QSLSGNGEAL LTLLDNALPA GVENEQLPNC RVLRFPPVSP LLDEDARLCS
LSVFDAFRLL QNLLNVPKEE REAMFFGGLF SYDLVAGFED LPQLSAENNC PDFCFYLAET
LMVIDHQKKS TRIQASLFAP NEEEKQRLTA RLNDLRQQLT ETAPPLPVVS VPHMRCECNQ
SDEEFGGVVR LLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND
FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGSL DRDLDSRIEL EMRTDHKELS
EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC
MNMGTLSGAP KVRAMQLIAE AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENSIATV
QAGAGVVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF