Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4040 |
Symbol | pyrG |
ID | 6971550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3734825 |
End bp | 3736462 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643387802 |
Product | CTP synthetase |
Protein accession | YP_002272245 |
Protein GI | 209398762 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0504] CTP synthase (UTP-ammonia lyase) |
TIGRFAM ID | [TIGR00337] CTP synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000538941 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.515613 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACGA ACTATATTTT TGTGACCGGC GGGGTCGTAT CCTCTCTGGG TAAAGGCATT GCCGCAGCCT CCCTCGCAGC CATTCTTGAA GCCCGTGGCC TCAATGTGAC CATCATGAAA CTGGATCCGT ACATCAACGT CGATCCAGGT ACTATGAGCC CAATCCAACA CGGGGAAGTG TTCGTTACTG AAGACGGCGC TGAAACCGAC CTGGACCTGG GGCACTACGA GCGTTTCATC CGTACCAAAA TGAGCCGCCG CAACAACTTC ACCACGGGTC GTATCTACTC TGACGTTCTG CGTAAAGAAC GCCGCGGTGA CTACCTCGGC GCAACCGTAC AGGTTATTCC GCACATCACT AACGCAATCA AAGAGCGCGT GCTGGAAGGT GGCGAAGGTC ATGACGTAGT ACTGGTCGAA ATCGGCGGTA CAGTAGGTGA TATCGAATCC CTGCCGTTCC TCGAAGCGAT TCGCCAGATG GCTGTTGAAA TTGGCCGTGA GCACACTCTG TTTATGCACC TGACGCTGGT GCCGTACATG GCAGCGTCTG GTGAAGTCAA AACCAAACCG ACTCAGCACT CTGTAAAAGA GCTGCTCTCC ATCGGTATCC AGCCTGACAT CCTGATTTGT CGTTCAGATC GCGCCGTTCC GGCGAACGAA CGTGCGAAGA TTGCATTGTT CTGTAATGTT CCGGAAAAAG CGGTTATTTC TCTGAAAGAC GTCGATTCCA TCTATAAAAT TCCGGGCCTG TTGAAATCTC AGGGGCTGGA CGATTATATT TGTAAACGAT TCAGCTTAAA CTGCCCGGAA GCAAATCTGT CCGAATGGGA ACAGGTTATC TTCGAAGAAG CGAATCCGGT AAGTGAAGTC ACCATTGGTA TGGTCGGCAA GTACATTGAA CTGCCGGACG CTTACAAATC CGTGATTGAA GCACTGAAAC ACGGTGGGCT GAAGAATCGT GTCAGCGTCA ACATCAAACT GATCGATTCA CAAGATGTTG AAACGCGCGG CGTTGAAATC CTTAAAGGTC TGGATGCAAT CCTCGTACCT GGCGGTTTCG GCTATCGTGG CGTAGAAGGC ATGATTACGA CCGCGCGTTT TGCGCGTGAG AACAATATTC CTTATCTGGG CATTTGCCTG GGTATGCAGG TGGCGTTAAT TGATTACGCT CGCCATGTTG CCAACATGGA GAACGCCAAC TCTACGGAAT TTGTGCCAGA CTGTAAGTAC CCGGTTGTGG CGCTGATTAC CGAGTGGCGC GATGAAAACG GCAACGTTGA AGTTCGTAGC GAGAAGAGCG ATCTCGGCGG TACCATGCGT CTCGGCGCAC AGCAGTGCCA GTTGGTTGAC GATAGCCTGG TTCGCCAGCT GTACAATGCG CCGACAATTG TTGAGCGTCA TCGTCACCGT TACGAAGTCA ACAACATGCT GTTGAAACAG ATTGAAGATG CAGGTCTGCG CGTTGCGGGC CGTTCCGGGG ATGATCAGTT GGTCGAGATC ATCGAAGTTC CGAATCACCC GTGGTTCGTG GCTTGCCAGT TCCATCCGGA GTTTACTTCT ACTCCACGTG ATGGTCACCC GCTGTTTGCA GGCTTTGTGA AAGCCGCCAG CGAGTTCCAG AAACGTCAGG CGAAGTAA
|
Protein sequence | MTTNYIFVTG GVVSSLGKGI AAASLAAILE ARGLNVTIMK LDPYINVDPG TMSPIQHGEV FVTEDGAETD LDLGHYERFI RTKMSRRNNF TTGRIYSDVL RKERRGDYLG ATVQVIPHIT NAIKERVLEG GEGHDVVLVE IGGTVGDIES LPFLEAIRQM AVEIGREHTL FMHLTLVPYM AASGEVKTKP TQHSVKELLS IGIQPDILIC RSDRAVPANE RAKIALFCNV PEKAVISLKD VDSIYKIPGL LKSQGLDDYI CKRFSLNCPE ANLSEWEQVI FEEANPVSEV TIGMVGKYIE LPDAYKSVIE ALKHGGLKNR VSVNIKLIDS QDVETRGVEI LKGLDAILVP GGFGYRGVEG MITTARFARE NNIPYLGICL GMQVALIDYA RHVANMENAN STEFVPDCKY PVVALITEWR DENGNVEVRS EKSDLGGTMR LGAQQCQLVD DSLVRQLYNA PTIVERHRHR YEVNNMLLKQ IEDAGLRVAG RSGDDQLVEI IEVPNHPWFV ACQFHPEFTS TPRDGHPLFA GFVKAASEFQ KRQAK
|
| |