Gene ECH74115_4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4040 
SymbolpyrG 
ID6971550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3734825 
End bp3736462 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content52% 
IMG OID643387802 
ProductCTP synthetase 
Protein accessionYP_002272245 
Protein GI209398762 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000538941 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.515613 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACGA ACTATATTTT TGTGACCGGC GGGGTCGTAT CCTCTCTGGG TAAAGGCATT 
GCCGCAGCCT CCCTCGCAGC CATTCTTGAA GCCCGTGGCC TCAATGTGAC CATCATGAAA
CTGGATCCGT ACATCAACGT CGATCCAGGT ACTATGAGCC CAATCCAACA CGGGGAAGTG
TTCGTTACTG AAGACGGCGC TGAAACCGAC CTGGACCTGG GGCACTACGA GCGTTTCATC
CGTACCAAAA TGAGCCGCCG CAACAACTTC ACCACGGGTC GTATCTACTC TGACGTTCTG
CGTAAAGAAC GCCGCGGTGA CTACCTCGGC GCAACCGTAC AGGTTATTCC GCACATCACT
AACGCAATCA AAGAGCGCGT GCTGGAAGGT GGCGAAGGTC ATGACGTAGT ACTGGTCGAA
ATCGGCGGTA CAGTAGGTGA TATCGAATCC CTGCCGTTCC TCGAAGCGAT TCGCCAGATG
GCTGTTGAAA TTGGCCGTGA GCACACTCTG TTTATGCACC TGACGCTGGT GCCGTACATG
GCAGCGTCTG GTGAAGTCAA AACCAAACCG ACTCAGCACT CTGTAAAAGA GCTGCTCTCC
ATCGGTATCC AGCCTGACAT CCTGATTTGT CGTTCAGATC GCGCCGTTCC GGCGAACGAA
CGTGCGAAGA TTGCATTGTT CTGTAATGTT CCGGAAAAAG CGGTTATTTC TCTGAAAGAC
GTCGATTCCA TCTATAAAAT TCCGGGCCTG TTGAAATCTC AGGGGCTGGA CGATTATATT
TGTAAACGAT TCAGCTTAAA CTGCCCGGAA GCAAATCTGT CCGAATGGGA ACAGGTTATC
TTCGAAGAAG CGAATCCGGT AAGTGAAGTC ACCATTGGTA TGGTCGGCAA GTACATTGAA
CTGCCGGACG CTTACAAATC CGTGATTGAA GCACTGAAAC ACGGTGGGCT GAAGAATCGT
GTCAGCGTCA ACATCAAACT GATCGATTCA CAAGATGTTG AAACGCGCGG CGTTGAAATC
CTTAAAGGTC TGGATGCAAT CCTCGTACCT GGCGGTTTCG GCTATCGTGG CGTAGAAGGC
ATGATTACGA CCGCGCGTTT TGCGCGTGAG AACAATATTC CTTATCTGGG CATTTGCCTG
GGTATGCAGG TGGCGTTAAT TGATTACGCT CGCCATGTTG CCAACATGGA GAACGCCAAC
TCTACGGAAT TTGTGCCAGA CTGTAAGTAC CCGGTTGTGG CGCTGATTAC CGAGTGGCGC
GATGAAAACG GCAACGTTGA AGTTCGTAGC GAGAAGAGCG ATCTCGGCGG TACCATGCGT
CTCGGCGCAC AGCAGTGCCA GTTGGTTGAC GATAGCCTGG TTCGCCAGCT GTACAATGCG
CCGACAATTG TTGAGCGTCA TCGTCACCGT TACGAAGTCA ACAACATGCT GTTGAAACAG
ATTGAAGATG CAGGTCTGCG CGTTGCGGGC CGTTCCGGGG ATGATCAGTT GGTCGAGATC
ATCGAAGTTC CGAATCACCC GTGGTTCGTG GCTTGCCAGT TCCATCCGGA GTTTACTTCT
ACTCCACGTG ATGGTCACCC GCTGTTTGCA GGCTTTGTGA AAGCCGCCAG CGAGTTCCAG
AAACGTCAGG CGAAGTAA
 
Protein sequence
MTTNYIFVTG GVVSSLGKGI AAASLAAILE ARGLNVTIMK LDPYINVDPG TMSPIQHGEV 
FVTEDGAETD LDLGHYERFI RTKMSRRNNF TTGRIYSDVL RKERRGDYLG ATVQVIPHIT
NAIKERVLEG GEGHDVVLVE IGGTVGDIES LPFLEAIRQM AVEIGREHTL FMHLTLVPYM
AASGEVKTKP TQHSVKELLS IGIQPDILIC RSDRAVPANE RAKIALFCNV PEKAVISLKD
VDSIYKIPGL LKSQGLDDYI CKRFSLNCPE ANLSEWEQVI FEEANPVSEV TIGMVGKYIE
LPDAYKSVIE ALKHGGLKNR VSVNIKLIDS QDVETRGVEI LKGLDAILVP GGFGYRGVEG
MITTARFARE NNIPYLGICL GMQVALIDYA RHVANMENAN STEFVPDCKY PVVALITEWR
DENGNVEVRS EKSDLGGTMR LGAQQCQLVD DSLVRQLYNA PTIVERHRHR YEVNNMLLKQ
IEDAGLRVAG RSGDDQLVEI IEVPNHPWFV ACQFHPEFTS TPRDGHPLFA GFVKAASEFQ
KRQAK