Gene ECH74115_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0404 
SymbolprpC 
ID6971924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp410307 
End bp411476 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content56% 
IMG OID643384456 
Productmethylcitrate synthase 
Protein accessionYP_002268970 
Protein GI209395959 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01800] 2-methylcitrate synthase/citrate synthase II 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA CAACGATCCT GCAAAACAGT ACCCATGTCA TTAAACCGAA AAAATCTGTG 
GCGCTTTCTG GCGTTCCGGC GGGCAATACG GCGCTCTGCA CCGTGGGTAA AAGTGGCAAT
GACTTGCATT ACCGTGGCTA CGATATTCTT GATCTGGCGG AACATTGCGA ATTTGAAGAA
GTGGCGCATC TGCTGATCCA CGGCAAACTG CCGACCCGTG ACGAACTCGC CGCCTACAAA
ACGAAACTGA AAGCCCTGCG CGGTTTACCG GCTAACGTGC GTACCGTGCT GGAAGCCTTA
CCGGCGGCGT CACACCCGAT GGATGTTATG CGCACCGGCG TTTCCGCGCT CGGCTGCACG
CTGCCAGAAA AAGAGGGGCA TACCGTCTCT GGCGCGCGGG ATATTGCCGA CAAACTGCTG
GCGTCACTTA GCTCGATTCT TCTCTACTGG TATCACTACA GCCACAACGG CGAACGCATC
CAGCCGGAAA CCGATGACGA CTCCATCGGT GGTCACTTCC TGCATCTGCT GCACGGCGAA
AAGCCGTCGC AAAGCTGGGA AAAGGCGATG CATATTTCGC TGGTGCTGTA CGCCGAACAC
GAGTTTAACG CCTCCACCTT TACCAGCCGG GTGATTGCGG GCACTGGCTC TGATATGTAT
TCCGCGATTA TTGGCGCGAT TGGTGCACTG CGCGGGCCGA AGCACGGCGG GGCGAATGAA
GTGTCGCTGG AGATCCAGCA ACGCTACGAA ACGCCGGACG AAGCCGAAGC CGATATCCGC
AAGCGGGTGG AAAGCAAAGA AGTGGTCATT GGTTTTGGTC ATCCGGTTTA TACCATCGCC
GACCCGCGCC ACCAGGTGAT TAAACGTGTG GCGAAGCAGC TCTCGCAGGA AGGCGGCTCG
CTGAAGATGT ACAACATCGC CGATCGCCTG GAAACGGTGA TGTGGGAGAG CAAAAAGATG
TTCCCCAATC TCGACTGGTT CTCCGCTGTT TCCTACAACA TGATGGGCGT TCCCACCGAG
ATGTTCACAC CACTGTTTGT TATCGCCCGC GTCACCGGCT GGGCGGCGCA CATTATCGAA
CAACGTCAGG ACAACAAAAT TATCCGTCCT TCCGCCAATT ATGTTGGACC GGAAGACCGC
CAGTTTGTCG CGCTGGATAA GCGCCAGTAA
 
Protein sequence
MSDTTILQNS THVIKPKKSV ALSGVPAGNT ALCTVGKSGN DLHYRGYDIL DLAEHCEFEE 
VAHLLIHGKL PTRDELAAYK TKLKALRGLP ANVRTVLEAL PAASHPMDVM RTGVSALGCT
LPEKEGHTVS GARDIADKLL ASLSSILLYW YHYSHNGERI QPETDDDSIG GHFLHLLHGE
KPSQSWEKAM HISLVLYAEH EFNASTFTSR VIAGTGSDMY SAIIGAIGAL RGPKHGGANE
VSLEIQQRYE TPDEAEADIR KRVESKEVVI GFGHPVYTIA DPRHQVIKRV AKQLSQEGGS
LKMYNIADRL ETVMWESKKM FPNLDWFSAV SYNMMGVPTE MFTPLFVIAR VTGWAAHIIE
QRQDNKIIRP SANYVGPEDR QFVALDKRQ