Gene EcolC_3292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3292 
Symbol 
ID6067028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3605953 
End bp3607122 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content56% 
IMG OID641602707 
Productmethylcitrate synthase 
Protein accessionYP_001726241 
Protein GI170021287 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01800] 2-methylcitrate synthase/citrate synthase II 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA CAACGATCCT GCAAAACAGT ACCCATGTCA TTAAACCGAA AAAATCTGTG 
GCACTTTCTG GCGTTCCGGC GGGCAATACG GCGCTCTGCA CCGTGGGTAA AAGTGGCAAT
GACCTGCATT ACCGCGGCTA CGATATTCTT GATCTGGCGA AACATTGCGA ATTTGAAGAA
GTGGCGCATC TGCTGATCCA CGGCAAACTG CCGACCCGTG ACGAACTCGC CGCTTACAAA
ACGAAACTGA AAGCCCTGCG CGGTTTACCG GCTAACGTGC GTACCGTGCT GGAAGCCTTA
CCGGCGGCGT CACACCCGAT GGATGTTATG CGCACCGGTG TTTCCGCGCT CGGCTGCACG
CTGCCAGAAA AAGAGGGGCA TACCGTCTCT GGCGCGCGGG ATATTGCCGA CAAACTGCTG
GCGTCGCTTA GCTCGATTCT CCTTTATTGG TATCACTACA GCCACAACGG CGAACGCATC
CAACCGGAAA CCGATGACGA CTCCATCGGC GGTCACTTCC TGCATCTGCT GCACGGCGAA
AAGCCATCGC AAAGCTGGGA AAAGGCGATG CATATCTCGC TGGTGCTGTA CGCCGAACAC
GAGTTTAACG CCTCCACCTT TACCAGTCGG GTGATTGCGG GCACCGGCTC TGATATGTAT
TCCGCGATTA TTGGCGCGAT TGGCGCACTG CGCGGGCCAA AACACGGCGG GGCGAATGAA
GTGTCGCTGG AGATCCAGCA ACGCTACGAA ACGCCGGACG AAGCCGAAGC AGATATCCGC
AAGCGCGTGG AAAACAAAGA AGTGGTCATT GGTTTTGGTC ATCCGGTTTA CACCATCGCT
GACCCGCGCC ACCAGGTGAT TAAACGTGTG GCGAAGCAGC TCTCGCAGGA AGGCGGCTCG
CTGAAGATGT ACAACATCGC CGATCGCCTG GAAACGGTGA TGTGGGAGAG CAAAAAGATG
TTCCCCAATC TCGACTGGTT CTCTGCTGTT TCCTACAACA TGATGGGCGT TCCCACCGAG
ATGTTCACAC CACTGTTTGT TATCGCCCGC GTCACCGGCT GGGCGGCGCA CATTATCGAA
CAACGTCAGG ACAACAAAAT TATCCGTCCT TCCGCCAATT ATGTTGGACC GGAAGACCGC
CCGTTTGTCG CGCTGGATAA GCGCCAGTAA
 
Protein sequence
MSDTTILQNS THVIKPKKSV ALSGVPAGNT ALCTVGKSGN DLHYRGYDIL DLAKHCEFEE 
VAHLLIHGKL PTRDELAAYK TKLKALRGLP ANVRTVLEAL PAASHPMDVM RTGVSALGCT
LPEKEGHTVS GARDIADKLL ASLSSILLYW YHYSHNGERI QPETDDDSIG GHFLHLLHGE
KPSQSWEKAM HISLVLYAEH EFNASTFTSR VIAGTGSDMY SAIIGAIGAL RGPKHGGANE
VSLEIQQRYE TPDEAEADIR KRVENKEVVI GFGHPVYTIA DPRHQVIKRV AKQLSQEGGS
LKMYNIADRL ETVMWESKKM FPNLDWFSAV SYNMMGVPTE MFTPLFVIAR VTGWAAHIIE
QRQDNKIIRP SANYVGPEDR PFVALDKRQ