Gene EcHS_A0398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0398 
SymbolprpC 
ID5591599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp415608 
End bp416777 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content56% 
IMG OID640919583 
Productmethylcitrate synthase 
Protein accessionYP_001457168 
Protein GI157159850 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01800] 2-methylcitrate synthase/citrate synthase II 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACA CAACGATCCT GCAAAACAGT ACCCATGTCA TTAAACCGAA AAAATCGGTG 
GCACTTTCCG GCGTTCCGGC GGGCAATACG GCGCTCTGCA CCGTGGGTAA AAGTGGCAAT
GACCTGCATT ACCGTGGCTA CGATATTCTT GATCTGGCGG AACATTGCGA ATTTGAAGAA
GTGGCTCATC TGCTGATCCA CGGCAAACTG CCGACCCGTG ACGAACTCGC CGCTTACAAA
ACGAAACTGA AAGCCCTGCG TGGTTTACCG GCTAACGTGC GTACCGTGCT GGAAGCCTTA
CCGGCGGCGT CACACCCGAT GGATGTTATG CGCACCGGTG TTTCCGCGCT CGGCTGCACG
CTGCCAGAAA AAGAGGGGCA TACCGTCTCT GGCGCGCGGG ATATTGCCGA CAAACTGCTG
GCGTCGCTTA GCTCGATTCT CCTTTATTGG TATCACTACA GCCACAACGG CGAACGCATC
CAACCGGAAA CCGATGACGA CTCCATCGGC GGTCACTTCC TGCATCTGCT GCACGGCGAA
AAGCCATCGC AAAGCTGGGA AAAGGCGATG CATATCTCGC TGGTGCTGTA CGCCGAACAC
GAGTTTAACG CCTCCACCTT TACCAGTCGG GTGATTGCGG GCACCGGCTC TGATATGTAT
TCCGCGATTA TTGGCGCGAT TGGCGCACTG CGCGGGCCAA AACACGGCGG GGCGAATGAA
GTGTCGCTGG AGATCCAGCA ACGCTACGAA ACGCCGGACG AAGCCGAAGC AGATATCCGC
AAGCGCGTGG AAAACAAAGA AGTGGTCATT GGTTTTGGTC ATCCGGTTTA CACCATCGCT
GACCCGCGCC ACCAGGTGAT TAAACGTGTG GCGAAGCAGC TCTCGCAGGA AGGCGGCTCG
CTGAAGATGT ACAACATCGC CGATCGCCTG GAAACGGTGA TGTGGGAGAG CAAAAAGATG
TTCCCCAATC TCGACTGGTT CTCTGCTGTT TCCTACAACA TGATGGGCGT TCCCACCGAG
ATGTTCACAC CACTGTTTGT TATCGCCCGC GTCACCGGCT GGGCGGCGCA CATTATCGAA
CAACGTCAGG ACAACAAAAT TATCCGTCCT TCCGCCAATT ATGTTGGACC GGAAGACCGC
CCGTTTGTCG CGCTGGATAA GCGCCAGTAA
 
Protein sequence
MSDTTILQNS THVIKPKKSV ALSGVPAGNT ALCTVGKSGN DLHYRGYDIL DLAEHCEFEE 
VAHLLIHGKL PTRDELAAYK TKLKALRGLP ANVRTVLEAL PAASHPMDVM RTGVSALGCT
LPEKEGHTVS GARDIADKLL ASLSSILLYW YHYSHNGERI QPETDDDSIG GHFLHLLHGE
KPSQSWEKAM HISLVLYAEH EFNASTFTSR VIAGTGSDMY SAIIGAIGAL RGPKHGGANE
VSLEIQQRYE TPDEAEADIR KRVENKEVVI GFGHPVYTIA DPRHQVIKRV AKQLSQEGGS
LKMYNIADRL ETVMWESKKM FPNLDWFSAV SYNMMGVPTE MFTPLFVIAR VTGWAAHIIE
QRQDNKIIRP SANYVGPEDR PFVALDKRQ