Gene EcolC_3026 details


Gene Information

Locus tag: EcolC_3026
Symbol:
ID: 6066026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3305380
End bp: 3306438
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 51%
IMG OID: 641602442
Product: citrate lyase ligase
Protein accession: YP_001725977
Protein GI: 170021023
COG category: [C] Energy production and conversion
COG ID: [COG3053] Citrate lyase synthetase
TIGRFAM ID: [TIGR00124] [citrate (pro-3S)-lyase] ligase
            [TIGR00125] cytidyltransferase-related domain
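
The length fields in this record follow from the start and end coordinates. The sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed, but the record does not state them. The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 3305380
END_BP = 3306438


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.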


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGGCA ATGATATTTT CACCCGCGTA AAACGTTCAG AAAATAAAAA AATGGCGGAA 
ATCGCCCAAT TCCTGCATGA AAATGATTTG AGCGTTGACA CCACAGTCGA AGTATTTATT
ACCGTAACCC GCGATGAAAA GCTTATCGCG TGCGGTGGAA TTGCCGGAAA TATTATTAAA
TGCGTTGCTA TCAGTGAATC CGTTCGCGGT GAAGGACTGG CGCTGACATT AGCCACTGAA
CTGATAAACC TCGCCTATGA GCGGCACAGC ACGCATCTGT TTATTTATAC CAAAACCGAA
TACGAGGCGC TGTTCCGCCA GTGCGGTTTT TCCACGCTGA CCAGCGTACC CGGCGTGATG
GTGCTGATGG AAAACAGCGC CACGCGACTG AAACGCTATG CCGAATCGCT GAAAAAATTT
CGTCATCCAG GGAACAAGAT TGGCTGCATT GTGATGAACG CCAATCCCTT TACGAATGGT
CACCGTTATC TGATTCAACA AGCTGCGGCA CAGTGCGACT GGTTGCATCT GTTTTTAGTC
AAAGAAGATT CTTCACGCTT CCCCTATGAA GACCGGCTGG ATCTGGTGTT AAAAGGCACC
GCCGATATTC CACGCCTGAC TGTGCATCGC GGCTCCGAAT ACATCATCTC CCGCGCTACG
TTCCCTTGCT ACTTCATTAA AGAACAGAGC GTCATTAACC ATTGTTACAC CGAAATTGAT
CTGAAGATTT TCCGTCAGTA CCTCGCTCCC GCACTGGGTG TAACTCACCG CTTTGTCGGT
ACTGAACCCT TTTGTCGCGT TACCGCCCAG TACAACCAGG ATATGCGCTA CTGGCTGGAA
ACGCCGACTA TCTCCGCACC GCCCATCGAA CTGGTTGAAA TTGAGCGGCT GCGTTACCAG
GAGATGCCGA TATCCGCTTC CCGGGTACGT CAACTGCTGG CGAAAAACGA TCTCACGGCT
ATCGCGCCGC TGGTCCCTGC AGTCACGCTG CATTATTTGC AGAACCTGCT TGAGCACTCC
CGCCAGGACG CGGCAGCTCG TCAAAAGACC CCCGCATGA
 
Protein sequence
MFGNDIFTRV KRSENKKMAE IAQFLHENDL SVDTTVEVFI TVTRDEKLIA CGGIAGNIIK 
CVAISESVRG EGLALTLATE LINLAYERHS THLFIYTKTE YEALFRQCGF STLTSVPGVM
VLMENSATRL KRYAESLKKF RHPGNKIGCI VMNANPFTNG HRYLIQQAAA QCDWLHLFLV
KEDSSRFPYE DRLDLVLKGT ADIPRLTVHR GSEYIISRAT FPCYFIKEQS VINHCYTEID
LKIFRQYLAP ALGVTHRFVG TEPFCRVTAQ YNQDMRYWLE TPTISAPPIE LVEIERLRYQ
EMPISASRVR QLLAKNDLTA IAPLVPAVTL HYLQNLLEHS RQDAAARQKT PA
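
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It uses the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs only in which start codons it permits). The `translate` helper and the `FIRST_LINE` constant are illustrative names, and only the first 60 bases of the gene sequence are included to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence as listed in the record (60 bases).
FIRST_LINE = "ATGTTCGGCA ATGATATTTT CACCCGCGTA AAACGTTCAG AAAATAAAAA AATGGCGGAA"
peptide = translate(FIRST_LINE)
print(peptide)
```

The result can be compared with the first 20 residues of the protein sequence. Passing the full gene sequence to the same function would yield the complete protein followed by `*` for the terminal stop codon.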