Gene EcSMS35_0637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0637 
SymbolcitC 
ID6146848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp650197 
End bp651255 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content50% 
IMG OID641615529 
Product[citrate (pro-3S)-lyase] ligase 
Protein accessionYP_001742735 
Protein GI170681316 
COG category[C] Energy production and conversion 
COG ID[COG3053] Citrate lyase synthetase 
TIGRFAM ID[TIGR00124] [citrate (pro-3S)-lyase] ligase
[TIGR00125] cytidyltransferase-related domain 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGGCA ATGATATTTT CACCCGTGTA AAACGTTCAG AAAATAAAAA AATGGCGGAA 
ATCGCCCAAT TCCTGCATGA AAATGATTTG AGCGTTGACA CCACAGTCGA AGTATTTATT
ACCGTAACCC GCGATGAAAA GCTTATCGCG TGCGGTGGAA TTGCCGGAAA TATTATTAAA
TGCGTTGCTA TCAGTGAATC CGTTCGCGGT GAAGGACTGG CGCTGACATT AGCCACTGAA
TTGATAAACC TCGCCTATGA GCGGCACAGC ACGCATCTGT TTATTTATAC CAAAACCGAA
TACGAGGCGC TGTTCCGCCA GTGCGGTTTT TCCACGCTGA CCAGTGTACC CGGCGTGATG
GTGCTGATGG AAAACAGCGC CACGCGACTG AAACGCTATG CCGAATCGCT GAAAAAATTT
CGTCATCCAG GGAACAAGAT TGGCTGCATT GTGATGAACG CCAATCCCTT TACGAATGGT
CACCGTTATC TGATTCAACA GGCTGCAGCA CAGTGCGACT GGTTGCATCT GTTTTTAGTT
AAAGAAGATT CTTCACGCTT TCCCTATGAA GACCGGCTGG ATCTGGTGTT AAAAGGCACC
GCCGATATTC CACGCCTGAC TGTGCATCGC GGCTCCGAAT ACATCATCTC CCGCGCTACG
TTCCCTTGCT ACTTCATTAA AGAACAGAGC GTCATTAACC ATTGTTACAC CGAAATTGAT
CTGAAGATTT TCCGTCAGTA CCTCGCTCCC GCGCTGGGTG TAACTCACCG CTTTGTCGGT
ACTGAACCCT TTTGTCGTGT TACCGCCCAG TACAACCAGG ATATGCGCTA CTGGCTGGAA
ACGCCGACTA TCTCCGCACC GCCCATCGAA CTGGTTGAAA TTGAGCGGCT GCGTTACCAG
GAGATGCCGA TATCCGCTTC CCGGGTACGT CAACTGCTGG CGAAAAACGA TCTCACGGCT
ATCGCGCCGC TGGTCCCTGC AGTCACGCTG CATTATTTGC AGAACCTGCT TGAGCACTCC
CGCCAGGACG CGGCAGCTCG TCAAAAGACC CCCGCATGA
 
Protein sequence
MFGNDIFTRV KRSENKKMAE IAQFLHENDL SVDTTVEVFI TVTRDEKLIA CGGIAGNIIK 
CVAISESVRG EGLALTLATE LINLAYERHS THLFIYTKTE YEALFRQCGF STLTSVPGVM
VLMENSATRL KRYAESLKKF RHPGNKIGCI VMNANPFTNG HRYLIQQAAA QCDWLHLFLV
KEDSSRFPYE DRLDLVLKGT ADIPRLTVHR GSEYIISRAT FPCYFIKEQS VINHCYTEID
LKIFRQYLAP ALGVTHRFVG TEPFCRVTAQ YNQDMRYWLE TPTISAPPIE LVEIERLRYQ
EMPISASRVR QLLAKNDLTA IAPLVPAVTL HYLQNLLEHS RQDAAARQKT PA