Gene EcHS_A0669 details


Gene Information

Locus tag: EcHS_A0669
Symbol: citC
ID: 5594832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 686878
End bp: 687936
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 51%
IMG OID: 640919850
Product: [citrate (pro-3S)-lyase] ligase
Protein accession: YP_001457432
Protein GI: 157160114
COG category: [C] Energy production and conversion
COG ID: [COG3053] Citrate lyase synthetase
TIGRFAM ID: [TIGR00124] [citrate (pro-3S)-lyase] ligase
            [TIGR00125] cytidyltransferase-related domain
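
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below recomputes the two lengths from the coordinates. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
START_BP = 686878
END_BP = 687936


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1059 352
```

Both values agree with the Gene Length and Protein Length fields listed in the record.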


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value: 0.850231
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCGGCA ATGATATTTT CACCCGCGTA AAACGTTCAG AAAATAAAAA AATGGCGGAA 
ATCGCCCAAT TCCTGCATGA AAATGATTTG AGCGTTGACA CCACAGTCGA AGTATTTATT
ACCGTAACCC GCGATGAAAA GCTTATCGCG TGCGGTGGAA TTGCCGGAAA TATTATTAAA
TGCGTTGCTA TCAGTGAATC CGTTCGCGGT GAAGGACTGG CGCTGACATT AGCCACTGAA
CTGATAAACC TCGCCTATGA GCGGCACAGC ACGCATCTGT TTATTTATAC CAAAACCGAA
TACGAGGCGC TGTTCCGCCA GTGCGGTTTT TCCACGCTGA CCAGCGTACC CGGCGTGATG
GTGCTGATGG AAAACAGCGC CACGCGACTG AAACGCTATG CCGAATCGCT GAAAAAATTT
CGTCATCCAG GGAACAAGAT TGGCTGCATT GTGATGAACG CCAATCCCTT TACGAATGGT
CACCGTTATC TGATTCAACA GGCTGCGGCA CAGTGCGACT GGTTGCATCT GTTTTTAGTC
AAAGAAGATT CTTCACGCTT CCCCTATGAA GACCGGCTGG ATCTGGTGTT AAAAGGCACC
GCCGATATTC CACGCCTGAC TGTGCATCGC GGCTCCGAAT ACATCATCTC CCGCGCTACG
TTCCCTTGCT ACTTCATTAA AGAACAGAGC GTCATTAACC ATTGTTACAC CGAAATTGAT
CTGAAGATTT TCCGTCAGTA CCTCGCTCCC GCACTGGGTG TAACTCACCG CTTTGTCGGT
ACTGAACCCT TTTGTCGCGT TACCGCCCAG TACAACCAGG ATATGCGCTA CTGGCTGGAA
ACGCCGACTA TCTCCGCACC GCCCATCGAA CTGGTTGAAA TTGAGCGGCT GCGTTACCAG
GAGATGCCGA TATCCGCTTC CCGGGTACGT CAACTGCTGG CGAAAAACGA TCTCACGGCT
ATCGCGCCGC TGGTCCCTGC AGTCACGCTG CATTATTTGC AGAACCTGCT TGAGCACTCC
CGCCAGGACG CGGCAGCTCG TCAAAAGACC CCCGCATGA
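
The GC content field in the gene information can be recomputed from the gene sequence. The sketch below is one minimal way to do this; `gc_percent` is a hypothetical helper name, and it assumes the sequence is pasted in the space-separated blocks of ten bases used above. Only the first two blocks are used in the example call, so the printed value describes that fragment and not the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence above (20 bases, 6 of them G or C).
print(gc_percent("ATGTTCGGCA ATGATATTTT"))  # 30.0
```

Passing the full gene sequence in the same way gives the whole-gene figure to compare with the GC content field.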
 
Protein sequence
MFGNDIFTRV KRSENKKMAE IAQFLHENDL SVDTTVEVFI TVTRDEKLIA CGGIAGNIIK 
CVAISESVRG EGLALTLATE LINLAYERHS THLFIYTKTE YEALFRQCGF STLTSVPGVM
VLMENSATRL KRYAESLKKF RHPGNKIGCI VMNANPFTNG HRYLIQQAAA QCDWLHLFLV
KEDSSRFPYE DRLDLVLKGT ADIPRLTVHR GSEYIISRAT FPCYFIKEQS VINHCYTEID
LKIFRQYLAP ALGVTHRFVG TEPFCRVTAQ YNQDMRYWLE TPTISAPPIE LVEIERLRYQ
EMPISASRVR QLLAKNDLTA IAPLVPAVTL HYLQNLLEHS RQDAAARQKT PA
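
The protein sequence follows from the gene sequence by codon translation. The sketch below is a minimal, dependency-free illustration, not the pipeline used to produce this record. It assumes the codon assignments of NCBI translation table 11, which match the standard code (the two tables differ only in the permitted start codons), and `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string up to the first stop codon.

    Whitespace is ignored. Alternative start codons (GTG, TTG) are not
    special-cased; this gene starts with ATG, so that does not matter here.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGTTCGGCA ATGATATTTT CACCCGCGTA AAACGTTCAG AAAATAAAAA AATGGCGGAA"
print(translate(first_line))  # MFGNDIFTRVKRSENKKMAE
```

The output matches the first two blocks of the protein sequence; the same call on the full gene sequence can be compared against the complete protein sequence.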