Gene SeHA_C0065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0065 
SymbolcitF 
ID6488792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp66337 
End bp67857 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content55% 
IMG OID642740354 
Productcitrate lyase subunit alpha 
Protein accessionYP_002044028 
Protein GI194449842 
COG category[C] Energy production and conversion 
COG ID[COG3051] Citrate lyase, alpha subunit 
TIGRFAM ID[TIGR01584] citrate lyase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones84 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGA CTGTAACCAT GCTCAACCAG CAATATGTCG TACCGGAAGG ATTACAGCCC 
TACCAGGGGG TAACAGCAAA CAGCCCGTGG CTTGCCAGTG AAACGGAAAA GCGTCGGCGT
AAAATTTGTG ACTCGCTTGA AGAGGCTATC CGTCGTTCCG GGCTTAAAAA TGGCATGACG
ATATCCTTCC ACCATGCGTT TCGTGGTGGC GACAAAGTTG TCAATATGGT GATGGCGAAA
CTGGCGGAAA TGGGATTCCG CGATCTGACG CTTGCCTCCA GCTCATTAAT CGATGCCCAC
TGGCCGCTTA TCGAACACAT TAAAAATGGC GTGGTGCGGC AGATTTATAC CTCCGGGCTG
CGCGGCAAGT TAGGGGAAGA GATCTCCGCT GGCCTGATGG AAAACCCGGT ACAGATCCAC
TCTCACGGCG GCCGCGTAAA ACTGATTCAG AGCGGTGAGC TGAATATTGA TGTCGCTTTC
CTCGGCGTGC CGTGCTGCGA CGAATTTGGT AATGCGAACG GCTTTAGCGG CAAGTCACGC
TGCGGCTCTT TGGGGTATGC GCAGGTTGAT GCGCAGTACG CTAAATGTGT GGTGCTCCTG
ACTGAAGAGT GGGTCGAATT CCCCAATTAT CCGGCCAGTA TCGCGCAGGA TCAGGTTGAT
TTGATCGTAC AGGTGGACGA AGTAGGGGAT CCGGAAAAAA TCACCGCGGG TGCGATTCGT
CTGTCCAGCA ACCCGCGTGA ACTGTTGATC GCCCGGCAGG CGGCCAATGT GATCGAACAC
TCAGGTTATT TTTGTGATGG TTTCTCGCTG CAAACCGGCA CCGGTGGCGC TTCTCTGGCG
GTCACCCGTT TCCTGGAAGA CAAAATGCGT CGCCACAACA TTACGGCGAG TTTTGGCCTG
GGCGGGATCA CCGGCACCAT GGTGGATCTG CATGAGAAAG GGTTGATCAA AGCGCTGCTG
GATACGCAAT CATTCGATGG CGATGCAGCC CGTTCGCTGG CGCAGAATCC ACATCATATT
GAAATCTCAA CCAATCAATA CGCGAACCCC GTCTCTAAAG GCGCGGCATG CGAACGCCTG
AATGTGGTGA TGCTGAGCGC GCTGGAAATT GACGTGAATT TCAACGTTAA CGTGATGACC
GGTTCAAACG GCGTACTGCG TGGCGCATCG GGCGGCCATA GCGATACGGC AGCAGGGGCC
GATCTCACCA TTATTACCGC ACCGCTGGTC CGTGGCCGAA TCCCATGCGT GGTGGAAAAA
GTGCTGACTA CCGTCACGCC TGGGGCAAGT GTTGATGTGC TGGTTACCGA CCATGGTATT
GCGGTGAACC CTGCGCGTCA GGATCTGCTT GATAACCTGC GTGCTGCGGG CGTGGCGCTG
ATGACCATCG AACAACTGCA ACAGCGCGCT GAGCAACTGA CCGGTAAACC GCAGCCGATT
GAGTTTACCG ATCGGGTGGT GGCTGTGGTG CGTTATCGCG ACGGTTCAGT GATTGACGTG
ATTCGCCAGG TTAAAGGCTG A
 
Protein sequence
MKETVTMLNQ QYVVPEGLQP YQGVTANSPW LASETEKRRR KICDSLEEAI RRSGLKNGMT 
ISFHHAFRGG DKVVNMVMAK LAEMGFRDLT LASSSLIDAH WPLIEHIKNG VVRQIYTSGL
RGKLGEEISA GLMENPVQIH SHGGRVKLIQ SGELNIDVAF LGVPCCDEFG NANGFSGKSR
CGSLGYAQVD AQYAKCVVLL TEEWVEFPNY PASIAQDQVD LIVQVDEVGD PEKITAGAIR
LSSNPRELLI ARQAANVIEH SGYFCDGFSL QTGTGGASLA VTRFLEDKMR RHNITASFGL
GGITGTMVDL HEKGLIKALL DTQSFDGDAA RSLAQNPHHI EISTNQYANP VSKGAACERL
NVVMLSALEI DVNFNVNVMT GSNGVLRGAS GGHSDTAAGA DLTIITAPLV RGRIPCVVEK
VLTTVTPGAS VDVLVTDHGI AVNPARQDLL DNLRAAGVAL MTIEQLQQRA EQLTGKPQPI
EFTDRVVAVV RYRDGSVIDV IRQVKG