Gene SeD_A0066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0066 
SymbolcitF 
ID6873719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp70874 
End bp72394 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content55% 
IMG OID642783321 
Productcitrate lyase subunit alpha 
Protein accessionYP_002214015 
Protein GI198246254 
COG category[C] Energy production and conversion 
COG ID[COG3051] Citrate lyase, alpha subunit 
TIGRFAM ID[TIGR01584] citrate lyase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value0.597387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGA CTGTAACCAT GCTCAACCAG CAATATGTCG TACCGGAAGG ATTACAGCCC 
TACCAGGGGG TAACAGCGAA CAGCCCGTGG CTTGCCAGTG AAACGGAAAA GCGTCGGCGT
AAAATTTGTG ACTCGCTTGA AGAGGCTATC CGTCGTTCCG GGCTTAAAAA TGGCATGACG
ATATCCTTCC ACCATGCGTT TCGTGGTGGC GACAAAGTCG TCAATATGGT GATGGCGAAG
CTGGCGGAAA TGGGATTCCG CGATCTGACG CTTGCCTCCA GCTCATTAAT TGATGTCCAC
TGGCCGCTTA TCGAACACAT TAAAAATGGC GTGGTGCGGC AGATTTACAC CTCCGGGCTG
CGCGGCAAGT TAGGGGAAGA GATCTCCGCT GGCCTGATGG AAAACCCGGT ACAGATCCAC
TCTCACGGCG GCCGCGTAAA GCTGATTCAG AGCGGTGAGC TGAATATTGA TGTCGCTTTC
CTCGGCGTGC CGTGCTGCGA TGAATTTGGT AATGCTAACG GCTTTAGCGG CAAGTCACGC
TGCGGCTCTT TGGGGTATGC GCAGGTTGAT GCGCAGTACG CTAAATGTGT GGTGCTCCTG
ACTGAAGAGT GGGTCGAATT CCCCAATTAT CCGGCCAGTA TCGCGCAGGA TCAGGTTGAT
TTGATCGTAC AGGTGGACGA AGTAGGGGAT CCGGAAAAAA TCACCGCGGG TGCGATTCGT
CTGTCCAGCA ACCCGCGTGA ACTGTTGATC GCCCGGCAGG CGGCCAATGT GATCGAACAC
TCAGGTTATT TTTGTGATGG TTTCTCGCTG CAAACTGGCA CCGGTGGCGC TTCTCTGGCG
GTCACCCGTT TCCTGGAAGA CAAAATGCGT CGCCACAACA TTACGGCGAG TTTTGGCCTG
GGCGGGATCA CCGGCACCAT GGTGGATCTG CATGAGAAAG GGTTGATCAA ATCGCTGCTG
GATACGCAAT CATTCGATGG CGATGCAGCC CGTTCACTGG CGCAGAATCC ACATCATATT
GAAATCTCAA CCAATCAATA CGCGAACCCC GTCTCTAAAG GCGCGGCATG CGAACGCCTG
AATGTGGTGA TGCTGAGCGC ACTGGAAATT GACGTGAATT TCAACGTTAA CGTGATGACA
GGTTCAAACG GCGTACTGCG TGGCGCATCG GGCGGCCATA GCGATACGGC AGCAGGGGCC
GATCTCACCA TTATTACCGC ACCGCTGGTC CGTGGCCGAA TCCCATGTGT GGTGGAAAAA
GTGCTGACTA CCGTCACGCC TGGGGCAAGT GTTGATGTGC TGGTTACCGA CCATGGTATT
GCGGTGAACC CTGCGCGTCA GGATCTGCTT GATAACCTGC GTGCTGCGGG TGTGGCGCTG
ATGACCATCG AACAACTGCA ACAGCGCGCT GAGCAACTGA CCGGTAAACC GCAGCCGATT
GAGTTTACCG ATCGGGTGGT GGCTGTGGTG CGTTATCGCG ACGGTTCAGT GATTGACGTG
ATTCGCCAGG TTAAAGGCTG A
 
Protein sequence
MKETVTMLNQ QYVVPEGLQP YQGVTANSPW LASETEKRRR KICDSLEEAI RRSGLKNGMT 
ISFHHAFRGG DKVVNMVMAK LAEMGFRDLT LASSSLIDVH WPLIEHIKNG VVRQIYTSGL
RGKLGEEISA GLMENPVQIH SHGGRVKLIQ SGELNIDVAF LGVPCCDEFG NANGFSGKSR
CGSLGYAQVD AQYAKCVVLL TEEWVEFPNY PASIAQDQVD LIVQVDEVGD PEKITAGAIR
LSSNPRELLI ARQAANVIEH SGYFCDGFSL QTGTGGASLA VTRFLEDKMR RHNITASFGL
GGITGTMVDL HEKGLIKSLL DTQSFDGDAA RSLAQNPHHI EISTNQYANP VSKGAACERL
NVVMLSALEI DVNFNVNVMT GSNGVLRGAS GGHSDTAAGA DLTIITAPLV RGRIPCVVEK
VLTTVTPGAS VDVLVTDHGI AVNPARQDLL DNLRAAGVAL MTIEQLQQRA EQLTGKPQPI
EFTDRVVAVV RYRDGSVIDV IRQVKG