Gene SNSL254_A0066 details

Gene Information

Locus tag: SNSL254_A0066
Symbol: citF
ID: 6486259
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 70870
End bp: 72390
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 55%
IMG OID: 642735510
Product: citrate lyase, alpha subunit
Protein accession: YP_002039292
Protein GI: 194442210
COG category: [C] Energy production and conversion
COG ID: [COG3051] Citrate lyase, alpha subunit
TIGRFAM ID: [TIGR01584] citrate lyase, alpha subunit
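
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the gene ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 70870
END_BP = 72390
GENE_LENGTH_BP = 1521
PROTEIN_LENGTH_AA = 506


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Both derived values agree with the fields reported in the record.
computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = coding_codons(GENE_LENGTH_BP)
```

Under these assumptions, 72390 - 70870 + 1 gives the reported 1521 bp, and 1521 / 3 - 1 gives the reported 506 aa.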


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAGA CTGTAACCAT GCTCAACCAG CAATATGTCG TACCGGAAGG ATTACAGCCC 
TACCAGGGGG TAACAGCGAA CAGCCCGTGG CTTGCCAGTG AAACGGAAAA GCGTCGGCGT
AAAATTTGTG ACTCGCTTGA AGAGGCTATC CGTCGTTCCG GGCTTAAAAA TGGCATGACG
ATATCCTTCC ACCATGCGTT TCGTGGTGGC GACAAAGTTG TCAATATGGT GATGGCGAAG
CTGGCGGAAA TGGGATTCCG CGATCTGACG CTTGCCTCCA GCTCATTAAT TGATGCCCAC
TGGCCGCTTA TCGAACACAT TAAAAATGGC GTGGTGCGGC AGATCTACAC CTCCGGGCTG
CGCGGCAAGT TAGGGGAAGA GATCTCCGCT GGCCTGATGG AAAACCCGGT ACAGATCCAC
TCTCACGGCG GCCGCGTAAA GCTGATTCAG AGCGGTGAGC TGAATATTGA TGTCGCTTTC
CTCGGCGTGC CGTGCTGCGA TGAATTTGGT AATGCTAACG GCTTTAGCGG CAAGTCACGC
TGCGGCTCTT TGGGATATGC GCAGGTTGAT GCGCAGTATG CTAAATGTGT GGTGCTCCTG
ACTGAAGAGT GGGTCGAATT TCCCAATTAT CCGGCCAGTA TCGCGCAGGA TCAGGTTGAT
TTGATCGTAC AGGTGGACGA AGTAGGGGAT CCGGAAAAGA TCACCGCGGG CGCGATTCGT
CTGTCCAGCA ACCCGCGTGA ACTGTTGATC GCCCGGCAGG CGGCTAATGT GATCGAACAC
TCAGGTTATT TTTGTGACGG TTTCTCGCTG CAAACCGGCA CCGGTGGCGC ATCTCTGGCG
GTCACCCGTT TCCTGGAAGA CAAAATGCGT CGCCACAACA TTACGGCGAG TTTTGGCCTG
GGCGGGATCA CCGGCACCAT GGTGGATCTG CATGAGAAAG GGTTGATCAA AGCGCTGCTG
GATACGCAAT CATTCGATGG CGATGCAGCC CGTTCGCTGG CGCAGAATCC ACATCATATT
GAAATCTCAA CCAATCAATA CGCGAACCCC GCCTCTAAAG GCGCGGCATG CGAACGCCTG
AATGTGGTGA TGCTGAGCGC GCTGGAAATT GACGTGAATT TCAACGTTAA CGTGATGACC
GGTTCAAACG GCGTACTGCG TGGCGCATCG GGCGGCCATA GCGATACGGC AGCAGGGGCC
GATCTCACCA TTATTACCGC ACCGCTGGTC CGTGGCCGAA TCCCATGTGT GGTGGAAAAA
GTGCTGACTA CCGTCACGCC TGGGGCAAGT GTTGATGTGC TGGTTACCGA CCATGGTATT
GCAGTGAACC CTGCGCGTCA GGATCTGCTT GATAACCTGC GTGCTGCGGG CGTGGCGCTG
ATGACCATCG AACAACTGCA ACAGCGCGCT GAGCAACTGA CCGGTAAACC GCAGCCGATT
GAGTTTACCG ATCGGGTGGT GGCTGTGGTG CGTTATCGCG ACGGTTCAGT GATTGACGTG
ATTCGCCAGG TTAAAGGCTG A
 
Protein sequence
MKETVTMLNQ QYVVPEGLQP YQGVTANSPW LASETEKRRR KICDSLEEAI RRSGLKNGMT 
ISFHHAFRGG DKVVNMVMAK LAEMGFRDLT LASSSLIDAH WPLIEHIKNG VVRQIYTSGL
RGKLGEEISA GLMENPVQIH SHGGRVKLIQ SGELNIDVAF LGVPCCDEFG NANGFSGKSR
CGSLGYAQVD AQYAKCVVLL TEEWVEFPNY PASIAQDQVD LIVQVDEVGD PEKITAGAIR
LSSNPRELLI ARQAANVIEH SGYFCDGFSL QTGTGGASLA VTRFLEDKMR RHNITASFGL
GGITGTMVDL HEKGLIKALL DTQSFDGDAA RSLAQNPHHI EISTNQYANP ASKGAACERL
NVVMLSALEI DVNFNVNVMT GSNGVLRGAS GGHSDTAAGA DLTIITAPLV RGRIPCVVEK
VLTTVTPGAS VDVLVTDHGI AVNPARQDLL DNLRAAGVAL MTIEQLQQRA EQLTGKPQPI
EFTDRVVAVV RYRDGSVIDV IRQVKG
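
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the database's own method. It builds the codon table by hand from the compact TCAG ordering rather than using a bioinformatics library. Translation table 11 shares its codon assignments with the standard code and differs only in the permitted start codons, which does not matter here because this gene begins with ATG. The `translate` function is a hypothetical helper.

```python
# Codon-to-amino-acid assignments in the compact TCAG ordering
# (first base varies slowest, third base fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in this
    record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAAAGAGA CTGTAACCAT GCTCAACCAG"
n_terminus = translate(first_blocks)
```

Translating the first three blocks yields MKETVTMLNQ, which matches the first block of the protein sequence. Passing the full gene sequence to the same function should give the whole protein for comparison.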