Gene Information |
Locus tag | SNSL254_A0066 |
Symbol | citF |
ID | 6486259 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 70870 |
End bp | 72390 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642735510 |
Product | citrate lyase, alpha subunit |
Protein accession | YP_002039292 |
Protein GI | 194442210 |
COG category | [C] Energy production and conversion |
COG ID | [COG3051] Citrate lyase, alpha subunit |
TIGRFAM ID | [TIGR01584] citrate lyase, alpha subunit |
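The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(70870, 72390)
print(length, protein_length_aa(length))  # prints: 1521 506
```

Under those assumptions the computed values agree with the Gene Length (1521 bp) and Protein Length (506 aa) rows.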
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAGA CTGTAACCAT GCTCAACCAG CAATATGTCG TACCGGAAGG ATTACAGCCC TACCAGGGGG TAACAGCGAA CAGCCCGTGG CTTGCCAGTG AAACGGAAAA GCGTCGGCGT AAAATTTGTG ACTCGCTTGA AGAGGCTATC CGTCGTTCCG GGCTTAAAAA TGGCATGACG ATATCCTTCC ACCATGCGTT TCGTGGTGGC GACAAAGTTG TCAATATGGT GATGGCGAAG CTGGCGGAAA TGGGATTCCG CGATCTGACG CTTGCCTCCA GCTCATTAAT TGATGCCCAC TGGCCGCTTA TCGAACACAT TAAAAATGGC GTGGTGCGGC AGATCTACAC CTCCGGGCTG CGCGGCAAGT TAGGGGAAGA GATCTCCGCT GGCCTGATGG AAAACCCGGT ACAGATCCAC TCTCACGGCG GCCGCGTAAA GCTGATTCAG AGCGGTGAGC TGAATATTGA TGTCGCTTTC CTCGGCGTGC CGTGCTGCGA TGAATTTGGT AATGCTAACG GCTTTAGCGG CAAGTCACGC TGCGGCTCTT TGGGATATGC GCAGGTTGAT GCGCAGTATG CTAAATGTGT GGTGCTCCTG ACTGAAGAGT GGGTCGAATT TCCCAATTAT CCGGCCAGTA TCGCGCAGGA TCAGGTTGAT TTGATCGTAC AGGTGGACGA AGTAGGGGAT CCGGAAAAGA TCACCGCGGG CGCGATTCGT CTGTCCAGCA ACCCGCGTGA ACTGTTGATC GCCCGGCAGG CGGCTAATGT GATCGAACAC TCAGGTTATT TTTGTGACGG TTTCTCGCTG CAAACCGGCA CCGGTGGCGC ATCTCTGGCG GTCACCCGTT TCCTGGAAGA CAAAATGCGT CGCCACAACA TTACGGCGAG TTTTGGCCTG GGCGGGATCA CCGGCACCAT GGTGGATCTG CATGAGAAAG GGTTGATCAA AGCGCTGCTG GATACGCAAT CATTCGATGG CGATGCAGCC CGTTCGCTGG CGCAGAATCC ACATCATATT GAAATCTCAA CCAATCAATA CGCGAACCCC GCCTCTAAAG GCGCGGCATG CGAACGCCTG AATGTGGTGA TGCTGAGCGC GCTGGAAATT GACGTGAATT TCAACGTTAA CGTGATGACC GGTTCAAACG GCGTACTGCG TGGCGCATCG GGCGGCCATA GCGATACGGC AGCAGGGGCC GATCTCACCA TTATTACCGC ACCGCTGGTC CGTGGCCGAA TCCCATGTGT GGTGGAAAAA GTGCTGACTA CCGTCACGCC TGGGGCAAGT GTTGATGTGC TGGTTACCGA CCATGGTATT GCAGTGAACC CTGCGCGTCA GGATCTGCTT GATAACCTGC GTGCTGCGGG CGTGGCGCTG ATGACCATCG AACAACTGCA ACAGCGCGCT GAGCAACTGA CCGGTAAACC GCAGCCGATT GAGTTTACCG ATCGGGTGGT GGCTGTGGTG CGTTATCGCG ACGGTTCAGT GATTGACGTG ATTCGCCAGG TTAAAGGCTG A
|
Protein sequence | MKETVTMLNQ QYVVPEGLQP YQGVTANSPW LASETEKRRR KICDSLEEAI RRSGLKNGMT ISFHHAFRGG DKVVNMVMAK LAEMGFRDLT LASSSLIDAH WPLIEHIKNG VVRQIYTSGL RGKLGEEISA GLMENPVQIH SHGGRVKLIQ SGELNIDVAF LGVPCCDEFG NANGFSGKSR CGSLGYAQVD AQYAKCVVLL TEEWVEFPNY PASIAQDQVD LIVQVDEVGD PEKITAGAIR LSSNPRELLI ARQAANVIEH SGYFCDGFSL QTGTGGASLA VTRFLEDKMR RHNITASFGL GGITGTMVDL HEKGLIKALL DTQSFDGDAA RSLAQNPHHI EISTNQYANP ASKGAACERL NVVMLSALEI DVNFNVNVMT GSNGVLRGAS GGHSDTAAGA DLTIITAPLV RGRIPCVVEK VLTTVTPGAS VDVLVTDHGI AVNPARQDLL DNLRAAGVAL MTIEQLQQRA EQLTGKPQPI EFTDRVVAVV RYRDGSVIDV IRQVKG
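Both sequences are displayed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of cleaning the gene sequence, computing its GC fraction, and translating it. It assumes the sequence starts in frame at an ATG codon and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons are permitted). All names here are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
head = "ATGAAAGAGA CTGTAACCAT GCTCAACCAG"
print(translate(head))  # prints: MKETVTMLNQ
```

The ten residues produced from the first three blocks match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` is one way to compare against the Protein Length and GC content rows.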