Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A0066 |
Symbol | citF |
ID | 6873719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 70874 |
End bp | 72394 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642783321 |
Product | citrate lyase subunit alpha |
Protein accession | YP_002214015 |
Protein GI | 198246254 |
COG category | [C] Energy production and conversion |
COG ID | [COG3051] Citrate lyase, alpha subunit |
TIGRFAM ID | [TIGR01584] citrate lyase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 0.597387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAGA CTGTAACCAT GCTCAACCAG CAATATGTCG TACCGGAAGG ATTACAGCCC TACCAGGGGG TAACAGCGAA CAGCCCGTGG CTTGCCAGTG AAACGGAAAA GCGTCGGCGT AAAATTTGTG ACTCGCTTGA AGAGGCTATC CGTCGTTCCG GGCTTAAAAA TGGCATGACG ATATCCTTCC ACCATGCGTT TCGTGGTGGC GACAAAGTCG TCAATATGGT GATGGCGAAG CTGGCGGAAA TGGGATTCCG CGATCTGACG CTTGCCTCCA GCTCATTAAT TGATGTCCAC TGGCCGCTTA TCGAACACAT TAAAAATGGC GTGGTGCGGC AGATTTACAC CTCCGGGCTG CGCGGCAAGT TAGGGGAAGA GATCTCCGCT GGCCTGATGG AAAACCCGGT ACAGATCCAC TCTCACGGCG GCCGCGTAAA GCTGATTCAG AGCGGTGAGC TGAATATTGA TGTCGCTTTC CTCGGCGTGC CGTGCTGCGA TGAATTTGGT AATGCTAACG GCTTTAGCGG CAAGTCACGC TGCGGCTCTT TGGGGTATGC GCAGGTTGAT GCGCAGTACG CTAAATGTGT GGTGCTCCTG ACTGAAGAGT GGGTCGAATT CCCCAATTAT CCGGCCAGTA TCGCGCAGGA TCAGGTTGAT TTGATCGTAC AGGTGGACGA AGTAGGGGAT CCGGAAAAAA TCACCGCGGG TGCGATTCGT CTGTCCAGCA ACCCGCGTGA ACTGTTGATC GCCCGGCAGG CGGCCAATGT GATCGAACAC TCAGGTTATT TTTGTGATGG TTTCTCGCTG CAAACTGGCA CCGGTGGCGC TTCTCTGGCG GTCACCCGTT TCCTGGAAGA CAAAATGCGT CGCCACAACA TTACGGCGAG TTTTGGCCTG GGCGGGATCA CCGGCACCAT GGTGGATCTG CATGAGAAAG GGTTGATCAA ATCGCTGCTG GATACGCAAT CATTCGATGG CGATGCAGCC CGTTCACTGG CGCAGAATCC ACATCATATT GAAATCTCAA CCAATCAATA CGCGAACCCC GTCTCTAAAG GCGCGGCATG CGAACGCCTG AATGTGGTGA TGCTGAGCGC ACTGGAAATT GACGTGAATT TCAACGTTAA CGTGATGACA GGTTCAAACG GCGTACTGCG TGGCGCATCG GGCGGCCATA GCGATACGGC AGCAGGGGCC GATCTCACCA TTATTACCGC ACCGCTGGTC CGTGGCCGAA TCCCATGTGT GGTGGAAAAA GTGCTGACTA CCGTCACGCC TGGGGCAAGT GTTGATGTGC TGGTTACCGA CCATGGTATT GCGGTGAACC CTGCGCGTCA GGATCTGCTT GATAACCTGC GTGCTGCGGG TGTGGCGCTG ATGACCATCG AACAACTGCA ACAGCGCGCT GAGCAACTGA CCGGTAAACC GCAGCCGATT GAGTTTACCG ATCGGGTGGT GGCTGTGGTG CGTTATCGCG ACGGTTCAGT GATTGACGTG ATTCGCCAGG TTAAAGGCTG A
|
Protein sequence | MKETVTMLNQ QYVVPEGLQP YQGVTANSPW LASETEKRRR KICDSLEEAI RRSGLKNGMT ISFHHAFRGG DKVVNMVMAK LAEMGFRDLT LASSSLIDVH WPLIEHIKNG VVRQIYTSGL RGKLGEEISA GLMENPVQIH SHGGRVKLIQ SGELNIDVAF LGVPCCDEFG NANGFSGKSR CGSLGYAQVD AQYAKCVVLL TEEWVEFPNY PASIAQDQVD LIVQVDEVGD PEKITAGAIR LSSNPRELLI ARQAANVIEH SGYFCDGFSL QTGTGGASLA VTRFLEDKMR RHNITASFGL GGITGTMVDL HEKGLIKSLL DTQSFDGDAA RSLAQNPHHI EISTNQYANP VSKGAACERL NVVMLSALEI DVNFNVNVMT GSNGVLRGAS GGHSDTAAGA DLTIITAPLV RGRIPCVVEK VLTTVTPGAS VDVLVTDHGI AVNPARQDLL DNLRAAGVAL MTIEQLQQRA EQLTGKPQPI EFTDRVVAVV RYRDGSVIDV IRQVKG
|
| |