Gene SNSL254_A4520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4520 
SymbolaceA 
ID6482560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4393830 
End bp4395134 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content56% 
IMG OID642739746 
Productisocitrate lyase 
Protein accessionYP_002043428 
Protein GI194443373 
COG category[C] Energy production and conversion 
COG ID[COG2224] Isocitrate lyase 
TIGRFAM ID[TIGR01346] isocitrate lyase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCC GTACTCAACA AATCGAAGAA TTACAGAAAG AGTGGACACA ACCGCGCTGG 
GAAGGCATCA CCCGCCCGTA CAGCGCGGAG GAGGTGGTGA AATTACGCGG CTCGGTTAAC
CCGGAGTGCA CGCTGGCGCA GCTCGGCGCC GCGAAAATGT GGCGGCTGTT GCACGGTGAA
GCGAAAAAAG GCTATATCAA CAGCCTTGGC GCGCTGACTG GCGGTCAGGC ATTGCAGCAG
GCGAAAGCTG GTATTGAGGC GATTTATCTT TCAGGCTGGC AGGTGGCGGC AGATGCCAAC
CTGGCATCCA GCATGTATCC GGATCAATCG TTGTACCCGG CAAACTCTGT TCCGGCGGTA
GTGGATCGGA TCAACAACAC TTTCCGTCGT GCAGATCAGA TCCAGTGGGC ATCCGGTATT
GAACCCAACG ATCCGCGCTA TGTGGATTAC TTCCTGCCGA TCGTTGCTGA TGCGGAAGCC
GGTTTTGGCG GCGTTCTGAA TGCCTTCGAA CTGATGAAAT CGATGATTGA AGCCGGTGCA
GCGGCCGTTC ACTTCGAAGA TCAGCTGGCG TCGGTGAAGA AATGCGGCCA TATGGGTGGC
AAGGTGCTGG TCCCCACGCA GGAGGCGATT CAGAAACTGG TTGCTGCGCG TCTGGCCGCT
GATGTGATGG GCGTCCCGAC GCTGGTGATT GCGCGTACCG ATGCGGATGC GGCAGATCTG
ATCACCTCCG ACTGCGATCC CTATGACAGC GGTTTTATTA CCGGCGAACG CACCAGCGAA
GGTTTTTACC GCACCCATGC GGGCATTGAG CAGGCGATCA GCCGCGGTCT GGCGTATGCC
CCGTATGCCG ATCTGGTATG GTGCGAAACC TCTACACCGG ATCTCGAACT GGCGCGTCGT
TTTGCCGATG CTATCCACGC GAAGTATCCG GGCAAACTGC TGGCCTATAA CTGTTCGCCA
TCCTTCAACT GGCAGAAGAA TCTGGACGAC AAGACCATTG CCAGCTTCCA GCAGCAGTTG
TCGGACATGG GTTACAAATA CCAGTTTATT ACCCTGGCGG GTATTCACAG CATGTGGTTC
AACATGTTTG ACCTGGCGCA TGCATACGCT CAGGGCGAGG GCATGAAACA CTATGTTGAG
AAGGTTCAAC AACCCGAGTT CGCCGCGGCG AAAGACGGCT ACACCTTTGT TTCCCACCAG
CAGGAAGTGG GCACTGGTTA CTTCGACAAA GTCACCACCA TTATTCAGGG TGGCGCGTCA
TCCGTTACCG CGTTAACGGG TTCCACCGAA GAATCGCAGT TTTGA
 
Protein sequence
MKTRTQQIEE LQKEWTQPRW EGITRPYSAE EVVKLRGSVN PECTLAQLGA AKMWRLLHGE 
AKKGYINSLG ALTGGQALQQ AKAGIEAIYL SGWQVAADAN LASSMYPDQS LYPANSVPAV
VDRINNTFRR ADQIQWASGI EPNDPRYVDY FLPIVADAEA GFGGVLNAFE LMKSMIEAGA
AAVHFEDQLA SVKKCGHMGG KVLVPTQEAI QKLVAARLAA DVMGVPTLVI ARTDADAADL
ITSDCDPYDS GFITGERTSE GFYRTHAGIE QAISRGLAYA PYADLVWCET STPDLELARR
FADAIHAKYP GKLLAYNCSP SFNWQKNLDD KTIASFQQQL SDMGYKYQFI TLAGIHSMWF
NMFDLAHAYA QGEGMKHYVE KVQQPEFAAA KDGYTFVSHQ QEVGTGYFDK VTTIIQGGAS
SVTALTGSTE ESQF