Gene SNSL254_A0123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0123 
SymbolleuC 
ID6484363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp131456 
End bp132856 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content59% 
IMG OID642735562 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_002039344 
Protein GI194443917 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.610682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value0.713431 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAA CGTTATACGA AAAATTATTT GATGCCCACG TGGTCTTTGA GGCGCCAAAC 
GAAACGCCGC TGCTGTACAT CGACCGCCAC CTGGTGCATG AAGTCACCTC TCCGCAGGCG
TTTGACGGTC TGCGCGCGCA CCATCGTCCG GTACGTCAGC CAGGGAAAAC CTTCGCTACG
ATGGATCACA ACGTCTCGAC GCAGACTAAA GACATTAATG CTTCCGGTGA AATGGCGCGT
ATCCAGATGC AGGAGCTGAT TAAGAACTGT AACGAGTTCG GCGTCGAGCT GTATGACCTG
AATCACCCAT ATCAGGGCAT CGTCCATGTG ATGGGGCCGG AACAGGGCGT CACCCTGCCG
GGCATGACCA TCGTCTGCGG CGACTCCCAC ACCGCCACCC ACGGCGCGTT TGGTGCGCTG
GCCTTCGGCA TCGGCACTTC TGAGGTAGAA CATGTACTGG CGACGCAAAC CCTGAAACAG
GGACGCGCTA AAACCATGAA GATTGAAGTC ACGGGCAACG CCGCGCCGGG CATTACCGCC
AAAGACATCG TGCTGGCGAT CATCGGTAAA ACCGGTAGCG CCGGCGGCAC CGGACACGTG
GTTGAATTTT GCGGCGACGC TATCCGCGCG CTGAGTATGG AAGGCCGCAT GACGCTGTGC
AATATGGCGA TTGAGATGGG CGCCAAAGCC GGTCTGGTCG CCCCGGATGA AACCACTTTC
AACTACGTAA AAGGGCGTTT GCACGCGCCG AAGGGCCGCG ATTTTGACGA AGCCGTCGAG
TACTGGAAAA CGCTGAAAAC CGATGACGGC GCGACCTTTG ATACTGTCGT CGCCCTGCGA
GCAGAAGAGA TCGCGCCGCA GGTGACCTGG GGCACGAATC CGGGCCAGGT GATTTCCGTC
ACCGACATCA TCCCCGATCC CGCCTCCTTT AGCGATCCGG TTGAGCGCGC CAGCGCCGAA
AAAGCGCTGG CTTATATGGG CTTACAGCCG GGCGTACCGT TAACGGACGT TGCTATCGAT
AAAGTCTTTA TCGGCTCTTG TACCAATTCA CGCATTGAAG ATTTGCGCGC GGCGGCGGAA
GTCGCCAAAG GGCGCAAAGT TGCGCCGGGC GTGCAGGCGC TGGTGGTGCC GGGTTCAGGT
CCGGTGAAAG CGCAGGCGGA AGCGGAAGGT CTGGACAAGA TCTTTATCGA AGCAGGATTT
GAATGGCGCT TACCGGGCTG TTCCATGTGC CTGGCCATGA ATAACGACCG CCTGAACCCG
GGCGAGCGCT GCGCATCCAC CAGCAACCGT AACTTTGAAG GCCGTCAGGG CCGCGGGGGG
CGCACTCACC TGGTTAGCCC GGCGATGGCC GCCGCTGCCG CCGTTACCGG CCACTTCGCC
GATATTCGCA GCATCAAATA A
 
Protein sequence
MAKTLYEKLF DAHVVFEAPN ETPLLYIDRH LVHEVTSPQA FDGLRAHHRP VRQPGKTFAT 
MDHNVSTQTK DINASGEMAR IQMQELIKNC NEFGVELYDL NHPYQGIVHV MGPEQGVTLP
GMTIVCGDSH TATHGAFGAL AFGIGTSEVE HVLATQTLKQ GRAKTMKIEV TGNAAPGITA
KDIVLAIIGK TGSAGGTGHV VEFCGDAIRA LSMEGRMTLC NMAIEMGAKA GLVAPDETTF
NYVKGRLHAP KGRDFDEAVE YWKTLKTDDG ATFDTVVALR AEEIAPQVTW GTNPGQVISV
TDIIPDPASF SDPVERASAE KALAYMGLQP GVPLTDVAID KVFIGSCTNS RIEDLRAAAE
VAKGRKVAPG VQALVVPGSG PVKAQAEAEG LDKIFIEAGF EWRLPGCSMC LAMNNDRLNP
GERCASTSNR NFEGRQGRGG RTHLVSPAMA AAAAVTGHFA DIRSIK