Gene SeD_A0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0120 
SymbolleuC 
ID6872545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp125289 
End bp126689 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content59% 
IMG OID642783370 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_002214064 
Protein GI198245083 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAA CGTTATACGA AAAATTATTT GATGCCCACG TGGTCTTTGA GGCGCCAAAC 
GAAACGCCGC TGCTGTACAT CGACCGCCAC CTGGTGCATG AAGTCACCTC TCCGCAGGCG
TTTGACGGTC TGCGCGCGCA CCATCGTCCG GTACGTCAGC CAGGGAAAAC CTTCGCCACG
ATGGATCACA ACGTCTCGAC GCAGACTAAA GACATTAATG CTTCCGGTGA AATGGCGCGT
ATCCAGATGC AGGAGCTGAT TAAGAACTGT AACGAGTTCG GCGTCGAGCT GTATGACCTG
AATCACCCAT ATCAGGGCAT CGTCCATGTG ATGGGGCCGG AACAGGGCGT CACCCTGCCG
GGCATGACCA TCGTCTGCGG CGACTCCCAC ACCGCCACCC ACGGCGCGTT TGGCGCGCTG
GCCTTCGGCA TCGGCACTTC TGAGGTAGAA CATGTACTGG CGACGCAAAC CCTGAAACAG
GGGCGCGCTA AAACCATGAA GATTGAAGTC ACGGGCAACG CCGCGCCGGG CATTACCGCC
AAAGACATCG TGCTGGCGAT CATCGGTAAA ACCGGCAGCG CCGGCGGCAC CGGGCACGTG
GTTGAATTTT GCGGCGACGC GATCCGCGCG CTGAGTATGG AAGGCCGCAT GACGCTGTGC
AATATGGCGA TTGAGATGGG CGCCAAAGCC GGTCTGGTCG CCCCGGATGA AACCACTTTT
AACTACGTAA AAGGGCGTTT GCACGCGCCG AAGGGCCGCG ATTTTGACGA AGCCGTCGAG
TACTGGAAAA CGCTGAAAAC CGATGACGGC GCGACCTTTG ATACTGTCGT CACCCTGCGA
GCAGAAGAGA TCGCGCCGCA GGTGACCTGG GGCACGAATC CGGGCCAGGT GATTTCCGTC
ACCGACATCA TCCCCGATCC CGCCTCTTTT AGCGATCCGG TTGAGCGCGC CAGCGCCGAA
AAAGCGCTGG CTTATATGGG CTTACAGCCG GGCGTACCGT TAACGGACGT TGCTATTGAT
AAAGTCTTTA TCGGCTCTTG TACCAATTCG CGTATTGAAG ATTTGCGCGC GGCGGCGGAA
GTCGCCAAAG GGCGCAAAGT TGCGCCTGGC GTGCAGGCGC TGGTGGTGCC GGGTTCAGGT
CCGGTGAAAG CGCAGGCGGA AGCGGAAGGT CTGGACAAGA TCTTTATCGA AGCAGGATTT
GAATGGCGCT TACCGGGCTG TTCCATGTGC CTGGCCATGA ATAATGACCG CCTGAACCCG
GGCGAGCGCT GCGCCTCCAC CAGCAACCGT AACTTTGAAG GCCGTCAGGG CCGCGGGGGT
CGCACGCATT TAGTCAGCCC GGCGATGGCC GCCGCTGCCG CCGTTACCGG CCACTTCGCC
GACATTCGCA GCATCAAATA A
 
Protein sequence
MAKTLYEKLF DAHVVFEAPN ETPLLYIDRH LVHEVTSPQA FDGLRAHHRP VRQPGKTFAT 
MDHNVSTQTK DINASGEMAR IQMQELIKNC NEFGVELYDL NHPYQGIVHV MGPEQGVTLP
GMTIVCGDSH TATHGAFGAL AFGIGTSEVE HVLATQTLKQ GRAKTMKIEV TGNAAPGITA
KDIVLAIIGK TGSAGGTGHV VEFCGDAIRA LSMEGRMTLC NMAIEMGAKA GLVAPDETTF
NYVKGRLHAP KGRDFDEAVE YWKTLKTDDG ATFDTVVTLR AEEIAPQVTW GTNPGQVISV
TDIIPDPASF SDPVERASAE KALAYMGLQP GVPLTDVAID KVFIGSCTNS RIEDLRAAAE
VAKGRKVAPG VQALVVPGSG PVKAQAEAEG LDKIFIEAGF EWRLPGCSMC LAMNNDRLNP
GERCASTSNR NFEGRQGRGG RTHLVSPAMA AAAAVTGHFA DIRSIK