Gene SeSA_A4739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A4739 
Symbol 
ID6519263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp4604015 
End bp4605046 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content50% 
IMG OID642749671 
ProductL-idonate 5-dehydrogenase 
Protein accessionYP_002117404 
Protein GI194734309 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.130148 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTAA AAACTCAATC CTGCGTTGTT GCGGGTAAGC GTGCTGTTGC CGTTACGGAA 
CAAAATATTG AATGGAATAA TAAAGGAACA CTCGTACAAA TTACCCGAGG CGGCATTTGT
GGGTCTGACT TACATTATTA TCAGGAAGGC AAAGTCGGCA ATTTTACAGT AAAAGCGCCA
ATGATTTTAG GTCATGAAGT GATTGGCAAA ATCGTTCATA GCGACTCAGA TTTATTACGT
GAAGGACAAC CGGTAGCGAT TAATCCATCG AAGCCTTGCG GTCATTGCAA ATACTGTCTG
CAGCATGAAG AAAACCACTG TACTGAAATG CGTTTCTTTG GCAGCGCCAT GTATTTTCCG
CATGTCGATG GCGGTTTTAC CCGATTTAAA TCTGTCGATA CCGTTCAGTG CATTCCCTGG
CCGGAACAGG CAGACGAAAA AGCCATGGCC TTTGCCGAAC CGCTGGCGGT TGCCATTCAT
GCGGCTCATG AGGCGGGCGA TCTGCAAGGC AAACGCGTCT TTATCTCCGG CGTTGGCCCT
ATCGGCTGCC TGATTGTTAG CGCGGTAAAA ACGCTGGGCG CAGCGGAAGT GGTATGTGCT
GATATCAGTA CCCGTTCTCT CTCGCTGGCC CGGCAGATGG GCGCGGATAC GCTGGTAAAC
CCACAGCATG ACTCTCTTGA TGGCTGGAAA GCAGAAAAAG GGTATTTCGA TATCAGTTTT
GAAGTCTCCG GGCATCCTTC CTCAATCTCA ACGTGTCTGG AAGTCACACG GGCAAAAGGC
GTGATGGTGC AGGTTGGCAT GGGCGGCGCA GTCCCCAACT TCCCGATGAT GATGGTAATA
AGCAAAGAGA TCTCCCTGAA AGGCTCTTTC CGCTTTACTA CCGAATTTAA TACTGCGGTT
TCCTGGCTTG CCAACCGCGT TATCAATCCG CTGCCGTTAC TGAGCGCGGA ATATCCATTT
ACCGACCTGG AAGCGGCGCT GATCTTTGCC GGAGACAAAA CACAGGCGGC AAAAGTTCAG
CTCGTTTTCT GA
 
Protein sequence
MEVKTQSCVV AGKRAVAVTE QNIEWNNKGT LVQITRGGIC GSDLHYYQEG KVGNFTVKAP 
MILGHEVIGK IVHSDSDLLR EGQPVAINPS KPCGHCKYCL QHEENHCTEM RFFGSAMYFP
HVDGGFTRFK SVDTVQCIPW PEQADEKAMA FAEPLAVAIH AAHEAGDLQG KRVFISGVGP
IGCLIVSAVK TLGAAEVVCA DISTRSLSLA RQMGADTLVN PQHDSLDGWK AEKGYFDISF
EVSGHPSSIS TCLEVTRAKG VMVQVGMGGA VPNFPMMMVI SKEISLKGSF RFTTEFNTAV
SWLANRVINP LPLLSAEYPF TDLEAALIFA GDKTQAAKVQ LVF