Gene SNSL254_A3946 details

Gene Information

Locus tag: SNSL254_A3946
Symbol:
ID: 6483276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3827989
End bp: 3828987
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 55%
IMG OID: 642739206
Product: 2,3-diketo-L-gulonate reductase
Protein accession: YP_002042916
Protein GI: 194443785
COG category: [C] Energy production and conversion
COG ID: [COG2055] Malate/L-lactate dehydrogenases
TIGRFAM ID:
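
The gene and protein lengths in this record follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (which matches the 999 bp figure) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Coordinates taken from this record.
length = gene_length_bp(3827989, 3828987)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

With the coordinates above, this gives 999 bp and 332 aa, in agreement with the Gene Length and Protein Length fields.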


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTAA CTTTCGAAGA GTTAAAAGGG GCCTTCTACC GCGTCTTGCG GTCGCGGAAT 
ATTGCGGAAG ATACCGCCGA CAAGTGCGCG GAAATGTTCG CTCGCACCAC CGAGTCCGGT
GTCTATTCCC ACGGCGTGAA CCGCTTTCCT CGCTTCATCC AGCAACTGGA TAACGGCGAC
ATTATTCCTG ATGCTAAACC GCAGCGGGTT ACCAGCCTCG GCGCCATCGA ACAGTGGGAT
GCTCAGCGCG CTATCGGCAA CCTGACGGCG AAAAAGATGA TGGACCGGGC CATCGAGCTG
GCTTCCGATC ATGGTATTGG CCTGGTGGCG TTACGTAATG CTAACCACTG GATGCGCGGC
GGCAGCTACG GCTGGCAGGC GGCGGAAAAA GGCTATATCG GCATTTGCTG GACCAACTCC
ATCGCCGTCA TGCCGCCGTG GGGGGCGAAA GAGTGCCGTA TCGGTACCAA TCCGCTGATC
GTCGCCATTC CGTCTACGCC GATCACGATG GTAGATATGT CGATGTCGAT GTTCTCCTAC
GGAATGTTAG AAGTTAACCG TCTGGCGGGC CGCGAACTGC CGGTGGATGG CGGTTTCGAC
GATAACGGTC AGTTGACCAA AGAACCGGGC GTTATCGAGA AAAATCGCCG CATTTTACCA
ATGGGTTACT GGAAAGGATC TGGTCTGTCG ATTGTGCTGG ACATGATTGC CACCCTGCTT
TCTAACGGTT CTTCCGTTGC CGAAGTGACC CAGGAAAACA GCGATGAGTA TGGCGTCTCA
CAGATTTTCA TCGCCATAGA AGTGGATAAG CTGATCGATG GCGCAACCCG CGATGCCAAA
CTGCAGCGGA TTATGGATTT CATCACCACT GCTGAACGCG CCGACGACAA CGTCGCGATT
CGGCTGCCCG GCCACGAATT TACCAAATTG CTGGATGACA ACCGCCGTCA CGGTATCACC
ATTGACGACA GCGTCTGGGC CAAAATTCAG GCGCTGTAA
 
Protein sequence
MKVTFEELKG AFYRVLRSRN IAEDTADKCA EMFARTTESG VYSHGVNRFP RFIQQLDNGD 
IIPDAKPQRV TSLGAIEQWD AQRAIGNLTA KKMMDRAIEL ASDHGIGLVA LRNANHWMRG
GSYGWQAAEK GYIGICWTNS IAVMPPWGAK ECRIGTNPLI VAIPSTPITM VDMSMSMFSY
GMLEVNRLAG RELPVDGGFD DNGQLTKEPG VIEKNRRILP MGYWKGSGLS IVLDMIATLL
SNGSSVAEVT QENSDEYGVS QIFIAIEVDK LIDGATRDAK LQRIMDFITT AERADDNVAI
RLPGHEFTKL LDDNRRHGIT IDDSVWAKIQ AL
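
The protein sequence and GC content fields can be rechecked from the gene sequence. The sketch below is one way to do that, under stated assumptions: the sequence is pasted in the 10-base blocks shown above (whitespace is stripped), the codon assignments are those of NCBI translation table 11 (identical to the standard table for elongation codons), and the first codon is ATG. Alternative start codons such as GTG or TTG, which table 11 permits, are not handled. All names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGAAAGTAA CTTTCGAAGA GTTAAAAGGG"))  # MKVTFEELKG
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison against the Protein sequence and GC content fields of the record.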