Gene SeSA_A3866 details


Gene Information

Locus tag: SeSA_A3866
Symbol:
ID: 6518946
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 3733833
End bp: 3734831
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 56%
IMG OID: 642748839
Product: 2,3-diketo-L-gulonate reductase
Protein accession: YP_002116602
Protein GI: 194738073
COG category: [C] Energy production and conversion
COG ID: [COG2055] Malate/L-lactate dehydrogenases
TIGRFAM ID:
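
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but this is the reading that reproduces the listed 999 bp) and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3733833, 3734831)
print(length)                  # 999
print(protein_length(length))  # 332
```

Both results agree with the Gene Length and Protein Length fields of the record.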


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTAA CTTTCGAAGA GTTAAAAGGG GCCTTCTACC GCGTCTTGCG GTCGCGGAAT 
ATTGCGGAAG ATACCGCCGA CGCCTGCGCG GAAATGTTCG CTCGCACCAC CGAGTCCGGT
GTCTATTCCC ACGGCGTGAA CCGCTTTCCC CGCTTCATTC AGCAACTAGA TAACGGCGAC
ATTATTCCTG ATGCTAAACC GCAGCGGGTT ACCAGCCTCG GCGCCATCGA ACAGTGGGAT
GCTCAGCGCG CTATCGGCAA CCTGACGGCG AAAAAGATGA TGGACCGGGC CATCGAGCTG
GCTTCCGATC ATGGTATTGG CCTGGTGGCG TTACGTAATG CTAACCACTG GATGCGCGGC
GGCAGCTACG GCTGGCAGGC GGCGGAAAAA GGCTATATCG GCATTTGCTG GACCAACTCC
ATCGCCGTCA TGCCGCCGTG GGGCGCGAAA GAGTGCCGTA TCGGTACCAA TCCGTTGATC
GTCGCCATCC CGTCTACGCC GATCACTATG GTAGATATGT CGATGTCGAT GTTCTCTTAC
GGCATGCTGG AGGTTAACCG CCTGGCCGGC CGCGAACTGC CGGTGGACGG CGGTTTCGAC
GATAACGGTC AGTTGACCAA AGAACCGGGC GTTATCGAGA AAAATCGCCG CATTTTACCG
ATGGGTTACT GGAAAGGATC TGGTCTGTCG ATTGTGCTGG ACATGATTGC CACCCTGCTT
TCCAACGGCT CTTCCGTTGC CGAAGTGACC CAGGAAAACA GCGATGAATA TGGCGTTTCG
CAGATCTTCA TCGCCATAGA AGTGGATAAG CTGATCGATG GCGCAACCCG CGATGCCAAA
CTGCAGCGGA TTATGGATTT CATCACCACC GCTGAACGTG CTGACGACAA CGTCGCGATT
CGGCTGCCCG GCCACGAATT TACCAAATTG CTGGATGACA ACCGCCGTCA CGGTATCACC
ATTGACGACA GCGTCTGGGC CAAAATTCAG GCGCTGTAA
 
Protein sequence
MKVTFEELKG AFYRVLRSRN IAEDTADACA EMFARTTESG VYSHGVNRFP RFIQQLDNGD 
IIPDAKPQRV TSLGAIEQWD AQRAIGNLTA KKMMDRAIEL ASDHGIGLVA LRNANHWMRG
GSYGWQAAEK GYIGICWTNS IAVMPPWGAK ECRIGTNPLI VAIPSTPITM VDMSMSMFSY
GMLEVNRLAG RELPVDGGFD DNGQLTKEPG VIEKNRRILP MGYWKGSGLS IVLDMIATLL
SNGSSVAEVT QENSDEYGVS QIFIAIEVDK LIDGATRDAK LQRIMDFITT AERADDNVAI
RLPGHEFTKL LDDNRRHGIT IDDSVWAKIQ AL
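
The sequence blocks above are printed in groups of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, compute GC content, and translate the gene sequence for comparison with the protein sequence. It is an illustration under stated assumptions, not the method the database used: the helper names are hypothetical, the codon table is written out by hand in the usual T, C, A, G order, and it relies on translation table 11 sharing its amino acid assignments with the standard code (table 11 differs in the start codons it allows, which does not matter here because the gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence as printed above.
fragment = clean_sequence("ATGAAAGTAA CTTTCGAAGA GTTAAAAGGG")
print(translate(fragment))  # MKVTFEELKG
```

The translated fragment matches the first ten residues of the protein sequence. Passing the whole gene sequence block through `clean_sequence` and then `gc_content` or `translate` allows the same check against the GC content and Protein Length fields.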