Gene SNSL254_A4834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4834 
Symbol 
ID6484260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4708304 
End bp4709323 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content56% 
IMG OID642740047 
Producthypothetical protein 
Protein accessionYP_002043725 
Protein GI194445237 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.809336 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATAA TAAAAAGCTA TGCCGCCAAA GAGGCGGGCG GCGAACTCGA ACTCTATGAA 
TATGACGCGG GAGAACTCCA ACCGGAAGAT GTCGAGGTAC GGGTCGACTA CTGCGGGATC
TGCCATTCCG ATCTGTCAAT GATCGACAAT GAATGGGGGT TCTCTCAATA CCCTCTGGTT
GCCGGACATG AGGTCATCGG TCGGTTGGCC GCACTCGGTA GCGCGGCACA GGATAAGGGA
CTAAAAGTCG GCCAGCGCGT TGGAATCGGC TGGACGGCGC GCAGCTGCGG ACACTGCGAT
GCCTGTATCA GCGGCAATCA AATTAACTGC CTGGAAGGGG CAGTGCCCAC TATCCTCAAT
CGTGGCGGTT TTGCCGAGAA GCTTCGCGCA GGCTGGCAGT GGGTAATTCC TCTTCCGGAG
AATATGGATA TGGCGTCCGC AGGCCCGCTG TTATGTGGCG GCATTACGGT CTTTAAACCG
CTACTGATGC ACCATATTAC TGCTACCAGC CGCGTTGGCG TCATCGGTAT CGGCGGGCTG
GGGCATATCG CCATAAAGCT GTTACACGCA ATGGGCTGCG AAGTCACCGC GTTCAGCTCC
AATCCATCGA AGGAGCAGGA GGTGCTGGCG ATGGGGGCCA ATAACGTGGT GAACAGCCGC
GATCCGGAAG CGTTAAAAGC ACTGGCGGGC CAGTTCGATC TCATTATTAA CACGGTCAAC
GTCGATCTCG ACTGGCAGCC CTACTTCGAA GCGCTGACCT ATGGCGGCAA CTTCCATACC
GTTGGGGCCG TATTGAAGCC GCTGCCCGTA CCGGCGTTTA CATTGATTGC CGGCGATCGC
AGTATCTCAG GCTCGGCAAC CGGAACGCCA TATGAACTTC GCAAACTGAT GAAATTCGCC
GGACGCAGCA AAGTCGCGCC CACCACGGAA CTGTTCGCAA TGTCACAAAT CAACGAGGCT
ATTCAGCACG TTCGCGACGG CAAAGCCCGC TATCGTGTAG TGCTAAAAGC TGACTTCTGA
 
Protein sequence
MTIIKSYAAK EAGGELELYE YDAGELQPED VEVRVDYCGI CHSDLSMIDN EWGFSQYPLV 
AGHEVIGRLA ALGSAAQDKG LKVGQRVGIG WTARSCGHCD ACISGNQINC LEGAVPTILN
RGGFAEKLRA GWQWVIPLPE NMDMASAGPL LCGGITVFKP LLMHHITATS RVGVIGIGGL
GHIAIKLLHA MGCEVTAFSS NPSKEQEVLA MGANNVVNSR DPEALKALAG QFDLIINTVN
VDLDWQPYFE ALTYGGNFHT VGAVLKPLPV PAFTLIAGDR SISGSATGTP YELRKLMKFA
GRSKVAPTTE LFAMSQINEA IQHVRDGKAR YRVVLKADF