Gene SeD_A4871 details

Gene Information

Locus tag: SeD_A4871
Symbol:
ID: 6871833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4722215
End bp: 4723234
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 56%
IMG OID: 642787755
Product: hypothetical protein
Protein accession: YP_002218349
Protein GI: 198242573
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID:
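
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (an assumption that matches the numbers in this record) and that the last codon of the unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4722215, 4723234)
print(length, protein_length(length))
```

With the values from this record, the sketch reproduces the listed gene length of 1020 bp and protein length of 339 aa.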


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.0380128
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATAA TAAAAAGCTA CGCCGCCAAA GAGGCGGGCG GCGAACTCGA ACTCTATGAA 
TATGACGCGG GAGAACTCCA ACCGGAAGAT GTCGAGGTAC GGGTCGACTA CTGCGGGATC
TGCCATTCCG ATCTGTCAAT GATCGACAAT GAATGGGGGT TCTCTCAATA CCCTCTGGTT
GCCGGACATG AGGTCATCGG TCGGGTGGCC GCACTCGGTA GCGCGGCACA GGATAAGGGA
CTAAAAGTCG GCCAGCGCGT TGGAATCGGC TGGACGGCGC GCAGCTGCGG ACACTGCGAT
GCCTGTATCA GCGGCAATCA AATTAACTGC CTGGAAGGGG CAGTGCCCAC TATCCTCAAT
CGTGGCGGTT TTGCCGAGAA GCTTCGCGCG GGCTGGCAGT GGGTAATTCC TCTTCCGGAG
AATATGGATA TGGCGTCCGC AGGCCCGCTG TTATGTGGCG GCATTACGGT CTTTAAACCG
CTACTGATGC ACCATATTAC TGCTACCAGC CGCGTTGGCG TCATCGGTAT TGGCGGGCTT
GGGCATATCG CCATAAAGCT GTTACATGCA ATGGGCTGCG AAGTCACCGC GTTCAGCTCC
AATCCATCGA AAGAGCAGGA AGTGCTGGCG ATGGGTGCCA ATAACGTGGT GAACAGCCGC
GATCCGGAAG CGTTAAAAGC ACTGGCGGGC CAGTTCGATC TCATTATTAA CACGGTCAAC
GTCGATCTCG ACTGGCAGCC CTACTTCGAA GCGCTGACGT ATGGCGGCAA CTTCCATACC
GTTGGGGCCG TATTGAAGCC GCTGCCCGTA CCGGCGTTTA CATTAATTGC CGGCGATCGC
AGTATCTCAG GCTCGGCAAC CGGAACGCCA TATGAACTTC GCAAACTGAT GAAATTCGCC
GGACGCAGCA AAGTCGCGCC CACCACGGAA CTGTTCGCGA TGTCACAAAT CAACGAGGCT
ATCCAGCACG TTCGCGACGG CAAAGCCCGC TATCGTGTAG TGCTAAAAGC TGACTTCTGA
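
The sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any calculation. As a rough sketch of how a GC content figure like the one in the Gene Information table could be derived, the hypothetical helper below strips the whitespace and returns the fraction of G and C bases. It is an illustration only, not the method the database itself uses.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Fraction of G/C bases in a sequence that may contain spaces and newlines."""
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# The first two blocks of the gene sequence, used here as a small example.
print(gc_fraction("ATGACGATAA TAAAAAGCTA"))
```

Passing the full gene sequence, with its spaces and line breaks left in place, gives the fraction for the whole gene.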
 
Protein sequence
MTIIKSYAAK EAGGELELYE YDAGELQPED VEVRVDYCGI CHSDLSMIDN EWGFSQYPLV 
AGHEVIGRVA ALGSAAQDKG LKVGQRVGIG WTARSCGHCD ACISGNQINC LEGAVPTILN
RGGFAEKLRA GWQWVIPLPE NMDMASAGPL LCGGITVFKP LLMHHITATS RVGVIGIGGL
GHIAIKLLHA MGCEVTAFSS NPSKEQEVLA MGANNVVNSR DPEALKALAG QFDLIINTVN
VDLDWQPYFE ALTYGGNFHT VGAVLKPLPV PAFTLIAGDR SISGSATGTP YELRKLMKFA
GRSKVAPTTE LFAMSQINEA IQHVRDGKAR YRVVLKADF
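
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce that translation in plain Python. It assumes the compact NCBI ordering of bases (T, C, A, G) for building the codon table, and it does not handle the alternative start codons that table 11 allows, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGACGATAA TAAAAAGCTA CGCCGCCAAA"))  # MTIIKSYAAK
```

The ten residues produced from the first thirty bases match the start of the protein sequence listed above.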