Gene SeD_A4476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4476 
Symbol 
ID6871940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4314038 
End bp4315081 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content50% 
IMG OID642787391 
Producthypothetical protein 
Protein accessionYP_002218002 
Protein GI198245841 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones89 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAATTTA TGGAGAAAAA AATGGCAAGA CACAGCATTA AAATGATCGC CTTACTCACT 
GCGTTTGGTC TGACATCTGC GGTAATGACC GTACAGGCGG CAGAGCGGAT TGCTTTTATT
CCCAAACTGG TTGGCGTGGG CTTTTTTACC AGCGGCGGCA ATGGCGCGCA GGAAGCGGGA
AAAGCGCTGG GCATTGACGT AACTTACGAT GGCCCTACAG AGCCCAGCGT CTCAGGCCAG
GTTCAACTGG TGAATAACTT TGTCAATCAG GGGTATGACG CCATTATCGT TTCTGCCGTT
TCGCCTGATG GCCTGTGCCC GGCGTTGAAG CGGGCAATGC AAAGAGGCGT GAAAATATTA
ACCTGGGATT CCGATACCAA GCCGGAGTGC CGTTCTTACT ATATCAATCA AGGGACGCCA
AAACAGCTCG GCAGCATGCT GGTAGAGATG GTCGCTCATC AGGTGGACAA AGAGAAAGCG
AAAGTCGCTT TCTTCTATTC CAGCCCAACG GTGACCGACC AGAACCAGTG GGTAAAAGAA
GCTAAAGCCA AAATTAGCCA GGAACATCCG GGGTGGGAGA TAGTCACTAC CCAGTTTGGC
TATAACGATG CCACGAAATC GCTCCAGACG GCGGAAGGTA TCATCAAAGC GTATCCCGAT
CTGGATGCCA TCATCGCGCC TGACGCTAAC GCTTTACCTG CTGCGGCACA GGCGGCGGAG
AACCTTAAAC GTAATAATCT CGCGATTGTT GGTTTTAGTA CGCCGAATGT GATGCGCCCT
TATGTTCAGC GCGGCACTGT TAAAGAGTTT GGCCTGTGGG ATGTCGTCCA ACAGGGAAAA
ATTTCCGTAT ATGTCGCCAA CGCGTTGCTG AAAAATATGC CAATGAATGT CGGTGACTCA
CTGGATATTC CCGGCATCGG CAAAGTCACC GTTTCACCTA ATAGTGAGCA GGGATATCAC
TATGAGGCAA AAGGTAACGG CATTGTGTTA TTGCCGGAGC GTGTCATTTT CAACAAAGAC
AATATCGACA AATATGATTT CTGA
 
Protein sequence
MKFMEKKMAR HSIKMIALLT AFGLTSAVMT VQAAERIAFI PKLVGVGFFT SGGNGAQEAG 
KALGIDVTYD GPTEPSVSGQ VQLVNNFVNQ GYDAIIVSAV SPDGLCPALK RAMQRGVKIL
TWDSDTKPEC RSYYINQGTP KQLGSMLVEM VAHQVDKEKA KVAFFYSSPT VTDQNQWVKE
AKAKISQEHP GWEIVTTQFG YNDATKSLQT AEGIIKAYPD LDAIIAPDAN ALPAAAQAAE
NLKRNNLAIV GFSTPNVMRP YVQRGTVKEF GLWDVVQQGK ISVYVANALL KNMPMNVGDS
LDIPGIGKVT VSPNSEQGYH YEAKGNGIVL LPERVIFNKD NIDKYDF