Gene SeD_A4045 details


Gene Information

Locus tag: SeD_A4045
Symbol:
ID: 6870995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3888400
End bp: 3889578
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 53%
IMG OID: 642786994
Product: xylose operon regulatory protein
Protein accession: YP_002217621
Protein GI: 198244615
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators; [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
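
The coordinate and length fields in this record can be checked against each other with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in Protein Length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3888400, End bp 3889578.
print(gene_length(3888400, 3889578))  # 1179, matching Gene Length
print(protein_length(1179))           # 392, matching Protein Length
```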


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.654241
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value: 0.387961
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGATA AACGTCACCG TATCACTCTG TTATTTAACG CGAATAAAGC CTATGACCGT 
CAGGTAGTGG AGGGGGTGGG TGAATATTTA CAAGCCTCGC AATCCGAATG GGATATATTT
ATTGAGGAAG ATTTCCGTGC CCGTATCGAT AACATTAAAG AGTGGTTAGG CGACGGCGTT
ATTGCCGATT ACGATGATGA CGATATCGCG CAATTATTGG CCGATGTCGA CGTACCCATT
GTCGGGGTCG GCGGTTCTTA CCATCTTGCT GAAAATTATC CCGCCGTTCA TTACATCGCC
ACCGATAATC ATGCGCTCGT TGAAAGCGCT TTCCTGCATT TAAAAGAAAA AGGCGTTAAC
CGCTTCGCGT TTTACGGTTT GCCCGACTCC AGCCGCAAAC ATTGGGCGGC GGAACGGGAA
TACGCCTTTC GCCAGCTGGT CGCCGAGGAA AAATACCGCG GCGTAGTCTA TCAGGGGCTG
GAAACCGCGC CGGAAAACTG GCAGCACGCG CAAAATCGCC TCGCCGACTG GCTTCAGACG
CTGCCGCCGC AAACCGGCAT CATTGCCGTA ACGGATGCCC GCGCCCGTCA CGTATTGCAG
GCCTGTGAAC ACCTGCATAT TCCGGTGCCG GAAAAACTTT GCGTTATCGG TATTGATAAC
GAAGAGTTAA CCCGTTATCT GTCGCGCGTC GCGCTTTCCT CCGTCGCGCA GGGGGCGCGG
CAAATGGGCT ATCAGGCGGC GAAGCTGCTG CACCGTTTGC TGGCGCGCGA AGAGATGCCG
TTACAGCGCA TTCTGGTGCC GCCGGTGCGC GTCATTGCGC GCCGCTCGAC AGACTATCGC
TCCCTGACCG ATCCGGCGGT TATCCAGGCG ATGCACTTTA TTCGTAACCA TGCCTGTAAG
GGCATTAAAG TCGAGCAAGT GCTGGACGCG GTTGGGATTT CACGTTCAAA CCTGGAAAAA
CGTTTTAAGG AAGAAGTTGG CGAGACGATA CATGCGCTGA TCCACGCCGA AAAGCTGGAA
AAAGCGCGTA GTTTGTTGAT TTCCACCACG TTGGCGATAA ACGAAATTTC GCAAATGTGC
GGCTACCCGT CACTGCAATA TTTCTATTCG GTGTTTAAAA AGGAGTACGT CACTACGCCT
AAGGAGTATC GCGACCAGCA TAGTGAAGCG TTGTTGTAG
 
Protein sequence
MFDKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRARID NIKEWLGDGV 
IADYDDDDIA QLLADVDVPI VGVGGSYHLA ENYPAVHYIA TDNHALVESA FLHLKEKGVN
RFAFYGLPDS SRKHWAAERE YAFRQLVAEE KYRGVVYQGL ETAPENWQHA QNRLADWLQT
LPPQTGIIAV TDARARHVLQ ACEHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGAR
QMGYQAAKLL HRLLAREEMP LQRILVPPVR VIARRSTDYR SLTDPAVIQA MHFIRNHACK
GIKVEQVLDA VGISRSNLEK RFKEEVGETI HALIHAEKLE KARSLLISTT LAINEISQMC
GYPSLQYFYS VFKKEYVTTP KEYRDQHSEA LL
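
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch, assuming the standard codon assignments for elongation, which translation table 11 shares with the standard code (table 11 differs mainly in which codons may act as start codons). The helper `translate` is hypothetical, and only the first and last rows of the gene sequence are used to keep the example short.

```python
from itertools import product

# NCBI-style codon ordering: bases in the order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Alternative start codons are not handled; this gene starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence (bases 1-60).
first_row = "ATGTTTGATA AACGTCACCG TATCACTCTG TTATTTAACG CGAATAAAGC CTATGACCGT"
# Last row (bases 1141-1179); it is in frame because 1140 is a multiple of 3.
last_row = "AAGGAGTATC GCGACCAGCA TAGTGAAGCG TTGTTGTAG"

print(translate(first_row))  # MFDKRHRITLLFNANKAYDR
print(translate(last_row))   # KEYRDQHSEALL
```

Both outputs match the corresponding start and end of the protein sequence listed above.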