Gene SeSA_A3859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3859 
Symbol 
ID6515479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp3726346 
End bp3727524 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content53% 
IMG OID642748832 
Productxylose operon regulatory protein 
Protein accessionYP_002116595 
Protein GI194736118 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators
[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.897217 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGATA AACGTCACCG CATCACTCTG TTATTTAACG CGAATAAAGC CTATGACCGT 
CAGGTAGTGG AGGGGGTGGG TGAATATTTA CAAGCCTCGC AATCCGAATG GGATATATTT
ATTGAGGAAG ATTTCCGTGC CCGTATCGAT AACATTAAAG AGTGGTTAGG CGACGGCGTT
ATTGCCGATT ACGATGATGA CGATATCGCG CAATTATTGG CCGATGTCGA CGTACCCATT
GTCGGGGTCG GCGGTTCTTA CCATCTTGCT GAAAATTATC CCGCCGTTCA TTACATCGCC
ACCGATAATC ATGCGCTCGT TGAAAGCGCT TTCCTGCATT TAAAAGAAAA AGGCGTTAAC
CGCTTCGCGT TTTACGGTTT GCCCGACTCC AGCCGCAAAC ATTGGGCGGC GGAACGGGAA
TACGCCTTTC GCCAGCTGGT CGCCGAGGAA AAATACCGCG GCGTAGTCTA TCAGGGGCTG
GAAACCGCGC CGGAAAACTG GCAGCACGCG CAAAATCGCC TCGCCGACTG GCTTCAGACG
CTGCCGCCGC AAACCGGCAT CATTGCCGTA ACGGATGCCC GCGCCCGTCA CGTATTGCAG
GCCTGTGAAC ACCTGCATAT TCCGGTGCCG GAAAAACTTT GCGTTATCGG TATTGATAAC
GAAGAGTTAA CCCGTTATCT GTCGCGCGTC GCGCTTTCCT CCGTCGCGCA GGGGGCGCGG
CAAATGGGCT ATCAGGCGGC GAAGCTGCTG CACCGTTTGC TGGCGCGCGA AGAGATGCCG
TTACAGCGCA TTCTGGTGCC GCCGGTGCGC GTCATTGCGC GCCGCTCGAC AGACTATCGC
TCCCTGACCG ATCCGGCGGT TATCCAGGCG ATGCACTTTA TTCGTAACCA TGCCTGTAAG
GGCATTAAAG TCGAGCAGGT GCTGGATGCG GTTGGGATTT CACGTTCAAA CCTGGAAAAA
CGTTTTAAGG AAGAAGTTGG CGAGACGATA CATGCGCTGA TCCACGCCGA AAAGCTGGAA
AAAGCGCGTA GTTTGTTGAT TTCTACCACG TTGGCGATAA ACGAAATTTC GCAAATGTGC
GGCTACCCGT CACTGCAATA TTTCTATTCG GTGTTTAAAA AGGAGTACGT CACTACGCCT
AAGGAGTATC GCGACCAGCA TAGTGAAGCG TTGTTGTAG
 
Protein sequence
MFDKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRARID NIKEWLGDGV 
IADYDDDDIA QLLADVDVPI VGVGGSYHLA ENYPAVHYIA TDNHALVESA FLHLKEKGVN
RFAFYGLPDS SRKHWAAERE YAFRQLVAEE KYRGVVYQGL ETAPENWQHA QNRLADWLQT
LPPQTGIIAV TDARARHVLQ ACEHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGAR
QMGYQAAKLL HRLLAREEMP LQRILVPPVR VIARRSTDYR SLTDPAVIQA MHFIRNHACK
GIKVEQVLDA VGISRSNLEK RFKEEVGETI HALIHAEKLE KARSLLISTT LAINEISQMC
GYPSLQYFYS VFKKEYVTTP KEYRDQHSEA LL