Gene SNSL254_A3940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3940 
Symbol 
ID6484852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3820503 
End bp3821681 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content53% 
IMG OID642739200 
Productxylose operon regulatory protein 
Protein accessionYP_002042910 
Protein GI194444174 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators
[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value0.688005 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGATA AACGTCACCG CATCACTCTG TTATTTAACG CGAATAAAGC CTATGACCGT 
CAGGTAGTGG AGGGGGTGGG TGAATATTTA CAAGCCTCGC AATCCGAATG GGATATATTT
ATTGAGGAAG ATTTCCGTGC CCGTATCGAT AACATTAAAG AGTGGTTAGG CGACGGCGTT
ATTGCCGATT ACGATGATGA CGATATCGCG CAATTATTGG CCGATGTCGA CGTCCCCATT
GTCGGGGTCG GCGGTTCTTA CCATCTTGCT GAAAATTATC CCGCCGTTCA TTACATCGCC
ACCGATAATC ATGCGCTCGT TGAAAGCGCT TTCCTGCATT TAAAAGAAAA AGGCGTTAAC
CGCTTCGCGT TTTACGGTTT GCCCGACTCC AGCCGCAAAC ATTGGGCGGC GGAACGGGAA
TACGCCTTTC GCCAGCTGGT CGCCGAGGAA AAATACCGCG GCGTAGTCTA TCAGGGACTG
GAAACCGCGC CGGAAAACTG GCAGCACGCG CAAAATCGCC TCGCCGACTG GCTTCAGACG
CTGCCGCCGC AAACCGGCAT CATTGCCGTA ACGGATGCCC GCGCCCGTCA CGTATTGCAG
GCCTGTGAAC ACCTGCATAT TCCGGTGCCG GAAAAACTTT GCGTTATCGG TATTGATAAC
GAAGAGTTAA CCCGTTATCT GTCGCGCGTC GCGCTTTCCT CCGTCGCGCA GGGGGCGCGG
CAAATGGGCT ATCAGGCGGC GAAGCTGCTG CACCGTTTGC TGGCGCGCGA AGAGATGCCG
TTACAGCGCA TTCTGGTGCC GCCGGTGCGC GTCATTGCGC GCCGCTCGAC AGACTATCGC
TCCCTGACCG ATCCGGCGGT TATCCAGGCG ATGCACTTTA TTCGTAACCA TGCCTGTAAG
GGCATTAAAG TCGAGCAGGT GCTGGACGCG GTTGGGATTT CACGTTCAAA CCTGGAAAAA
CGTTTTAAGG AGGAAGTTGG CGAGACGATA CATGCGCTGA TCCACGCCGA AAAGCTGGAA
AAAGCGCGTA GTTTGTTGAT TTCTACCACG TTGGCGATAA ACGAAATTTC GCAAATGTGC
GGCTACCCGT CACTGCAATA TTTCTATTCG GTGTTTAAAA AGGAGTACGT AACTACGCCT
AAGGAGTATC GCGACCAGCA TAGTGAAGCG TTGTTGTAG
 
Protein sequence
MFDKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRARID NIKEWLGDGV 
IADYDDDDIA QLLADVDVPI VGVGGSYHLA ENYPAVHYIA TDNHALVESA FLHLKEKGVN
RFAFYGLPDS SRKHWAAERE YAFRQLVAEE KYRGVVYQGL ETAPENWQHA QNRLADWLQT
LPPQTGIIAV TDARARHVLQ ACEHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGAR
QMGYQAAKLL HRLLAREEMP LQRILVPPVR VIARRSTDYR SLTDPAVIQA MHFIRNHACK
GIKVEQVLDA VGISRSNLEK RFKEEVGETI HALIHAEKLE KARSLLISTT LAINEISQMC
GYPSLQYFYS VFKKEYVTTP KEYRDQHSEA LL