Gene SeHA_C3985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3985 
Symbol 
ID6487718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3861243 
End bp3862421 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content53% 
IMG OID642744086 
Productxylose operon regulatory protein 
Protein accessionYP_002047691 
Protein GI194448031 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators
[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGATA AACGTCACCG CATCACTCTG TTATTTAACG CGAATAAAGC CTATGACCGT 
CAGGTAGTGG AGGGGGTGGG TGAATATTTA CAAGCCTCGC AATCCGAATG GGATATATTT
ATTGAGGAAG ATTTCCGTGC CCGTATCGAT AACATTAAAG AGTGGTTAGG CGACGGCGTT
ATTGCCGATT ACGATGATGA CGATATCGCG CAATTATTAG CCGATGTCGA CGTACCCATT
GTCGGGGTCG GCGGTTCTTA CCATCTTGCT GAAAATTATC CCGCCGTTCA TTACATCGCC
ACCGATAATC ATGCGCTCGT TGAAAGCGCT TTCCTGCATT TAAAAGAAAA AGGCGTCAAC
CGCTTCGCGT TTTACGGTTT GCCCGACTCC AGCCGTAAAC ATTGGGCGGC GGAACGGGAA
TACGCCTTTC GCCAGCTGGT CGCCGAGGAA AAATACCGCG GCGTAGTCTA TCAGGGGCTG
GAAACCGCGC CGGAAAACTG GCAGCACGCG CAAAATCGCC TCGCCGACTG GCTTCAGACG
CTGCCGCCGC AAACCGGCAT CATTGCCGTA ACGGATGCCC GCGCCCGTCA CGTATTGCAG
GCCTGTGAAC ACCTGCATAT TCCGGTGCCG GAAAAACTTT GCGTTATCGG TATTGATAAC
GAAGAGTTAA CCCGTTATCT GTCGCGCGTC GCGCTTTCCT CCGTCGCGCA GGGGGCGCGG
CAAATGGGCT ATCAGGCGGC GAAGCTGCTG CACCGTTTGC TGGCGCGCGA AGAGATGCCG
TTACAGCGCA TTCTGGTGCC GCCGGTGCGC GTCATTGCGC GCCGCTCGAC AGACTATCGC
TCCCTGACCG ATCCGGCGGT TATCCAGGCG ATGCACTTTA TTCGTAACCA TGCCTGTAAG
GGCATTAAAG TCGAGCAGGT GCTGGACGCG GTTGGGATTT CACGTTCAAA CCTGGAAAAA
CGTTTTAAGG AAGAAGTTGG CGAGACGATA CATGCGCTGA TCCACGCCGA AAAGCTGGAA
AAAGCGCGTA GTTTGTTGAT TTCTACCACG TTGGCGATAA ACGAAATTTC GCAAATGTGC
GGCTACCCGT CACTGCAATA TTTCTATTCG GTGTTTAAAA AGGAGTACGT CACTACGCCG
AAGGAGTATC GCGACCAGCA TAGTGAAGCG TTGTTGTAG
 
Protein sequence
MFDKRHRITL LFNANKAYDR QVVEGVGEYL QASQSEWDIF IEEDFRARID NIKEWLGDGV 
IADYDDDDIA QLLADVDVPI VGVGGSYHLA ENYPAVHYIA TDNHALVESA FLHLKEKGVN
RFAFYGLPDS SRKHWAAERE YAFRQLVAEE KYRGVVYQGL ETAPENWQHA QNRLADWLQT
LPPQTGIIAV TDARARHVLQ ACEHLHIPVP EKLCVIGIDN EELTRYLSRV ALSSVAQGAR
QMGYQAAKLL HRLLAREEMP LQRILVPPVR VIARRSTDYR SLTDPAVIQA MHFIRNHACK
GIKVEQVLDA VGISRSNLEK RFKEEVGETI HALIHAEKLE KARSLLISTT LAINEISQMC
GYPSLQYFYS VFKKEYVTTP KEYRDQHSEA LL