Gene SeD_A0108 details

Gene Information

Locus tag: SeD_A0108
Symbol:
ID: 6873341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 116937
End bp: 117782
Gene Length: 846 bp
Protein Length: 281 aa
Translation table: 11
GC content: 54%
IMG OID: 642783361
Product: DNA-binding transcriptional regulator AraC
Protein accession: YP_002214055
Protein GI: 198244573
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
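
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 116937
end_bp = 117782
protein_length_aa = 281

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(gene_length_bp, gene_length_bp % 3, expected_protein_aa)
# prints: 846 0 281
```

Under these assumptions the computed values agree with the listed gene length (846 bp) and protein length (281 aa).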


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAAA CGCAAAATGA TCCGCTATTG CCGGGATATT CATTTAATGC CCATCTGGTC 
GCCGGGCTGA CGCCAATTGA AGCGAATGGA TATCTGGATT TTTTTATCGA TCGTCCGTTG
GGCATGAAGG GATATATTCT TAACCTGACC ATCCGCGGAG AGGGCGTCAT TAATAATAAT
GGCGAGCAGT TTGTCTGTCG GCCTGGAGAT ATATTATTGT TTCCGCCGGG CGAGATTCAT
CACTATGGAC GGCATCCGGA CGCCAGCGAG TGGTATCACC AGTGGGTTTA TTTCCGGCCT
CGCGCCTACT GGCAGGAGTG GCTGACCTGG CCGACAATCT TTGCCCAGAC AGGATTTTTC
CGCCCGGACG AGGCGCGCCA GCCGCATTTC AGCGAACTGT TCGGGCAGAT CATCAGCGCC
GGGCAAGGGG AAGGCCGCTA TTCTGAGCTA CTGGCGATCA ATCTGCTGGA GCAGTTGTTG
CTCAGACGTA TGGCGGTAAT TAATGAGTCG TTGCATCCGC CGATGGATAG CCGTGTGCGC
GATGCCTGCC AGTATATCAG CGACCATCTG GCGGACAGCC ATTTTGATAT CGCCAGCGTC
GCCCAGCATG TCTGCCTGTC GCCCTCCCGG TTATCGCATC TGTTCCGCCA GCAGTTAGGC
ATTAGCGTAT TGAGTTGGCG CGAAGATCAG CGTATCAGCC AGGCGAAACT CCTGCTTAGC
ACCACGCGAA TGCCGATAGC GACCGTTGGG CGCAATGTTG GATTTGACGA TCAGCTCTAT
TTTTCGCGGG TATTTAAAAA ATGCACCGGG GCAAGTCCTA GCGAGTTCAG GGCCGGATGT
GAATAA
 
Protein sequence
MAETQNDPLL PGYSFNAHLV AGLTPIEANG YLDFFIDRPL GMKGYILNLT IRGEGVINNN 
GEQFVCRPGD ILLFPPGEIH HYGRHPDASE WYHQWVYFRP RAYWQEWLTW PTIFAQTGFF
RPDEARQPHF SELFGQIISA GQGEGRYSEL LAINLLEQLL LRRMAVINES LHPPMDSRVR
DACQYISDHL ADSHFDIASV AQHVCLSPSR LSHLFRQQLG ISVLSWREDQ RISQAKLLLS
TTRMPIATVG RNVGFDDQLY FSRVFKKCTG ASPSEFRAGC E
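
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not a full translation tool: it builds the standard codon table by hand (to the best of our knowledge, translation table 11 assigns the same amino acids to codons as the standard code and differs mainly in which start codons are permitted), and it translates only the first and last lines of the gene sequence as listed above. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCTGAAA CGCAAAATGA TCCGCTATTG CCGGGATATT CATTTAATGC CCATCTGGTC"
last_line = "GAATAA"

print(translate(first_line))  # prints: MAETQNDPLLPGYSFNAHLV
print(translate(last_line))   # prints: E*
```

The first 20 residues match the start of the listed protein sequence, and the final six bases encode the terminal glutamate followed by a stop codon.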