Gene SeD_A4014 details

Gene Information

Locus tag: SeD_A4014
Symbol:
ID: 6871204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3859017
End bp: 3860024
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 60%
IMG OID: 642786968
Product: regulatory protein LacI:Periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_002217596
Protein GI: 198243857
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
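
The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. The variable names are illustrative and not part of the record. The sketch assumes that the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 3859017
end_bp = 3860024

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1008 335
```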


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0637146
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGTTC AAAATAAAAA ACGCGCAAAG TTGATTGATG TTGCCCGCCA TGCAGGCGTA 
TCGCCAGGGA CGGTATCCAA TGCATTGCAC AACACCCGCT TTGTCGAGCC GCAGACGCGA
CGGCGTATTG AAGAGGCCAT TGTTGCGCTC AACTACACGC CGAATATTCG CGCCCGCCAG
TTGCGAACCG GCAAAACCAA TACCATTGCC TTGCTCTCTT CGGTGCCGCT GGCGATTGCC
TCCGGCGCGT CACGACTGGG ATTTATGATG GAGGTGGCGT TAACGTCCGC GATGATGGCG
CTGGAAAAAC AGCATGCGCT GATTCTGGTG CCGCCGGGGG CAAATCCACT GGATGCCGTC
AGCTTTGACG CGGCGATCCT GATTGAGCCG GCGGAGAACG ACCCGCAGCT CCAGGCGCTG
GCGCAAGCGG GCATTCCCTG CGTCACCATT GGCCGCACGC CGGGGACCGA CACGCCTGTG
CCGTGGGTGG AGCTGCACTC GGCGGCAACA GCACAGCTTC TGCTAACGCA TCTGGAGGCC
TCCGGCGCCA GCAAATGTGC GTTATTTGTC GGTAACACAC GGCGAACATC AGTTCTGGAG
AGCGAAGCGG CTTACCAGCG CTGGTGCGCG GGACGCCAGG CCCCCGTCGT CTACTCTCTC
AATGAAAGCG AGGGTGAAAA TGCCGGCTAC CAGGCCGCGC AGCAGCTATT ACAGGCACAT
CCCGACGTTG ACGGCGTGCT GGTGCTGATC GACACCTTTG CCAGCGGCGC GGTACGCGCT
TTTCAGGAAC AAGACATCGC CATACCTGAA CAAATGCGGG TGGTCACCCG CTATGATGGT
ATCCGCGCGC GCGAATCGCT GCCGCCGCTG ACGGCAGTGA ATATGCATCT TGATGAGGTG
GCGCGACAGG CAATCACGCT CCTGTTTGCC GTTCTGTCGG GTGAGAAGGT CAGCTACAGC
GACGGGATCA TGCCTGAACT GGTGGTGCGA GCGTCAACCT GCCGGTGA
 
Protein sequence
MAVQNKKRAK LIDVARHAGV SPGTVSNALH NTRFVEPQTR RRIEEAIVAL NYTPNIRARQ 
LRTGKTNTIA LLSSVPLAIA SGASRLGFMM EVALTSAMMA LEKQHALILV PPGANPLDAV
SFDAAILIEP AENDPQLQAL AQAGIPCVTI GRTPGTDTPV PWVELHSAAT AQLLLTHLEA
SGASKCALFV GNTRRTSVLE SEAAYQRWCA GRQAPVVYSL NESEGENAGY QAAQQLLQAH
PDVDGVLVLI DTFASGAVRA FQEQDIAIPE QMRVVTRYDG IRARESLPPL TAVNMHLDEV
ARQAITLLFA VLSGEKVSYS DGIMPELVVR ASTCR
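
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method the database used. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 allows.

```python
# Standard codon assignments with bases ordered T, C, A, G.
# Assumption: translation table 11 uses this same mapping for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGCTGTTC AAAATAAAAA ACGCGCAAAG TTGATTGATG TTGCCCGCCA TGCAGGCGTA"
print(translate(first_line))  # MAVQNKKRAKLIDVARHAGV
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.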