Gene Information |
Locus tag | SeD_A4014 |
Symbol | |
ID | 6871204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3859017 |
End bp | 3860024 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642786968 |
Product | regulatory protein LacI:Periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_002217596 |
Protein GI | 198243857 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
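The length fields above follow from the coordinates. The sketch below is a minimal check, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. Both assumptions are consistent with the reported values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is
    included in the gene length and is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3859017, 3860024)
print(length)                  # 1008
print(protein_length(length))  # 335
```

Both results agree with the Gene Length (1008 bp) and Protein Length (335 aa) rows.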
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0637146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTTC AAAATAAAAA ACGCGCAAAG TTGATTGATG TTGCCCGCCA TGCAGGCGTA TCGCCAGGGA CGGTATCCAA TGCATTGCAC AACACCCGCT TTGTCGAGCC GCAGACGCGA CGGCGTATTG AAGAGGCCAT TGTTGCGCTC AACTACACGC CGAATATTCG CGCCCGCCAG TTGCGAACCG GCAAAACCAA TACCATTGCC TTGCTCTCTT CGGTGCCGCT GGCGATTGCC TCCGGCGCGT CACGACTGGG ATTTATGATG GAGGTGGCGT TAACGTCCGC GATGATGGCG CTGGAAAAAC AGCATGCGCT GATTCTGGTG CCGCCGGGGG CAAATCCACT GGATGCCGTC AGCTTTGACG CGGCGATCCT GATTGAGCCG GCGGAGAACG ACCCGCAGCT CCAGGCGCTG GCGCAAGCGG GCATTCCCTG CGTCACCATT GGCCGCACGC CGGGGACCGA CACGCCTGTG CCGTGGGTGG AGCTGCACTC GGCGGCAACA GCACAGCTTC TGCTAACGCA TCTGGAGGCC TCCGGCGCCA GCAAATGTGC GTTATTTGTC GGTAACACAC GGCGAACATC AGTTCTGGAG AGCGAAGCGG CTTACCAGCG CTGGTGCGCG GGACGCCAGG CCCCCGTCGT CTACTCTCTC AATGAAAGCG AGGGTGAAAA TGCCGGCTAC CAGGCCGCGC AGCAGCTATT ACAGGCACAT CCCGACGTTG ACGGCGTGCT GGTGCTGATC GACACCTTTG CCAGCGGCGC GGTACGCGCT TTTCAGGAAC AAGACATCGC CATACCTGAA CAAATGCGGG TGGTCACCCG CTATGATGGT ATCCGCGCGC GCGAATCGCT GCCGCCGCTG ACGGCAGTGA ATATGCATCT TGATGAGGTG GCGCGACAGG CAATCACGCT CCTGTTTGCC GTTCTGTCGG GTGAGAAGGT CAGCTACAGC GACGGGATCA TGCCTGAACT GGTGGTGCGA GCGTCAACCT GCCGGTGA
|
Protein sequence | MAVQNKKRAK LIDVARHAGV SPGTVSNALH NTRFVEPQTR RRIEEAIVAL NYTPNIRARQ LRTGKTNTIA LLSSVPLAIA SGASRLGFMM EVALTSAMMA LEKQHALILV PPGANPLDAV SFDAAILIEP AENDPQLQAL AQAGIPCVTI GRTPGTDTPV PWVELHSAAT AQLLLTHLEA SGASKCALFV GNTRRTSVLE SEAAYQRWCA GRQAPVVYSL NESEGENAGY QAAQQLLQAH PDVDGVLVLI DTFASGAVRA FQEQDIAIPE QMRVVTRYDG IRARESLPPL TAVNMHLDEV ARQAITLLFA VLSGEKVSYS DGIMPELVVR ASTCR
|
| |
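The sequence fields can be cross-checked against the Translation table and GC content rows. The sketch below is a hedged illustration, not the database's own method: it strips the 10-character block spacing, computes the G+C fraction, and translates a CDS. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs only in the alternative start codons it allows, which does not matter here because this gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(("".join(c) for c in product(_BASES, repeat=3)), _AMINO))


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last 18 bases of the gene sequence, copied from the record.
print(translate("ATGGCTGTTC AAAATAAA"))  # MAVQNK
print(translate("GCGTCAACCT GCCGGTGA"))  # ASTCR
```

The two fragments reproduce the first six and last five residues of the protein sequence. Passing the full Gene sequence field to `translate` and `gc_fraction` applies the same check to the whole record.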