Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4078 |
Symbol | |
ID | 6145099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4169433 |
End bp | 4170392 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618902 |
Product | DNA-binding transcriptional regulator YidZ |
Protein accession | YP_001746040 |
Protein GI | 170681541 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT CCATCACCAC GCTTGATCTC AATCTGCTGC TCTGTCTGCA ACTGCTGATG CAGGAGCGTA GCGTGACCAA AGCGGCGAAG CGGATGAACG TGACCCCTTC GGCGGTGAGT AAGTCGCTGG CAAAGTTAAG AGCGTGGTTT GACGACCCGC TCTTTGTGAA CTCACCGCTG GGGCTGTCGC CCACACCGCT AATGGTCAGC ATGGAGCAAA ATCTGGCGGA GTGGATGCAA ATGAGCAACC TGCTGCTGGA TAAACCGCAC CACCAGACAC CGCGCGGCCT GAAGTTTGAG CTGGCAGCAG AATCACCGCT AATGATGATC ATGCTTAATG CGCTGTCGAA ACGGATCTAC CAACGTTACC CGCAGGCGAC CATCAAATTA CGTAACTGGG ATTACGATTC CTTAGATGCC ATTACTCGTG GTGAAGTGGA TATTGGTTTT TCCGGTCGCG AAAGTCATCC CCGTTCGCGG GAGCTGTTAA GCTCGCTACC GTTAGCCATT GATTATGAAG TGCTGTTTAG TGATGTGCCC TGCGTCTGGT TACGCAAAGA TCATCCGGCA CTGCATGAAG CATGGAATCT GGACACCTTT TTACGTTATC CGCATATCAG CATTTGCTGG GAACAGAGCG ATACCTGGGC GCTGGACAAT GTGTTACAGG AGCTGGGGCG CGAACGTACT ATTGCAATGA GCCTGCCGGA ATTCGAGCAG TCACTGTTTA TGGCAGCGCA ACCCGACAAT CTACTACTGG CGACCGCGCC GCGCTACTGT CAGTACTACA ATCAACTCCA TCAACTGCCG TTAGTTGCTC TTCCTCTCCC ATTTGATGAA ATCCAGCAAA AAAAGCTGGA AGTCCCTTTT ACCCTTCTGT GGCATAAACG GAACAGCCGT AATCCGAAGA TCGTCTGGTT ACGGGAAACC ATTAAAAATC TTTACGCGTC GATGGCATAA
|
Protein sequence | MKKSITTLDL NLLLCLQLLM QERSVTKAAK RMNVTPSAVS KSLAKLRAWF DDPLFVNSPL GLSPTPLMVS MEQNLAEWMQ MSNLLLDKPH HQTPRGLKFE LAAESPLMMI MLNALSKRIY QRYPQATIKL RNWDYDSLDA ITRGEVDIGF SGRESHPRSR ELLSSLPLAI DYEVLFSDVP CVWLRKDHPA LHEAWNLDTF LRYPHISICW EQSDTWALDN VLQELGRERT IAMSLPEFEQ SLFMAAQPDN LLLATAPRYC QYYNQLHQLP LVALPLPFDE IQQKKLEVPF TLLWHKRNSR NPKIVWLRET IKNLYASMA
|
| |