Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0649 |
Symbol | |
ID | 6145484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 661027 |
End bp | 661980 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641615539 |
Product | putative DNA-binding transcriptional regulator |
Protein accession | YP_001742745 |
Protein GI | 170679981 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00471226 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATAGTA ATAATCAAAT TGAAACCTGT TTAAGCAGAA AGTCGTCAGA AGGTAAACCA CAAATATTTA CAACCTTACG AAATATTGAT CTTAACCTTC TGACTATTTT TGAAGCTGTA TATGTACATA AAGGGATCGT TAATGCAGCG AAAGTGCTTA ATCTGACACC CTCGGCAATC AGTCAGTCTA TTCAGAAACT GCGCGTTATA TTCCCTGACC CATTGTTTAT TCGCAAAGGC CAGGGTGTCA CTCCTACCGC ATTTGCGATG CATCTACATG AGTATATCAG TCAGGGCCTT GAGTCCATTC TTGGCGCGCT GGATATCGAA GGAAGCTATG ATAAGCAACG AACGATAACT ATTGCGACCA CCCCCTCGGT CGGAGCCCTG GTCCTTCCTG TCATCTATCG GGCGATTAAA TCTCACTATC CGCAGCTTTT ACTGCGCAAC CCACCCACCA GCGACGCGGA AAACCAACTC AGTCAGTTTC AAACCGATCT CATCATCGAT AACATGTTTT GCACCAATCG TACGGTGCAA CATCATGTTC TGTTCACCGA CAATATGGTG TTAATTTGCC GTGAGGGAAA TCCACTACTC TCTTTAGAAG ATGACAGAGA GACTATCGAC AACGCTGCGC ATATACTCCT GTTACCGGAA GGGCAAAATT TCAGCGGTCT GCGGCAGAGA GTTCAAGAGA TGTTTCCGGA CCGGCAAATC AGTTTCACCA GCTACAACAT TTTGACAATC GCTGCACTGG TTGCCAACAG TGACATGTTA GCGATTATTC CCAGCCGTTT TTATAACCTG TTTAGCCGTT GTTGGCCGCT AGAAAAACTG CCTTTTCCGT CCTTAAATGA GGAGCAAATA GATTTCTCTA TCCACTACAA CAAATTCAGC CTGCGTGATC CCATCCTGCA CGGGGTAATA GATGTTATCC GGAATGCATT TTGA
|
Protein sequence | MDSNNQIETC LSRKSSEGKP QIFTTLRNID LNLLTIFEAV YVHKGIVNAA KVLNLTPSAI SQSIQKLRVI FPDPLFIRKG QGVTPTAFAM HLHEYISQGL ESILGALDIE GSYDKQRTIT IATTPSVGAL VLPVIYRAIK SHYPQLLLRN PPTSDAENQL SQFQTDLIID NMFCTNRTVQ HHVLFTDNMV LICREGNPLL SLEDDRETID NAAHILLLPE GQNFSGLRQR VQEMFPDRQI SFTSYNILTI AALVANSDML AIIPSRFYNL FSRCWPLEKL PFPSLNEEQI DFSIHYNKFS LRDPILHGVI DVIRNAF
|
| |