Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2592 |
Symbol | |
ID | 6143807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2645268 |
End bp | 2646320 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617463 |
Product | transcriptional regulator EutR |
Protein accession | YP_001744628 |
Protein GI | 170680296 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA CCCGTACAGC CAATTTGCAC CATCTTTATC ATGAACCCTT ACCCGAAAAC CTGAAGCTCA CGCCGAAGGT CGAAGTGGAT AATGTTCATC AACGACAGAC AACGGATGTC TATGAACATG CTTTGACAAT TACCGCCTGG CAGCAGATTT ACGATCAGCT GCATCCGGGC AAGTTTCATG GTGAATTTAC GGAAATTCTA CTCGATGATA TTCAGGTTTT TCGTGAATAC ACTGGTCTGG CGCTGCGTCA GTCGTGCCTG GTCTGGCCGA ACTCGTTCTG GTTTGGCATT CCGGCGACGC GCGGTGAGCA GGGATTTATC GGTTCGCAAT GTCTGGGAAG CGCAGAAATT GCGACGCGCC CGGGTGGAAC CGAGTTTGAA TTAAGTACGC CGGACGATTA CACGATCCTG GGCGTAGTGC TTTCTGAAGA TGTCATCACT CGTCAGGCTA ACTTTTTGCA TAATCCGGAT CGGGTATTAC ATATGCTGCG TAGCCAGTCG GCGTTGGAAG TGAAAGAGCA GCATAAAGCC GCGCTGTGGG GCTTTGTCCA ACAGGCGCTG GCGACATTTT GCGAGAACCC GGAAAATCTT CATCAGCCTG CTGTGCGTAA AGTGCTGGGG GATAATTTGC TCATGGCGAT GGGGGCGATG CTGGAAGAAG CGCAGCCAAT GGTGACGGCG GAAAGCATCA GTCATCAGAG TTACCGTCGA TTGCTTTCCC GCGCCCGTGA ATATGTGCTG GAAAACATGT CTGAACCGGT AACGGTGTTG GAGTTGTGCA ATCAATTGCA TGTCAGCCGC CGCACGCTAC AAAACGCGTT TCACGCTATT TTAGGCATTG GCCCAAACGC GTGGCTGAAA CGCATTCGCC TGAACGCCGT ACGCCGCGAA CTGATAAGCC CGTGGTCGCA AAGCACAACG GTAAAAGACG CCGCCATGCA GTGGGGATTC TGGCATCTGG GGCAATTTGC CACGGATTAC CAGCAGCTGT TTGCCGAGAA GCCGTCACTG ACGCTGCATC AGCGGATGCG GGAGTGGGGG TGA
|
Protein sequence | MKKTRTANLH HLYHEPLPEN LKLTPKVEVD NVHQRQTTDV YEHALTITAW QQIYDQLHPG KFHGEFTEIL LDDIQVFREY TGLALRQSCL VWPNSFWFGI PATRGEQGFI GSQCLGSAEI ATRPGGTEFE LSTPDDYTIL GVVLSEDVIT RQANFLHNPD RVLHMLRSQS ALEVKEQHKA ALWGFVQQAL ATFCENPENL HQPAVRKVLG DNLLMAMGAM LEEAQPMVTA ESISHQSYRR LLSRAREYVL ENMSEPVTVL ELCNQLHVSR RTLQNAFHAI LGIGPNAWLK RIRLNAVRRE LISPWSQSTT VKDAAMQWGF WHLGQFATDY QQLFAEKPSL TLHQRMREWG
|
| |