Gene Information |
Locus tag | EcSMS35_4046 |
Symbol | |
ID | 6144970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4136540 |
End bp | 4137433 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641618870 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001746008 |
Protein GI | 170679673 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
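The length fields in the record above follow from the coordinates by simple arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (both assumptions are consistent with the values listed, but the record does not state them). The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on the replicon.
START_BP = 4136540
END_BP = 4137433


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate interval."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))
```

With the listed coordinates this gives 894 bp and 297 aa, matching the Gene Length and Protein Length fields.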
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGGAA AATTGCAAAG TTCGGATGTA AAAAACGAAA CTCCGTACAA TATTCCCTTA TTGATTAACG AAAATGTGAT CTCCAGCGGA ATTTCTCTGA TCTCGCTCTG GCATACCTAC GCCGACGAGC ATTACCGGGT GATCTGGCCG CGGGACAAGA AGAAACCGCT TATCGCCAAC TCATGGGTTG CGGTGTATAC CGTACAAGGA TGTGGGAAAA TTATTTTAAA GAATGGTGAA CAAATAACAC TGCATGGCAA CTGTATTATA TTTTTAAAGC CAATGGATAT TCACTCTTAT CACTGTGAAG GTTTAGTATG GGAACAGTAC TGGATGGAAT TTACCCCAAC CAGTATGATG GATATTCCCG TTGGTCAGCA AAGCGTTATT TATAATGGCG AAATTTATAA TCAGGAACTC ACCGAAGTTG CTGAGTTAAT AACTTCACCA GAAGCAATAA AAAATAATCT GGCAGTCGCT TTTCTGACGA AAATTATTTA TCAGTGGATT TGTCTTATGT ACGCAGACGG TAAAAAAGAT CCACAACGGC GGCAAATTGA AAATTTAATT GCCACTTTAC ATGCCAGTCT GCAACAACGC TGGAGCGTAG CTGATATGGC TGCCACGATC CCCTGTAGCG AAGCCTGGTT GCGTCGTCTG TTTTTACGCT ATACCGGCAA GACGCCGAAA GAATATTACC TCGATGCGCG TCTGGATCTG GCGCTATCGC TATTAAAACA ACAAGGAAAC TCGGTTGGCG AAGTCGCTGA TACGCTCAAC TTCTTCGACT CCTTTCATTT CAGCAAAGCC TTTAAAAATA AATTTGGTTA TGCGCCGTCA GCTGTGCTGA AGAATACGGA CCAGCACCCA ACGGATGCCA GTCCACACAA TTAA
|
Protein sequence | MNGKLQSSDV KNETPYNIPL LINENVISSG ISLISLWHTY ADEHYRVIWP RDKKKPLIAN SWVAVYTVQG CGKIILKNGE QITLHGNCII FLKPMDIHSY HCEGLVWEQY WMEFTPTSMM DIPVGQQSVI YNGEIYNQEL TEVAELITSP EAIKNNLAVA FLTKIIYQWI CLMYADGKKD PQRRQIENLI ATLHASLQQR WSVADMAATI PCSEAWLRRL FLRYTGKTPK EYYLDARLDL ALSLLKQQGN SVGEVADTLN FFDSFHFSKA FKNKFGYAPS AVLKNTDQHP TDASPHN
|
| |
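The gene and protein sequences above can be cross-checked against each other, and the GC content recomputed, with a short script. This is a sketch using only the Python standard library, not the method the database itself uses. The helper names `translate` and `gc_content` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits, so a standard codon table is assumed to be sufficient for this record, whose CDS begins with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: adequate for translation table 11, which shares these
# codon-to-amino-acid assignments with the standard code.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def _clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base to the first stop codon."""
    dna = _clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = _clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record above.
    print(translate("ATGAATGGAA AATTGCAAAG TTCGGATGTA"))
```

Pasting the full Gene sequence field into `translate` and comparing the result with the Protein sequence field (after removing spaces) is one way to confirm that the two fields agree; `gc_content` on the same string can be compared with the GC content field.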