Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0541 |
Symbol | |
ID | 5586407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 566250 |
End bp | 567176 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640924263 |
Product | DNA-binding transcriptional activator AllS |
Protein accession | YP_001461690 |
Protein GI | 157156027 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGATC CAGAAACCTT GCGGACTTTC ATTGCGGTTG CTGAAACAGG AAGTTTTTCA AAAGCGGCAG AACGATTATG TAAAACCACG GCGACGATCA GTTATCGCAT TAAACTTCTG GAAGAGAATA CCGGAGTAGC GCTGTTTTTC CGCACGACTC GCAGCGTGAC GTTGACAGCG GCTGGCGAGC ATCTACTTTC CCAGGCCAGA GACTGGCTGA GCTGGCTGGA AAGTATGCCA AGCGAGCTGC AACAGGTGAA TGATGGCGTG GAACGCCAGG TGAATATTGT CATCAACAAC CTGCTCTACC ACCCCCAGGC CGTCGCCCAG TTGCTGGCGT GGCTGAATGA ACGTTACCCC TTTACCCAGT TTCACATCTC CCGACAAATC TATATGGGCG TCTGGGATTC GCTATTATAC GAAGGTTTTT CACTGGCTAT CGGCGTCACG GGAACCGAGG CGCTGGCAAA TACCTTTAGT CTTGATCCCT TAGGATCGGT GCAATGGCGC TTTGTCATGG CGGCGGATCA TCCGCTGGCG AACGTTGAAG AGCCGCTAAC AGAAGCGCAG TTGCGGCGCT TTCCGGCGGT CAATATTGAA GACAGCGCCC GCACCTTAAC CAAACGCGTC GCCTGGCGAT TGCCTGGGCA AAAAGAGATT ATTGTTCCTG ATATGGAAAC GAAAATCGCC GCCCATCTTG CGGGCGTTGG CATTGGTTTT TTGCCAAAAT CGCTTTGCCA GTCAATGATC GATAATCAAC AACTAGTTAG CCGGGTAATC CCAACGATGC GCCCTCCTTC GCCATTGAGT CTGGCATGGC GCAAATTTGG CAGCGGCAAA GCGGTAGAAG ATATTGTGAC CTTGTTTACC CAGCGCAGGC CGGAAATCAG CGGATTTTTA GAAATTTTCG GCAACCCACG CAGTTAA
|
Protein sequence | MFDPETLRTF IAVAETGSFS KAAERLCKTT ATISYRIKLL EENTGVALFF RTTRSVTLTA AGEHLLSQAR DWLSWLESMP SELQQVNDGV ERQVNIVINN LLYHPQAVAQ LLAWLNERYP FTQFHISRQI YMGVWDSLLY EGFSLAIGVT GTEALANTFS LDPLGSVQWR FVMAADHPLA NVEEPLTEAQ LRRFPAVNIE DSARTLTKRV AWRLPGQKEI IVPDMETKIA AHLAGVGIGF LPKSLCQSMI DNQQLVSRVI PTMRPPSPLS LAWRKFGSGK AVEDIVTLFT QRRPEISGFL EIFGNPRS
|
| |