Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0578 |
Symbol | |
ID | 5591475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 590872 |
End bp | 591798 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640919762 |
Product | DNA-binding transcriptional activator AllS |
Protein accession | YP_001457345 |
Protein GI | 157160027 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 61 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGATC CAGAAACCTT GCGGACTTTC ATTGCGGTTG CTGAAACAGG AGGTTTTTCA AAAGCGGCAG AACGATTATG TAAAACCACG GCGACGATCA GTTATCGCAT TAAACTTCTG GAAGAGAATA CCGGAGTAGC GCTGTTTTTC CGTACGACTC GCAGCGTGAC GTTGACAGCG GCTGGCGAGC ATCTACTTTC CCAGGCCAGA GACTGGCTGA GCTGGCTGGA AAGTATGCCA AGCGAGCTGC AACAGGTGAA TGATGGCGTG GAACGCCAGG TGAATATTGT CATCAACAAC CTGCTCTACA ACCCCCAGGC CGTCGCCCAG TTGCTGGCGT GGCTGAATGA GCGTTACCCC TTTACCCAGT TTCACATCTC CCGACAAATC TATATGGGCG TCTGGGACTC GCTATTGTAC GAAGGTTTTT CGCTGGCTAT CGGCGTCACG GGAACTGAGG CGCTGGCAAA TACCTTTAGT CTTGATCCCT TAGGATCGGT GCAATGGCGC TTTGTCATGG CGGCGGATCA TCCGCTGGCG AACGTTGAAG AGCCGCTAAC AGAAGCGCAG TTGCGGCGCT TTCCGGCGGT CAATATTGAA GACAGCGCCC GCACCTTAAC CAAACGCGTC GCCTGGCGAT TGCCAGGGCA AAAAGAGATT ATTGTTCCCG ATATGGAAAC GAAAATCGCC GCCCATCTGG CGGGCGTTGG CATTGGTTTT TTGCCAAAAT CGCTTTGCCA GTCAATGATC GATAATCAAC AACTGGTCAG CCGGGTAATC CCAACGATGC GCCCTCCTTC GCCATTGAGT CTGGCATGGC GCAAATTTGG CAGCGGCAAA GCGGTAGAAG ATATTGTGAC CTTGTTTACC CAGCGCAGGC CGGAAATCAG CGGATTTTTA GAAATTTTCG GCAACCCACG CAGTTAA
|
Protein sequence | MFDPETLRTF IAVAETGGFS KAAERLCKTT ATISYRIKLL EENTGVALFF RTTRSVTLTA AGEHLLSQAR DWLSWLESMP SELQQVNDGV ERQVNIVINN LLYNPQAVAQ LLAWLNERYP FTQFHISRQI YMGVWDSLLY EGFSLAIGVT GTEALANTFS LDPLGSVQWR FVMAADHPLA NVEEPLTEAQ LRRFPAVNIE DSARTLTKRV AWRLPGQKEI IVPDMETKIA AHLAGVGIGF LPKSLCQSMI DNQQLVSRVI PTMRPPSPLS LAWRKFGSGK AVEDIVTLFT QRRPEISGFL EIFGNPRS
|
| |