Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3189 |
Symbol | |
ID | 5593100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3199093 |
End bp | 3200049 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640922307 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001459805 |
Protein GI | 157162487 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 72 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTACAAA ATTGCGCACA ATCAAATTGC CGCATTATTC CTAAGAAATT ACGCGATATG AAACGTGAAG AGATTTGCCG CTTGCTGGCG GATAAAGTTA ATAAACTGAA AAATAAAGAA AATAGTTTGT CAGAACTGTT GCCCGATGTG CGTTTGTTGT ATGGCGAGAC ACCTTTCGCA CGTACACCGG TGATGTACGA GCCTGGCATC ATAATTCTCT TTTCCGGGCA TAAAATCGGT TATATCAATG AACGCGTGTT TCGTTATGAT GCCAATGAAT ACCTGCTGCT GACGGTGCCG TTGCCGTTTG AGTGCGAAAC CTATGCCACG TCAGAGGTGC CGCTGGCAGG GTTGCGTCTC AATGTCGATA TTTTGCAGTT ACAGGAACTG TTGATGGACA TTGGCGAAGA TGAGCATTTC CAGCCGTCGA TGGCAGCCAG CGGGATTAAC TCCGCCACGT TATCAGAAGA GATTTTATGC GCGGCGGAGC GGTTACTCGA CGTGATGGAG CGACCACTGG ATGCGCGTAT TCTCGGCAAA CAGATCATCC GCGAAATTCT GTACTACGTG CTGACCGGAC CTTGCGGCGG CGCGTTACTG GCGCTGGTCA GTCGCCAGAC TCACTTCAGT CTGATTAGCC GCGTGCTGAA ACGGATTGAG AATAAATACA CCGAAAACCT GAGCGTCGAG CAACTGGCGG CAGAAGCCAA CATGAGCGTA TCGGCGTTCC ACCATAATTT TAAGTCTGTC ACCAGCACCT CGCCGTTGCA GTATTTGAAG AATTACCGTC TGCATAAGGC GCGGATGATG ATCATCCATG ACGGCATGAA GGCCAGCGCA GCAGCGATGC GCGTCGGCTA TGAAAGCGCA TCGCAATTTA GCCGTGAGTT TAAACGTTAC TTCGGTGTGA CGCCGGGGGA AGATGCGGCA AGAATGCGGG CGATGCAGGG GAATTAA
|
Protein sequence | MLQNCAQSNC RIIPKKLRDM KREEICRLLA DKVNKLKNKE NSLSELLPDV RLLYGETPFA RTPVMYEPGI IILFSGHKIG YINERVFRYD ANEYLLLTVP LPFECETYAT SEVPLAGLRL NVDILQLQEL LMDIGEDEHF QPSMAASGIN SATLSEEILC AAERLLDVME RPLDARILGK QIIREILYYV LTGPCGGALL ALVSRQTHFS LISRVLKRIE NKYTENLSVE QLAAEANMSV SAFHHNFKSV TSTSPLQYLK NYRLHKARMM IIHDGMKASA AAMRVGYESA SQFSREFKRY FGVTPGEDAA RMRAMQGN
|
| |