Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3478 |
Symbol | |
ID | 5587362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3487528 |
End bp | 3488655 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640927105 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001464475 |
Protein GI | 157156780 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGTTG GGGTGTGCAG ATTAAAGTTG TTCATTACTT GCTCCCTTTG CTGGGCCAAT ATGAGGGCAG AGAACGATCT GCCTGATGTT TTTCATTGTG ATCGCCAGCG CCCTGGCTCT CAATGCTCAT TTCTGCCAAT GTCTTGCCTA TTTCTCCAGA GTGCTGGAGA AATGCTACAA AATTGCGCAC AATCAAATTG CCGCATTATT CCTAAGAAAT TACGCGATAT GAAACGTGAA GAGATTTGCC GTTTGCTGGC GAATAAAGTT AATAAAATGA AAAATAAAGA AAATAGTTTG TCAGAACTGT TGCCCGATGT GCGTTTGTTG TATGGCGAGA CGCCTTTCGC ACGTACACCG GTGATGTACG AGCCTGGCAT CATAATTCTC TTTTCCGGAC ATAAAATCGG TTATATCAAT GAACGCGTGT TTCGTTATGA TGCCAATGAA TACCTGCTGC TGACGGTGCC GTTGCCGTTT GAGTGCGAAA CCTATGCCAC GTCAGAGATG CCGCTGGCAG GGGTGCGTCT CAATGTCGAT ATTTTGCAGT TACAGGAACT GTTGATGGAC ATTGGCGAAG ATGAGCATTT CCAGCCGTCG ATGGCAGCCA GCGGGATTAA CTCCGCCACG TTATCAGAAG AGATTTTATG CGCGGCGGAG CGGTTACTCG ACGTGATGGA GCGACCACTG GATGCGCGTA TTCTCGGCAA ACAGATCATC CGCGAAATTC TGTACTACGT GCTGACCGGA CCTTGCGGCG GCGCGTTACT GGCGCTGGTC AGTCGCCAGA CTCACTTCAG TCTGATTAGC CGCGTGCTGA AACGGATTGA GAATAAATAC ACCGAAAACC TGAGCGTCGA GCAACTGGCG GCAGAAGCCA ATATGAGCGT ATCGGCGTTC CACCATAATT TTAAGTCTGT CACAAGCACC TCGCCGTTGC AGTATTTGAA GAATTACCGT CTGCATAAGG CGCGGATGAT GATCATCCAT GACGGCATGA AGGCCAGCGC AGCAGCGATG CGCGTCGGCT ATGAAAGCGC ATCGCAATTT AGCCGTGAGT TTAAACGTTA CTTCGGTGTG ACGCCGGGGG AAGATGCGGC AAGAATGCGG GCGATGCAGG GGAATTAA
|
Protein sequence | MRVGVCRLKL FITCSLCWAN MRAENDLPDV FHCDRQRPGS QCSFLPMSCL FLQSAGEMLQ NCAQSNCRII PKKLRDMKRE EICRLLANKV NKMKNKENSL SELLPDVRLL YGETPFARTP VMYEPGIIIL FSGHKIGYIN ERVFRYDANE YLLLTVPLPF ECETYATSEM PLAGVRLNVD ILQLQELLMD IGEDEHFQPS MAASGINSAT LSEEILCAAE RLLDVMERPL DARILGKQII REILYYVLTG PCGGALLALV SRQTHFSLIS RVLKRIENKY TENLSVEQLA AEANMSVSAF HHNFKSVTST SPLQYLKNYR LHKARMMIIH DGMKASAAAM RVGYESASQF SREFKRYFGV TPGEDAARMR AMQGN
|
| |