Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3294 |
Symbol | |
ID | 6147369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3369356 |
End bp | 3370255 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618124 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001745274 |
Protein GI | 170683787 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.693263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGTG AAGAGATTTG CCGCTTGCTG GCGGATAAAG TTAATAAACT GAAAAATAAA GAAAATAGTT TGTCAGAGCT GTTGCCCGAT GTGCGTTTGT TGTATGGCGA GACGCCTTTC GCACGCACAC CGGTGATGTA CGAGCCTGGC ATCATAATTC TCTTTTCCGG ACATAAAATC GGTTATATCA ATGAACGCGT GTTTCGTTAT GATGCCAATG AATACCTGCT GCTGACGGTG CCGTTGCCGT TTGAGTGCGA AACCTATGCC ACGTCAGAGG TGCCGCTGGC AGGGTTGCGT CTCAATGTCG ATATTTTGCA GTTACAGGAA CTGTTGATGG ACATTGGCGA AGATGAGCAT TTCCAGCCGT CGATGGCAGC CAGCGGGATT AACTCCGCCA CGTTATCAGA AGAGATTTTA TGCGCGGCGG AGCGGTTACT CGACGTGATG GAGCGACCGC TGGATGCGCG TATTCTCGGC AAACAGATCA TCCGCGAAAT TCTGTACTAC GTGCTGACCG GACCTTGCGG CGGCGCGTTA CTGGCGCTGG TCAGTCGCCA GACTCACTTC AGCCTTATTA GCCGCGTGCT GAAACGGATT GAGAATAAAT ACACCGAAAA CCTGAGCGTC GAGCAACTGG CGGCAGAAGC CAACATGAGC GTATCGGCGT TCCACCATAA TTTTAAGTCT GTCACCAGCA CCTCGCCGTT GCAGTATTTG AAGAATTACC GTCTGCATAA GGCGCGGATG ATGATCATCC ACGACGGCAT GAAAGCCAGT GCGGCAGCGA TGCGCGTCGG TTACGAAAGC GCATCGCAAT TTAGCCGTGA GTTTAAACGT TACTTCGGTG TGACGCCGGG GGAAGATGCG GCAAGAATGC GGGCGATGCA GGGGAATTAA
|
Protein sequence | MKREEICRLL ADKVNKLKNK ENSLSELLPD VRLLYGETPF ARTPVMYEPG IIILFSGHKI GYINERVFRY DANEYLLLTV PLPFECETYA TSEVPLAGLR LNVDILQLQE LLMDIGEDEH FQPSMAASGI NSATLSEEIL CAAERLLDVM ERPLDARILG KQIIREILYY VLTGPCGGAL LALVSRQTHF SLISRVLKRI ENKYTENLSV EQLAAEANMS VSAFHHNFKS VTSTSPLQYL KNYRLHKARM MIIHDGMKAS AAAMRVGYES ASQFSREFKR YFGVTPGEDA ARMRAMQGN
|
| |