Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3709 |
Symbol | |
ID | 6064706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4059436 |
End bp | 4060326 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641603127 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001726647 |
Protein GI | 170021693 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | [TIGR02297] 4-hydroxyphenylacetate catabolism regulatory protein HpaA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.680131 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGTGACC GTCAGATTGC CAATATTGAT ATCAGCAAAG AGTACGATGA AAGCCTGGGC ACGGACGATG TGCATTATCA GTCGTTCGCC CGCATGGCGG CCTTTTTTGG CCGCCATATG CTGCCACATC GCCACGAACA GTACTTTCAG ATGCATTTCC TCAATAGCGG ACAGATTGAG CTACAGCTTG ACGATCATCG CTACTCGGTG GAAGCGCCCC TGTTTGTCCT GACGCCGCCG TCAGTACCTC ATGCGTTTAT TACGGAGTCT GATGCTGACG GTCATGTATT GACGGTACGG GAAGATCTGA TCTGGCCCCT GCTGGAAGTT CTTTATCCGG GCACTCGGGA AACCTTCGGC CTGCCGGGGA TTTGCCTGTC ACTGGCAGAT AAACCCGACG AACTGGCGGC GCTGGAACAC TATTGGCAAC TGATAGAGCG GGAATCGGTA GAACAACTGC CTGGACGGGA ACACACCCTG ACGTTACTGG CACAGGCAGT GTTCACCCTA CTGCTGCGTA ACGCAAAACT CGACGACCAT GCCGCCAGCG GAATGCGCGG AGAATTAAAA CTGTTCCAGC GTTTTCATAT GCTTATTGAA AGCCATTTTC ATCAGCACTG GACAGTACCG GATTACGCTA ACGAACTGCA TATCACCGAA TCACGCCTCA CGGACATCTG CCGCCGCTTT GCCAACCGTC CGCCAAAACG GTTGATTTTC GACAGGCAGC TACGAGAAGC CAAGCGGCTG CTGCTGTTTT CTGATAACGC CGTGAACAAT ATTGCCTGGC AACTCGGTTT TAAGGATCCG GCTTATTTTG CGCGCTTTTT TAATCGCTTA GTCGGTTGCT CGCCCAGTGC TTATCGTGCC AAAAAAGTAC CTGTGACGTG A
|
Protein sequence | MCDRQIANID ISKEYDESLG TDDVHYQSFA RMAAFFGRHM LPHRHEQYFQ MHFLNSGQIE LQLDDHRYSV EAPLFVLTPP SVPHAFITES DADGHVLTVR EDLIWPLLEV LYPGTRETFG LPGICLSLAD KPDELAALEH YWQLIERESV EQLPGREHTL TLLAQAVFTL LLRNAKLDDH AASGMRGELK LFQRFHMLIE SHFHQHWTVP DYANELHITE SRLTDICRRF ANRPPKRLIF DRQLREAKRL LLFSDNAVNN IAWQLGFKDP AYFARFFNRL VGCSPSAYRA KKVPVT
|
| |