Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1935 |
Symbol | |
ID | 6068575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2137388 |
End bp | 2138299 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641601346 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001724908 |
Protein GI | 170019954 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0147604 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCAAC GCTGTTTTGA TAATGCCAGT GAAACGCTGT TTGTCGCTGG TAAAACGCCA CGGCTTTCAC GTTTTGCATT TAGCGATGAT CCAAAATGGG AGTCTGGACA TCACGTTCAT GACAATGAAA CCGAGCTGAT TTACGTCAAG AAAGGGGTGG CAAGGTTTAC CATCGATTCT TCGTTATATG TCGCGCATGC AGATGATATA GTGGTGATAG AACGCGGCAG GCTGCATGCG GTGGCCTCTG ACGTTAACGA TCCGGCAACG ACGTGTACCT GTGCGCTGTA CGGCTTTCAG TTTCAGGGGG CTGAGGAAAA TCATCTACTG CAACCGCATT CTTGTCCGGT AATTGCCGCG GGGCAGGGAA AAGAAGTCAT TAAAACCTTA TTTAATGAGC TAAGTGTGAT TTTGCCGCAA AGTAAAAATA GCCAAACATC TTCGTTATGG GATGCATTTG CCTATACATT AGCAATTCTT TACTACGAAA ACTTTAAAAA TGCTTATCGT TCGGAGCAGG GGTATATTAA AAAAGATGTT CTGATAAAAG ATATTCTTTT CTATCTGAAT AATAATTATC GCGAAAAAAT CACTTTAGAA CAGTTATCGA AAAAATTTCG TGCCAGCGTC AGTTATATTT GCCATGAATT TACCAAAGAG TATCGTATTT CCCCAATTAA CTATGTTATT CAACGGCGTA TGACGGAAGC GAAATGGTTA CTCACTAATA CTGAATTATC ACAGGCAGAG ATCTCCTGGC GTGTGGGTTA TGAAAATGTC GATCACTTTG GCAAACTGTT TTTGCGCCAT GTCGGCTGTT CACCCAGCGA TTACCGCAGG CAATTTAAAA ACTGTTTTGC GGAACAAGAA ATCCTATCTG AATTTCCTCA ACCGGTAAGT CTTGTTGGAT AA
|
Protein sequence | MYQRCFDNAS ETLFVAGKTP RLSRFAFSDD PKWESGHHVH DNETELIYVK KGVARFTIDS SLYVAHADDI VVIERGRLHA VASDVNDPAT TCTCALYGFQ FQGAEENHLL QPHSCPVIAA GQGKEVIKTL FNELSVILPQ SKNSQTSSLW DAFAYTLAIL YYENFKNAYR SEQGYIKKDV LIKDILFYLN NNYREKITLE QLSKKFRASV SYICHEFTKE YRISPINYVI QRRMTEAKWL LTNTELSQAE ISWRVGYENV DHFGKLFLRH VGCSPSDYRR QFKNCFAEQE ILSEFPQPVS LVG
|
| |