Gene Information |
Locus tag | EcSMS35_1500 |
Symbol | |
ID | 6147217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1483559 |
End bp | 1484470 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641616378 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001743558 |
Protein GI | 170682471 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
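The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 1483559
end_bp = 1484470


def feature_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by a CDS of gene_len bp, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = feature_length(start_bp, end_bp)
print(gene_len, protein_length(gene_len))  # 912 303
```

Under those assumptions the computed values agree with the Gene Length (912 bp) and Protein Length (303 aa) fields.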
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.831009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0187399 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCAAC GCTGTTTTGA TAATGCCAGT GAAACGCTGT TTGTCGCCGG TAAAACGCCA CGGCTTTCAC GTTTCGCATT TAGCGATGAT CCAAAATGGG AGTCCGGACA TCACGTTCAT GACAATGAAA CCGAGCTGAT TTACGTCAAG AAAGGGGTAG CAAGGTTTAC CATCGATTCT TCGTTATATG TCGCGCATGC CGATGACATT GTGGTGATAG AACGTGGCAG GCTGCATGCG GTGGCCTCTG ACGTTAACGA TCCGGCAACG ACGTGTACCT GTGCGCTGTA CGGCTTTCAG TTTCAGGGGG CTGAGGAAAA TCAGCTACTG CAACCGCATT CTTGCCCGGT AATTGCCGCA GGGCAGGGAA AAGAAGTCAT TAAAACCTTA TTTAATGAGC TAAGTGTGAT TTTGCCGCAA AGTAAAAATA GCCAAACATC TTCGTTATGG GACGCATTTG CCTATACGTT AGCAATTCTT TACTACGAAA ACTTTAAAAA TGCTTATCGT TCGGAGCAGG GATATATTAA AAAAGATGTT CTGATAAAAG ATATTCTTTT CTATCTGAAT AATAACTATC GCGAAAAAAT CACTTTAGAG CAGTTATCGA AAAAATTTCG TGCCAGCGTC AGTTATATCT GCCATGAATT TACCAAAGAG TATCGTATTT CCCCTATTAA CTATGTTATT CAACGGCGTA TGACGGAAGC GAAATGGTCA CTCACTAATA CTGAATTATC ACAGGCAGAG ATTTCCTGGC GTGTGGGTTA TGAAAATGTC GATCACTTTG CCAAACTGTT TTTGCGCCAT GTCGGCTGTT CGCCCAGCGA TTACCGCAGG CAATTTAAAA ACTGTTTTGC GGAACAAGAA ATCCTATCTG AATTTCCTCA ACCAGTAAGT CTTGCCGGAT AA
|
Protein sequence | MYQRCFDNAS ETLFVAGKTP RLSRFAFSDD PKWESGHHVH DNETELIYVK KGVARFTIDS SLYVAHADDI VVIERGRLHA VASDVNDPAT TCTCALYGFQ FQGAEENQLL QPHSCPVIAA GQGKEVIKTL FNELSVILPQ SKNSQTSSLW DAFAYTLAIL YYENFKNAYR SEQGYIKKDV LIKDILFYLN NNYREKITLE QLSKKFRASV SYICHEFTKE YRISPINYVI QRRMTEAKWS LTNTELSQAE ISWRVGYENV DHFAKLFLRH VGCSPSDYRR QFKNCFAEQE ILSEFPQPVS LAG
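The sequences above are displayed in space-separated blocks of ten. The sketch below, with hypothetical helper names, shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons, which this sketch does not handle). Only the first three blocks of the gene sequence are used as input here.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, then third codon base over T, C, A, G.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Build the codon lookup table from the compact string above.
CODON_TABLE = {}
for i, b1 in enumerate(BASES):
    for j, b2 in enumerate(BASES):
        for k, b3 in enumerate(BASES):
            CODON_TABLE[b1 + b2 + b3] = AMINO_ACIDS[16 * i + 4 * j + k]


def clean_sequence(text):
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are NOT converted to M here.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
prefix = clean_sequence("ATGTATCAAC GCTGTTTTGA TAATGCCAGT")
print(translate(prefix))  # MYQRCFDNAS
```

The translated prefix matches the first block of the protein sequence. Applying the same functions to the full 912 bp sequence would be the natural way to check the remaining residues and the GC content field.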