Gene Information |
Locus tag | EcolC_3214 |
Symbol | |
ID | 6066697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3521512 |
End bp | 3522486 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641602629 |
Product | aldo/keto reductase |
Protein accession | YP_001726163 |
Protein GI | 170021209 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
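The coordinate and length fields above agree with one another if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption, since the record does not state it. The sketch below checks the arithmetic; the helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3521512, 3522486)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the helpers give 975 bp and 324 aa, matching the Gene Length and Protein Length fields.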
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000166082 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAATACA ACCCCTTAGG AAAAACCGAC CTTCGCGTTT CCCGACTTTG CCTCGGCTGT ATGACCTTTG GCGAGCCAGA TCGTGGTAAT CACGCATGGA CACTGCCGGA AGAAAGCAGC CGTCCCATAA TTAAACGCGC GCTGGAAGGC GGCATAAATT TCTTTGACAC CGCCAATAGC TATTCCGATG GCAGCAGCGA AGAGATCGTC GGTCGCGCAC TGCGGGATTT CGCCCGTCGT GAAGACGTGG TCGTTGCCAC CAAAGTATTC CATCGCGTTG GTGATTTGCC GGAAGGATTA TCCCGTGCGC AAATATTGCG CTCTATCGAC GACAGCCTGC GCCGTCTCGG CATGGATTAT GTCGATATCC TGCAAATTCA TCGCTGGGAT TACAACACGC CGATCGAAGA GACGCTGGAA GCCCTGAACG ACGTGGTAAA AGCCGGGAAA GCGCGTTATA TCGGCGCGTC ATCAATGCAC GCTTCGCAGT TTGCTCAGGC ACTGGAACTC CAAAAACAGC ATGGCTGGGC GCAGTTTGTC AGTATGCAGG ATCACTACAA TCTGATTTAT CGCGAAGAAG AGCGCGAGAT GCTGCCACTG TGTTATCAGG AGGGCGTGGC GGTGATTCCG TGGAGCCCGC TGGCGCGGGG GCGACTGACG CGTCCGTGGG GAGAAACTAC CGCACGACTG GTGTCTGATG AGGTGGGGAA AAATCTCTAT AAAGAAAGCG ATGAAAATGA CGCGCAGATC GCAGAGCGGT TAACGGGCGT CAGTGAAGAA CTCGGTGCAA CACGAGCACA AGTTGCGCTG GCCTGGTTGT TGAGTAAACC GGGCATTGCC GCACCGATTA TCGGTACATC GCGGGAAGAA CAGCTTGATG AGCTATTGAA CGCGGTGGAT ATCACTTTGA AGCCGGAACA GATTGCCGAA CTGGAAACGC CGTATAAACC GCATCCGGTA GTAGGATTTA AATAA
|
Protein sequence | MQYNPLGKTD LRVSRLCLGC MTFGEPDRGN HAWTLPEESS RPIIKRALEG GINFFDTANS YSDGSSEEIV GRALRDFARR EDVVVATKVF HRVGDLPEGL SRAQILRSID DSLRRLGMDY VDILQIHRWD YNTPIEETLE ALNDVVKAGK ARYIGASSMH ASQFAQALEL QKQHGWAQFV SMQDHYNLIY REEEREMLPL CYQEGVAVIP WSPLARGRLT RPWGETTARL VSDEVGKNLY KESDENDAQI AERLTGVSEE LGATRAQVAL AWLLSKPGIA APIIGTSREE QLDELLNAVD ITLKPEQIAE LETPYKPHPV VGFK
|
| |
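The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence with the amino-acid assignments of NCBI translation table 11. The function names are hypothetical. The translation ignores table 11's alternative start codons, which is harmless here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG order
# (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def strip_grouping(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty DNA sequence."""
    seq = strip_grouping(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = strip_grouping(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGCAATACA ACCCCTTAGG AAAAACCGAC"))  # MQYNPLGKTD
```

Passing the full Gene sequence field to `translate` and `gc_fraction` is a way to cross-check the Protein sequence and GC content fields of this record.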