Gene Information |
Locus tag | EcolC_1599 |
Symbol | |
ID | 6066057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1779085 |
End bp | 1780080 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641601015 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001724585 |
Protein GI | 170019631 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
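The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in Protein Length; the record does not state either convention, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1779085
end_bp = 1780080
gene_length_bp = 996
protein_length_aa = 331


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Codon count minus one terminal stop codon (assumes an intact, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the table is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 1780080 - 1779085 + 1 gives 996 bp, and 996 / 3 - 1 gives 331 aa, matching the Gene Length and Protein Length fields.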
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.979048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATA ACGTTTTGCT CATAGGAGCT TCCGGATTCG TAGGAACCCG ACTACTTGAA ACGGCAATTG CTGACTTTAA TATCAAGAAC CTGGACAAAC AGCAGAGCCA CTTTTATCCA GAAATCACAC AGATTGGCGA TGTTCGTGAT CAACAGGCAC TCGACCAGGC GTTAGCCGGT TTTGACACTG TTGTACTACT GGCAGCGGAA CACCGCGATG ACGTCAGCCC TACTTCTCTC TATTATGATG TCAACGTTCA GGGTACCCGC AATGTGCTGG CGGCCATGGA AAAAAATGGC GTTAAAAATA TCATCTTTAC CAGTTCCGTT GCTGTTTATG GTTTGAACAA ACACAACCCT GACGAAAACC ATCCACACGA CCCTTTCAAC CACTACGGCA AAAGTAAGTG GCAGGCAGAG GAAGTGCTGC GTGAATGGTA TAACAAAGCA CCAACAGAAC GTTCATTAAC TATCATCCGT CCTACCGTTA TCTTCGGTGA ACGCAACCGC GGTAACGTCT ATAACTTGCT GAAACAGATC GCTGGCGGCA AGTTTATGAT GGTGGGCGCA GGGACTAACT ATAAGTCCAT GGCTTATGTT GGAAACATTG TTGAGTTTAT CAAGTACAAA CTGAAGAATG TTGCCGCAGG TTACGAGGTT TATAACTACG TTGATAAGCC AGACCTGAAC ATGAACCAGT TGGTTGCTGA AGTTGAACAA AGCCTGAACA AAAAGATCCC TTCTATGCAC TTGCCTTACC CACTAGGAAT GCTGGGTGGA TATTGCTTTG ATATCCTGAG CAAAATTACG GGCAAAAAAT ACGCTGTCAG CTCTGTGCGC GTGAAAAAAT TCTGCGCAAC AACACAGTTT GACGCAACGA AAGTGCATTC TTCAGGTTTT GTGGCACCGT ATACGCTGTC GCAAGGTCTG GATCGAACTC TGCAGTATGA ATTCGTCCAT GCCAAAAAAG ACGACATAAC GTTTGTTTCT GAGTAA
|
Protein sequence | MNDNVLLIGA SGFVGTRLLE TAIADFNIKN LDKQQSHFYP EITQIGDVRD QQALDQALAG FDTVVLLAAE HRDDVSPTSL YYDVNVQGTR NVLAAMEKNG VKNIIFTSSV AVYGLNKHNP DENHPHDPFN HYGKSKWQAE EVLREWYNKA PTERSLTIIR PTVIFGERNR GNVYNLLKQI AGGKFMMVGA GTNYKSMAYV GNIVEFIKYK LKNVAAGYEV YNYVDKPDLN MNQLVAEVEQ SLNKKIPSMH LPYPLGMLGG YCFDILSKIT GKKYAVSSVR VKKFCATTQF DATKVHSSGF VAPYTLSQGL DRTLQYEFVH AKKDDITFVS E
|
| |
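The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any length or composition check. The following is a minimal sketch of that cleanup together with a GC fraction calculation of the kind that could be compared against the GC content field. The helper names are hypothetical, and the sample string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean(seq):
    """Remove the whitespace that splits the sequence into display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First two blocks of the gene sequence, used here only as a short sample.
sample = "ATGAACGATA ACGTTTTGCT"
print(len(clean(sample)))
print(gc_fraction(sample))
```

The sample contains 7 G or C bases out of 20. To check the full record, paste the entire Gene sequence value into `gc_fraction` and compare the result with the reported GC content; how the source rounds that percentage is not stated.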