Gene Information |
Locus tag | EcDH1_3334 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 3583755 |
End bp | 3585365 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | Alpha-N-arabinofuranosidase |
Protein accession | ACX40956 |
Protein GI | 260450534 |
COG category | |
COG ID | |
TIGRFAM ID | |
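The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the constant and function names are illustrative and not part of the record.

```python
# Values copied from the record above; all names here are illustrative.
START_BP = 3583755
END_BP = 3585365
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The strand field (`-`) does not affect these checks, since the coordinates are given on the replicon regardless of the gene's orientation.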
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.783434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATCA CTAACCCGAT ACTCACCGGC TTCAACCCGG ACCCGTCCCT GTGCCGCCAG GGCGAGGACT ACTACATCGC CACCTCGACC TTCGAGTGGT TCCCGGGCGT GCGCATCTAC CACTCCCGTG ACCTGAAAAA CTGGTCGCTG GTCAGCACCC CGTTGGACCG CGTGTCGATG CTGGACATGA AGGGCAACCC GGACTCCGGC GGCATCTGGG CGCCGTGCCT GAGCTACGCC GACGGTAAAT TCTGGCTGCT CTACACCGAC GTGAAGATTG TCGACTCGCC GTGGAAAAAC GGCCGCAACT TCCTCGTCAC CGCGCCCTCC ATCGAGGGGC CATGGAGCGA GCCAATCCCG ATGGGCAACG GCGGGTTTGA CCCGTCCCTG TTCCACGACG ACGATGGCCG CAAATACTAT ATCTACCGCC CGTGGGGGCC GCGCCACCAC AGCAACCCGC ACAACACCAT CGTGTTACAG GCGTTTGACC CGCAGACCGG CACGCTCTCG CCCGAGCGCA AAACGCTGTT TACCGGCACG CCGCTCTGCT ACACCGAAGG CGCGCACCTG TATCGCCACG CGGGATGGTA CTACCTGATG GCCGCCGAGG GCGGCACCAG CTACGAGCAC GCCGTCGTGG TGCTGCGTTC CAAAAATATC GACGGGCCGT ACGAGCTGCA CCCGGACGTA ACGATGATGA CCAGCTGGCA CCTGCCGGAG AACCCGCTGC AGAAGAGCGG CCACGGCTCG CTGCTGCAGA CGCATACGGG TGAATGGTAC ATGGCCTACC TCACCAGCCG CCCGCTGCGC CTGCCCGGCG TGCCGCTGCT GGCCTCCGGC GGACGCGGCT ACTGCCCGCT GGGGCGCGAG ACCGGCATCG CCCGCATTGA ATGGCGCGAC GGCTGGCCGT ACGTGGAAGG CGGCAAGCAC GCGCAGCTGA CCGTGAAAGG CCCGCAAGTA GCCGAGCAGC CTGCAGCCGT TCCGGGCAAC TGGCGGGACG ATTTCGACGC CAGTTCGCTT GACCCGGAGC TGCAGACCCT GCGCATTCCG TTCGACGACA CCCTCGGCTC GCTCACCGCG CGCCCGGGCT TCTTACGGCT CTATGGCAAC GACTCGCTCA ATTCGACCTT CACCCAATCG ACCGTGGCGC GCCGCTGGCA GCACTTCGCC TTCCGGGCAG AAACGCGGAT GGAGTTCTCG CCGGTGCACT TCCAGCAGAG CGCGGGGCTG ACCTGCTACT ACAACAGCAA AAACTGGAGC TACTGCTTTG TGGACTACGA GGAGGGACAG GGTAGAACCA TCAAAGTTAT CCAGCTCGAC CACAACGTGC CGTCGTGGCC GCTGCACGAG CAGCCCATTC CGGTGCCGGA ACATGCGGAG AGCGTCTGGC TGCGGGTGGA CGTGGATACG CTGGTCTACC GCTACAGCTA CTCGTTTGAT GGCGAGACGT GGCACACCGT GCCGGTGACG TATGAGGCGT GGAAGCTGTC GGACGACTAC ATCGGCGGGC GCGGCTTCTT CACCGGCGCG TTTGTGGGCC TGCACTGCGA GGACATCAGC GGCGACGGCT GCTACGCGGA CTTCGACTAC TTCACCTACG AGCCGGTCTA A
|
Protein sequence | MEITNPILTG FNPDPSLCRQ GEDYYIATST FEWFPGVRIY HSRDLKNWSL VSTPLDRVSM LDMKGNPDSG GIWAPCLSYA DGKFWLLYTD VKIVDSPWKN GRNFLVTAPS IEGPWSEPIP MGNGGFDPSL FHDDDGRKYY IYRPWGPRHH SNPHNTIVLQ AFDPQTGTLS PERKTLFTGT PLCYTEGAHL YRHAGWYYLM AAEGGTSYEH AVVVLRSKNI DGPYELHPDV TMMTSWHLPE NPLQKSGHGS LLQTHTGEWY MAYLTSRPLR LPGVPLLASG GRGYCPLGRE TGIARIEWRD GWPYVEGGKH AQLTVKGPQV AEQPAAVPGN WRDDFDASSL DPELQTLRIP FDDTLGSLTA RPGFLRLYGN DSLNSTFTQS TVARRWQHFA FRAETRMEFS PVHFQQSAGL TCYYNSKNWS YCFVDYEEGQ GRTIKVIQLD HNVPSWPLHE QPIPVPEHAE SVWLRVDVDT LVYRYSYSFD GETWHTVPVT YEAWKLSDDY IGGRGFFTGA FVGLHCEDIS GDGCYADFDY FTYEPV
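The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch, with hypothetical helper names, of how such a field could be normalised before checking basic CDS properties such as GC content, reading-frame length, and start and stop codons. The start check accepts only ATG, which is what this record uses, although translation table 11 also permits alternative start codons.

```python
def normalize(seq_field: str) -> str:
    """Remove the display spacing and upper-case the sequence."""
    return "".join(seq_field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already normalised sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3, starts with ATG, ends in a stop."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in stops


if __name__ == "__main__":
    # First two display blocks of the gene sequence, used only as a short demo.
    fragment = normalize("ATGGAAATCA CTAACCCGAT")
    print(len(fragment), gc_percent(fragment))
```

To check the whole record, the full "Gene sequence" field would be passed through `normalize` and the results compared with the listed gene length and GC content.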