Gene EcDH1_3334 details


Gene Information

Locus tag: EcDH1_3334
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3583755
End bp: 3585365
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: Alpha-N-arabinofuranosidase
Protein accession: ACX40956
Protein GI: 260450534
COG category:
COG ID:
TIGRFAM ID:
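
The coordinates and lengths listed above agree with one another if the start and end positions are read as 1-based and inclusive, and if the last codon of the CDS is a stop codon that is not counted in the protein length. Both readings are assumptions, not something the record states. A minimal sketch of that arithmetic follows; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3583755, 3585365)
print(length_bp)                  # 1611
print(protein_length(length_bp))  # 536
```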


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.783434
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAATCA CTAACCCGAT ACTCACCGGC TTCAACCCGG ACCCGTCCCT GTGCCGCCAG 
GGCGAGGACT ACTACATCGC CACCTCGACC TTCGAGTGGT TCCCGGGCGT GCGCATCTAC
CACTCCCGTG ACCTGAAAAA CTGGTCGCTG GTCAGCACCC CGTTGGACCG CGTGTCGATG
CTGGACATGA AGGGCAACCC GGACTCCGGC GGCATCTGGG CGCCGTGCCT GAGCTACGCC
GACGGTAAAT TCTGGCTGCT CTACACCGAC GTGAAGATTG TCGACTCGCC GTGGAAAAAC
GGCCGCAACT TCCTCGTCAC CGCGCCCTCC ATCGAGGGGC CATGGAGCGA GCCAATCCCG
ATGGGCAACG GCGGGTTTGA CCCGTCCCTG TTCCACGACG ACGATGGCCG CAAATACTAT
ATCTACCGCC CGTGGGGGCC GCGCCACCAC AGCAACCCGC ACAACACCAT CGTGTTACAG
GCGTTTGACC CGCAGACCGG CACGCTCTCG CCCGAGCGCA AAACGCTGTT TACCGGCACG
CCGCTCTGCT ACACCGAAGG CGCGCACCTG TATCGCCACG CGGGATGGTA CTACCTGATG
GCCGCCGAGG GCGGCACCAG CTACGAGCAC GCCGTCGTGG TGCTGCGTTC CAAAAATATC
GACGGGCCGT ACGAGCTGCA CCCGGACGTA ACGATGATGA CCAGCTGGCA CCTGCCGGAG
AACCCGCTGC AGAAGAGCGG CCACGGCTCG CTGCTGCAGA CGCATACGGG TGAATGGTAC
ATGGCCTACC TCACCAGCCG CCCGCTGCGC CTGCCCGGCG TGCCGCTGCT GGCCTCCGGC
GGACGCGGCT ACTGCCCGCT GGGGCGCGAG ACCGGCATCG CCCGCATTGA ATGGCGCGAC
GGCTGGCCGT ACGTGGAAGG CGGCAAGCAC GCGCAGCTGA CCGTGAAAGG CCCGCAAGTA
GCCGAGCAGC CTGCAGCCGT TCCGGGCAAC TGGCGGGACG ATTTCGACGC CAGTTCGCTT
GACCCGGAGC TGCAGACCCT GCGCATTCCG TTCGACGACA CCCTCGGCTC GCTCACCGCG
CGCCCGGGCT TCTTACGGCT CTATGGCAAC GACTCGCTCA ATTCGACCTT CACCCAATCG
ACCGTGGCGC GCCGCTGGCA GCACTTCGCC TTCCGGGCAG AAACGCGGAT GGAGTTCTCG
CCGGTGCACT TCCAGCAGAG CGCGGGGCTG ACCTGCTACT ACAACAGCAA AAACTGGAGC
TACTGCTTTG TGGACTACGA GGAGGGACAG GGTAGAACCA TCAAAGTTAT CCAGCTCGAC
CACAACGTGC CGTCGTGGCC GCTGCACGAG CAGCCCATTC CGGTGCCGGA ACATGCGGAG
AGCGTCTGGC TGCGGGTGGA CGTGGATACG CTGGTCTACC GCTACAGCTA CTCGTTTGAT
GGCGAGACGT GGCACACCGT GCCGGTGACG TATGAGGCGT GGAAGCTGTC GGACGACTAC
ATCGGCGGGC GCGGCTTCTT CACCGGCGCG TTTGTGGGCC TGCACTGCGA GGACATCAGC
GGCGACGGCT GCTACGCGGA CTTCGACTAC TTCACCTACG AGCCGGTCTA A
 
Protein sequence
MEITNPILTG FNPDPSLCRQ GEDYYIATST FEWFPGVRIY HSRDLKNWSL VSTPLDRVSM 
LDMKGNPDSG GIWAPCLSYA DGKFWLLYTD VKIVDSPWKN GRNFLVTAPS IEGPWSEPIP
MGNGGFDPSL FHDDDGRKYY IYRPWGPRHH SNPHNTIVLQ AFDPQTGTLS PERKTLFTGT
PLCYTEGAHL YRHAGWYYLM AAEGGTSYEH AVVVLRSKNI DGPYELHPDV TMMTSWHLPE
NPLQKSGHGS LLQTHTGEWY MAYLTSRPLR LPGVPLLASG GRGYCPLGRE TGIARIEWRD
GWPYVEGGKH AQLTVKGPQV AEQPAAVPGN WRDDFDASSL DPELQTLRIP FDDTLGSLTA
RPGFLRLYGN DSLNSTFTQS TVARRWQHFA FRAETRMEFS PVHFQQSAGL TCYYNSKNWS
YCFVDYEEGQ GRTIKVIQLD HNVPSWPLHE QPIPVPEHAE SVWLRVDVDT LVYRYSYSFD
GETWHTVPVT YEAWKLSDDY IGGRGFFTGA FVGLHCEDIS GDGCYADFDY FTYEPV
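
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the tool that produced this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order.
# Assumption: table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, as grouped in the listing above.
first_60 = "ATGGAAATCA CTAACCCGAT ACTCACCGGC TTCAACCCGG ACCCGTCCCT GTGCCGCCAG"
print(translate(first_60))  # MEITNPILTGFNPDPSLCRQ
```

The printed residues match the first two ten-residue groups of the protein sequence above. Passing the full gene sequence to the same function would extend the comparison to the whole protein.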