Gene EcDH1_3873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3873 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4171315 
End bp4172670 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content52% 
IMG OID 
Productglycoside hydrolase family 4 
Protein accessionACX41474 
Protein GI260451052 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTCTG CACCCAAAAT TACATTTATC GGCGCTGGTT CGACGATTTT CGTTAAAAAT 
ATTCTTGGTG ATGTGTTCCA TCGCGAGGCG CTGAAAACGG CGCATATTGC CCTGATGGAC
ATTGACCCCA CCCGCCTGGA AGAGTCGCAT ATTGTGGTGC GTAAGCTGAT GGATTCAGCA
GGGGCCAGCG GCAAAATCAC CTGCCACACC CAACAGAAAG AAGCCTTAGA GGATGCCGAT
TTTGTCGTGG TGGCATTTCA GATTGGCGGT TATGAACCTT GCACGGTGAC TGATTTCGAG
GTCTGTAAGC GGCATGGTCT GGAACAAACC ATTGCCGATA CGTTGGGGCC GGGCGGTATT
ATGCGCGCGC TACGTACCAT TCCGCATCTG TGGCAAATTT GCGAGGACAT GACGGAAGTC
TGCCCCGATG CCACCATGCT CAACTATGTT AACCCAATGG CGATGAATAC CTGGGCGATG
TATGCCCGCT ATCCGCATAT CAAACAGGTC GGGCTGTGCC ATTCGGTGCA GGGAACGGCG
GAAGAGTTGG CGCGTGACCT CAATATCGAC CCAGCTACGC TGCGTTACCG TTGCGCAGGT
ATCAACCATA TGGCGTTTTA CCTGGAGCTG GAGCGCAAAA CCGCCGACGG CAGTTATGTG
AATCTCTACC CGGAACTGCT GGCGGCTTAT GAAGCAGGGC AGGCACCGAA GCCGAATATT
CATGGCAATA CTCGCTGCCA GAATATTGTG CGCTACGAAA TGTTCAAAAA GCTGGGCTAT
TTCGTCACGG AATCGTCAGA ACATTTTGCT GAGTACACAC CGTGGTTTAT TAAGCCAGGT
CGTGAGGATT TGATTGAGCG TTATAAAGTA CCGCTGGATG AGTACCCGAA ACGCTGCGTC
GAGCAGCTGG CGAATTGGCA TAAAGAGCTG GAGGAGTATA AAAAAGCCTC CCGGATTGAT
ATTAAACCGT CACGGGAATA TGCCAGCACA ATCATGAACG CTATCTGGAC TGGCGAGCCG
AGTGTGATTT ACGGCAACGT CCGTAACGAT GGTTTGATTG ATAACCTGCC ACAAGGATGT
TGCGTGGAAG TAGCCTGTCT GGTTGATGCT AATGGCATTC AGCCGACCAA AGTCGGTACG
CTACCTTCGC ATCTGGCCGC CCTGATGCAA ACCAACATCA ACGTACAGAC GCTGCTGACC
GAAGCTATTC TTACGGAAAA TCGCGACCGT GTTTACCACG CCGCGATGAT GGACCCGCAT
ACTGCCGCCG TGCTGGGCAT TGACGAAATA TATGCTCTTG TTGACGACCT GATTGCCGCC
CACGGCGACT GGCTGCCAGG CTGGTTGCAC CGTTAA
 
Protein sequence
MMSAPKITFI GAGSTIFVKN ILGDVFHREA LKTAHIALMD IDPTRLEESH IVVRKLMDSA 
GASGKITCHT QQKEALEDAD FVVVAFQIGG YEPCTVTDFE VCKRHGLEQT IADTLGPGGI
MRALRTIPHL WQICEDMTEV CPDATMLNYV NPMAMNTWAM YARYPHIKQV GLCHSVQGTA
EELARDLNID PATLRYRCAG INHMAFYLEL ERKTADGSYV NLYPELLAAY EAGQAPKPNI
HGNTRCQNIV RYEMFKKLGY FVTESSEHFA EYTPWFIKPG REDLIERYKV PLDEYPKRCV
EQLANWHKEL EEYKKASRID IKPSREYAST IMNAIWTGEP SVIYGNVRND GLIDNLPQGC
CVEVACLVDA NGIQPTKVGT LPSHLAALMQ TNINVQTLLT EAILTENRDR VYHAAMMDPH
TAAVLGIDEI YALVDDLIAA HGDWLPGWLH R