Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_3873 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 4171315 |
End bp | 4172670 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | glycoside hydrolase family 4 |
Protein accession | ACX41474 |
Protein GI | 260451052 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGTCTG CACCCAAAAT TACATTTATC GGCGCTGGTT CGACGATTTT CGTTAAAAAT ATTCTTGGTG ATGTGTTCCA TCGCGAGGCG CTGAAAACGG CGCATATTGC CCTGATGGAC ATTGACCCCA CCCGCCTGGA AGAGTCGCAT ATTGTGGTGC GTAAGCTGAT GGATTCAGCA GGGGCCAGCG GCAAAATCAC CTGCCACACC CAACAGAAAG AAGCCTTAGA GGATGCCGAT TTTGTCGTGG TGGCATTTCA GATTGGCGGT TATGAACCTT GCACGGTGAC TGATTTCGAG GTCTGTAAGC GGCATGGTCT GGAACAAACC ATTGCCGATA CGTTGGGGCC GGGCGGTATT ATGCGCGCGC TACGTACCAT TCCGCATCTG TGGCAAATTT GCGAGGACAT GACGGAAGTC TGCCCCGATG CCACCATGCT CAACTATGTT AACCCAATGG CGATGAATAC CTGGGCGATG TATGCCCGCT ATCCGCATAT CAAACAGGTC GGGCTGTGCC ATTCGGTGCA GGGAACGGCG GAAGAGTTGG CGCGTGACCT CAATATCGAC CCAGCTACGC TGCGTTACCG TTGCGCAGGT ATCAACCATA TGGCGTTTTA CCTGGAGCTG GAGCGCAAAA CCGCCGACGG CAGTTATGTG AATCTCTACC CGGAACTGCT GGCGGCTTAT GAAGCAGGGC AGGCACCGAA GCCGAATATT CATGGCAATA CTCGCTGCCA GAATATTGTG CGCTACGAAA TGTTCAAAAA GCTGGGCTAT TTCGTCACGG AATCGTCAGA ACATTTTGCT GAGTACACAC CGTGGTTTAT TAAGCCAGGT CGTGAGGATT TGATTGAGCG TTATAAAGTA CCGCTGGATG AGTACCCGAA ACGCTGCGTC GAGCAGCTGG CGAATTGGCA TAAAGAGCTG GAGGAGTATA AAAAAGCCTC CCGGATTGAT ATTAAACCGT CACGGGAATA TGCCAGCACA ATCATGAACG CTATCTGGAC TGGCGAGCCG AGTGTGATTT ACGGCAACGT CCGTAACGAT GGTTTGATTG ATAACCTGCC ACAAGGATGT TGCGTGGAAG TAGCCTGTCT GGTTGATGCT AATGGCATTC AGCCGACCAA AGTCGGTACG CTACCTTCGC ATCTGGCCGC CCTGATGCAA ACCAACATCA ACGTACAGAC GCTGCTGACC GAAGCTATTC TTACGGAAAA TCGCGACCGT GTTTACCACG CCGCGATGAT GGACCCGCAT ACTGCCGCCG TGCTGGGCAT TGACGAAATA TATGCTCTTG TTGACGACCT GATTGCCGCC CACGGCGACT GGCTGCCAGG CTGGTTGCAC CGTTAA
|
Protein sequence | MMSAPKITFI GAGSTIFVKN ILGDVFHREA LKTAHIALMD IDPTRLEESH IVVRKLMDSA GASGKITCHT QQKEALEDAD FVVVAFQIGG YEPCTVTDFE VCKRHGLEQT IADTLGPGGI MRALRTIPHL WQICEDMTEV CPDATMLNYV NPMAMNTWAM YARYPHIKQV GLCHSVQGTA EELARDLNID PATLRYRCAG INHMAFYLEL ERKTADGSYV NLYPELLAAY EAGQAPKPNI HGNTRCQNIV RYEMFKKLGY FVTESSEHFA EYTPWFIKPG REDLIERYKV PLDEYPKRCV EQLANWHKEL EEYKKASRID IKPSREYAST IMNAIWTGEP SVIYGNVRND GLIDNLPQGC CVEVACLVDA NGIQPTKVGT LPSHLAALMQ TNINVQTLLT EAILTENRDR VYHAAMMDPH TAAVLGIDEI YALVDDLIAA HGDWLPGWLH R
|
| |