Gene Information |
Locus tag | Dd1591_1279 |
Symbol | |
ID | 8118672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | + |
Start bp | 1449815 |
End bp | 1451014 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644851673 |
Product | Arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_003003621 |
Protein GI | 251788900 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
| |
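The coordinates and lengths in the table above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
START_BP = 1449815
END_BP = 1451014


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1200 399
```

The result matches the "Gene Length" and "Protein Length" rows: 1200 bp is 400 codons, of which 399 encode residues.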
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000518384 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TGATACCCAC CTTACTGGCT GTTTCCCTGT CGCTGGGCGC GATGCCGCTC ATGGCGGCGG AGTCCGTCGT GATTAAACCA CTGCGCAATG CCCCGGCCGA TTTTATCAAG GGTGCGGATA TTTCCACCTT GCTGGAAGTG GAGCGCCAGG GCGGCGTGTT TTATGACGAA AACCACGTGC GCGTCGACCC GGTCGCGTTG CTGAAAAAGA ACGGCGTCAA CTATATCCGG CTGCGTTTGT GGGTTGACCC GCACGATGCC GCCGGGCGTC CTTACGGCGG CGGTGATAAC GATCTGGCGA CGACGCTGGC GCTGGCTAAA CGCGTCAAAG CGGCAGGCAT GAAGCTACTG CTGGATTTCC ACTACAGCGA CTTCTGGACC GACCCCGGCA AGCAGTTCAA GCCGAAAGCC TGGGCTAACC TGTCCTACGA ACAACTGAAA ACCGCTGTTC ATGACTATAC CCGCGACACC ATCGCACGTT TTAAGCGGGA AGGGGTACTG CCGGATATGG TGCAGATCGG TAACGAAGCC AACGGCGGTA TCTTGTGGCC GGAAGGCAAA AGCTGGGGGC AGGGCGGCGG CGAATTCGAC CGGCTGGCCG GCCTGCTGAA CGCCGCGATC GCCGGCTTGC GTGAAAACCT TAGTTCACCG GGGCAGGTGA AAATCATGCT GCATCTGGCG GAAGGCACCA AGAACGACAC CTTCCGCTGG TGGTTTGATG AAATCACCCA ACGCGGCGTG CCGTTCGATG TGATTGGCCT GTCGATGTAC ACCTATTGGG ATGGCCCGAT CAGCTCGCTG AAAGCCAACA TGGACGACAT CAGCCAACGC TACAACAAGG ACGTTATCGT GGTAGAGGCC GCCTACGGCT ACACCCTGGC TAACTGCGAC AACGCCGAAA ACAGCTTCGG CGAAAAAGAA GCGGCGGCGG GCGGTTATCC GGCTACCGTG CAAGGGCAGG CCGATTTCAT TCGCGACCTG ATGCAAAGCG TAATCGACGT CCCGAAAAAG CACGGCAAAG GCGTGTTCTA TTGGGAACTG GCCTGGATAA CGCCGGCGGG AAATACCTGG GCCACCGAAG CCGGCATGAA TTATATCAAC GACCACTGGA AATTGGGCAA CGCCCGTGAA AATCAGGCGT TATTTAATTG CCAGGGGGAG GTGTTGCCTT CGATAAAAGC CTTTAAATAA
Protein sequence | MKKMIPTLLA VSLSLGAMPL MAAESVVIKP LRNAPADFIK GADISTLLEV ERQGGVFYDE NHVRVDPVAL LKKNGVNYIR LRLWVDPHDA AGRPYGGGDN DLATTLALAK RVKAAGMKLL LDFHYSDFWT DPGKQFKPKA WANLSYEQLK TAVHDYTRDT IARFKREGVL PDMVQIGNEA NGGILWPEGK SWGQGGGEFD RLAGLLNAAI AGLRENLSSP GQVKIMLHLA EGTKNDTFRW WFDEITQRGV PFDVIGLSMY TYWDGPISSL KANMDDISQR YNKDVIVVEA AYGYTLANCD NAENSFGEKE AAAGGYPATV QGQADFIRDL MQSVIDVPKK HGKGVFYWEL AWITPAGNTW ATEAGMNYIN DHWKLGNARE NQALFNCQGE VLPSIKAFK
| |
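The GC content and the protein sequence can both be derived from the gene sequence. The sketch below is illustrative only: `gc_content` and `translate` are hypothetical helper names, and the codon table is built by hand rather than taken from a library. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted). Only the first 30 nt of the gene are used as input here; the full sequence from the record can be pasted in the same way, since whitespace is ignored.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used for 10-nt grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the gene sequence from the record.
PREFIX = "ATGAAAAAAA TGATACCCAC CTTACTGGCT"
print(translate(PREFIX))  # MKKMIPTLLA
print(f"GC content of prefix: {gc_content(PREFIX):.0%}")
```

The translated prefix agrees with the first ten residues of the protein sequence in the record. Running `gc_content` on the full gene sequence gives the value to compare against the "GC content" row.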