Gene Information |
Locus tag | Dd1591_4079 |
Symbol | |
ID | 8120931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | - |
Start bp | 4603248 |
End bp | 4604246 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644854456 |
Product | Cellulase |
Protein accession | YP_003006356 |
Protein GI | 251791635 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
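The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it matches the listed 999 bp), and that the stop codon is counted in the gene length but not in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4603248, 4604246)
print(length, protein_length_aa(length))  # prints "999 332"
```

Because the strand is `-`, the coding sequence is read from the reverse complement of the replicon between these two positions; the length arithmetic is the same on either strand.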
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAGC TAATGTGGCG TGGTTGGATA TTGATGCTGA TGGTGTGGTT TAGTGTGTCG GCGACGGCGG CGACCGGCTG GGAAACCTAT AAAAGCCGCT TCATGACCGC AGACGGACGC ATTCAGGATA CCGGCAATAA GAATGTCAGC CATACCGAAG GCCAGGGTTT CGCCATGCTG ATGGCGGTAC ATTACGATGA CCGTGCGGCG TTCGATAACC TGTGGAACTG GACGCAGAGC CACCTGAAAA ATACCGTCAA CGGCTTGTTT TACTGGCGTT ATGACCCGGC GGCATCCAAC CCGGTAGCCG ATCGGAACAA CGCATCGGAC GGCGATGTGC TGATTGCCTG GGCGTTGCTG AAGGCCGGTA ACAAGTGGCA GGACAACCGC TATTTGCAGG CTTCCGACGG CATCCAGAAA GCGATCATCA GCAACGAGAT TATTCAGTTC GCCGGGCGCA CCGTGATGTT GCCGGGCGCG TATGGCTTCA ACAAGAACAG CTATGTCGTC CTTAACCCGT CGTATTTCTT GTTCCCCGCC TGGCGCGACT TTGCCAATCG CAGCCACCTT CAGGTCTGGC GGCAACTGAT TGACGATAGC TTATCGTTGG TTGGAGAAAT GCGTTTCGGG CAGACGGGAT TGCCGACGGA TTGGGTGGCG TTAAACGCCG ATGGCAGTAT GGCGCCGGCA ACCGCCTGGC CGTCGCGTTT CAGTTATGAT GCTATCCGTA TTCCGTTGTA CCTGTACTGG TATGACGCCA AAACGATGGC GCTGGTGCCG TTCCAACTGT ATTGGCGGAA TTATCCCCGT TTGGCGACGC CGGCCTGGGT CGATGTACTG AGCAATAACA CCGCGCCTTA CAGTATGCAA GGCGGTTTAC TGGCGGTGCG TGACCTGACG ATGGGCAGTT TCGGCGCGCT TAGCGATCAG CCTGGCGCGG CGGAGGATTA TTACTCGTCC AGCTTGCGTT TGCTGGTTGC CCTGGCGCGC GGCCAGTAA
Protein sequence | MLKLMWRGWI LMLMVWFSVS ATAATGWETY KSRFMTADGR IQDTGNKNVS HTEGQGFAML MAVHYDDRAA FDNLWNWTQS HLKNTVNGLF YWRYDPAASN PVADRNNASD GDVLIAWALL KAGNKWQDNR YLQASDGIQK AIISNEIIQF AGRTVMLPGA YGFNKNSYVV LNPSYFLFPA WRDFANRSHL QVWRQLIDDS LSLVGEMRFG QTGLPTDWVA LNADGSMAPA TAWPSRFSYD AIRIPLYLYW YDAKTMALVP FQLYWRNYPR LATPAWVDVL SNNTAPYSMQ GGLLAVRDLT MGSFGALSDQ PGAAEDYYSS SLRLLVALAR GQ
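The sequences above are printed in space-separated blocks of 10, so they need cleaning before use. The following is a minimal sketch, using only the Python standard library, of how one might strip the spacing, compute GC content, and translate the gene sequence. It relies on the fact that translation table 11 assigns codons to amino acids in the same way as the standard code (the two differ in which start codons are permitted). The helper names are hypothetical, and only the first and last few codons of the record are used here for brevity.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and final block of the gene sequence in this record.
print(translate("ATGCTAAAGC TAATGTGGCG TGGTTGGATA"))  # prints "MLKLMWRGWI"
print(translate("GGCCAGTAA"))                         # prints "GQ"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the full sequence can be compared against the GC content field in the record.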