Gene Dd1591_1529 details


Gene Information

Locus tag: Dd1591_1529
Symbol:
ID: 8117409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dickeya zeae Ech1591
Kingdom: Bacteria
Replicon accession: NC_012912
Strand:
Start bp: 1747386
End bp: 1748666
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 45%
IMG OID: 644851922
Product: Cellulase
Protein accession: YP_003003863
Protein GI: 251789142
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
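
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is an untranslated stop. The record does not state either point, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1747386
end_bp = 1748666
gene_length_bp = 1281
protein_length_aa = 426

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1281 bp is 427 codons, one of which is the stop codon, leaving 426 amino acids.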


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTATTT TTGATCTGGA CAAGAAAAAC ACCTCCAATA AAAAACACTA CTCTTCACGT 
AAAAGCCTGT ATTTTTCCGG TATTTTCTTA GGATTAAGTA TTACCTGTCT CTCCGGTAGT
GCCTGGGCCA GTGTTGAACC CCTTTCCGTC AGCGGTAATA AAATCTATGC GGGCGAAAAA
GCTCAGAGCT TCGCCGGCAA TAGCCTATTC TGGAGTAATA ATGGCTGGGG TGGTGAAAAA
TTCTATACTG CCGATACGGT TGCCTCACTA AAAAAAGACT GGGGTTCCAG TATTGTTCGT
GCGGCGATGG GGGTACAAGA CGCCGGTGGT TATCTCCAGG ACCCAGCCGG CAACAAAGCT
AAAGTTGAAA AAGTTGTGGA TGCGGCTATC GCCAACGACA TGTATGTGAT TATTGACTGG
CATTCACACT CAGCAGAAAA TAACCGTAAC GAAGCGATTA GCTTCTTCCA GGAAATGGCC
AGAAAGTATG GCAAAAATCC TAATGTTATT TATGAAATCT ACAATGAGCC ACTTCAGGTT
TCATGGAGTA ACACAATCAA ACCTTATGCA GAAGCAGTTA TTTCCGCTAT CAGGGCGATT
GATCCGGATA ATCTCATTAT TGTCGGCACA CCCAGTTGGT CACAAAACGT AGACGAAGCC
TCACGAGATC CAATCAACGC CAATAATATT GCCTATACAT TACACTTCTA TGCCGGAACA
CATGGCGAGT CGTTACGCAA TAAAGCCCGC CAGGCGCTAA ATAATGGTAT CGCTCTCTTT
GTTACCGAGT GGGGAGCAGT CAATGCCGAT GGTAATGGTG GGGTAAATCA AACAGAAACA
GATGCCTGGG TAACATTTAT GAAGGATAAC AATATCAGCA ACGCTAACTG GGCGTTGAAT
GATAAAAATG AAGGGGCATC TACTTATTAT CCTGATTCCA AAAACCTTAC CGAATCGGGC
AAGAAAGTAA AATCGATCAT TCAAAATTGG CCTTATAAAA TCAACGGCAC ATCCAGTACC
ACAACTGAAC CATCAACCGA ACCCACACCA ACGCCTACCA CAGACGAGCC GGTGACGACG
GATGAGCCGG CAACAACAAA CTGTTCGAAT ACCAATGTGT ATCCCAATTG GGTCAGTAAG
GACTGGGCTG GTGGGCAACC AAATCATAAT GAAGCGGGTC AATCGATCGT CTACAAAGGC
AATCTCTATA CTGCAAACTG GTACACGACC TCTACCCCTG GCAGCGACTC CTCATGGACG
TTGGTCGGTA GCTGCAATTG A
 
Protein sequence
MPIFDLDKKN TSNKKHYSSR KSLYFSGIFL GLSITCLSGS AWASVEPLSV SGNKIYAGEK 
AQSFAGNSLF WSNNGWGGEK FYTADTVASL KKDWGSSIVR AAMGVQDAGG YLQDPAGNKA
KVEKVVDAAI ANDMYVIIDW HSHSAENNRN EAISFFQEMA RKYGKNPNVI YEIYNEPLQV
SWSNTIKPYA EAVISAIRAI DPDNLIIVGT PSWSQNVDEA SRDPINANNI AYTLHFYAGT
HGESLRNKAR QALNNGIALF VTEWGAVNAD GNGGVNQTET DAWVTFMKDN NISNANWALN
DKNEGASTYY PDSKNLTESG KKVKSIIQNW PYKINGTSST TTEPSTEPTP TPTTDEPVTT
DEPATTNCSN TNVYPNWVSK DWAGGQPNHN EAGQSIVYKG NLYTANWYTT STPGSDSSWT
LVGSCN
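
The relationship between the two sequence blocks can be illustrated with a minimal translation sketch. It is not the database's own tooling, and the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). It also assumes the first and last lines of the gene sequence are in frame, which holds here because each full line carries 60 bases.

```python
# Standard codon assignments, enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(block.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = clean("ATGCCTATTT TTGATCTGGA CAAGAAAAAC ACCTCCAATA AAAAACACTA CTCTTCACGT")
last_line = clean("TTGGTCGGTA GCTGCAATTG A")

print(translate(first_line))  # MPIFDLDKKNTSNKKHYSSR
print(translate(last_line))   # LVGSCN*
print(f"GC fraction of first line: {gc_fraction(first_line):.2f}")
```

The two translated fragments match the start and end of the protein sequence above, with the trailing `*` marking the TGA stop codon. The GC fraction of a single line need not equal the 45% reported for the whole gene. Applying `clean` and `gc_fraction` to the full gene sequence block would give the gene-wide value.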