Gene Dd1591_1279 details

Gene Information

Locus tag: Dd1591_1279
Symbol:
ID: 8118672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dickeya zeae Ech1591
Kingdom: Bacteria
Replicon accession: NC_012912
Strand:
Start bp: 1449815
End bp: 1451014
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 57%
IMG OID: 644851673
Product: Arabinogalactan endo-1,4-beta-galactosidase
Protein accession: YP_003003621
Protein GI: 251788900
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3867] Arabinogalactan endo-1,4-beta-galactosidase
TIGRFAM ID:
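
The start, end, gene length, and protein length fields in this record are mutually consistent. The minimal Python sketch below shows the arithmetic. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1449815, 1451014)
print(length, protein_length(length))  # prints: 1200 399
```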


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000518384
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TGATACCCAC CTTACTGGCT GTTTCCCTGT CGCTGGGCGC GATGCCGCTC 
ATGGCGGCGG AGTCCGTCGT GATTAAACCA CTGCGCAATG CCCCGGCCGA TTTTATCAAG
GGTGCGGATA TTTCCACCTT GCTGGAAGTG GAGCGCCAGG GCGGCGTGTT TTATGACGAA
AACCACGTGC GCGTCGACCC GGTCGCGTTG CTGAAAAAGA ACGGCGTCAA CTATATCCGG
CTGCGTTTGT GGGTTGACCC GCACGATGCC GCCGGGCGTC CTTACGGCGG CGGTGATAAC
GATCTGGCGA CGACGCTGGC GCTGGCTAAA CGCGTCAAAG CGGCAGGCAT GAAGCTACTG
CTGGATTTCC ACTACAGCGA CTTCTGGACC GACCCCGGCA AGCAGTTCAA GCCGAAAGCC
TGGGCTAACC TGTCCTACGA ACAACTGAAA ACCGCTGTTC ATGACTATAC CCGCGACACC
ATCGCACGTT TTAAGCGGGA AGGGGTACTG CCGGATATGG TGCAGATCGG TAACGAAGCC
AACGGCGGTA TCTTGTGGCC GGAAGGCAAA AGCTGGGGGC AGGGCGGCGG CGAATTCGAC
CGGCTGGCCG GCCTGCTGAA CGCCGCGATC GCCGGCTTGC GTGAAAACCT TAGTTCACCG
GGGCAGGTGA AAATCATGCT GCATCTGGCG GAAGGCACCA AGAACGACAC CTTCCGCTGG
TGGTTTGATG AAATCACCCA ACGCGGCGTG CCGTTCGATG TGATTGGCCT GTCGATGTAC
ACCTATTGGG ATGGCCCGAT CAGCTCGCTG AAAGCCAACA TGGACGACAT CAGCCAACGC
TACAACAAGG ACGTTATCGT GGTAGAGGCC GCCTACGGCT ACACCCTGGC TAACTGCGAC
AACGCCGAAA ACAGCTTCGG CGAAAAAGAA GCGGCGGCGG GCGGTTATCC GGCTACCGTG
CAAGGGCAGG CCGATTTCAT TCGCGACCTG ATGCAAAGCG TAATCGACGT CCCGAAAAAG
CACGGCAAAG GCGTGTTCTA TTGGGAACTG GCCTGGATAA CGCCGGCGGG AAATACCTGG
GCCACCGAAG CCGGCATGAA TTATATCAAC GACCACTGGA AATTGGGCAA CGCCCGTGAA
AATCAGGCGT TATTTAATTG CCAGGGGGAG GTGTTGCCTT CGATAAAAGC CTTTAAATAA
 
Protein sequence
MKKMIPTLLA VSLSLGAMPL MAAESVVIKP LRNAPADFIK GADISTLLEV ERQGGVFYDE 
NHVRVDPVAL LKKNGVNYIR LRLWVDPHDA AGRPYGGGDN DLATTLALAK RVKAAGMKLL
LDFHYSDFWT DPGKQFKPKA WANLSYEQLK TAVHDYTRDT IARFKREGVL PDMVQIGNEA
NGGILWPEGK SWGQGGGEFD RLAGLLNAAI AGLRENLSSP GQVKIMLHLA EGTKNDTFRW
WFDEITQRGV PFDVIGLSMY TYWDGPISSL KANMDDISQR YNKDVIVVEA AYGYTLANCD
NAENSFGEKE AAAGGYPATV QGQADFIRDL MQSVIDVPKK HGKGVFYWEL AWITPAGNTW
ATEAGMNYIN DHWKLGNARE NQALFNCQGE VLPSIKAFK
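
To cross-check the gene sequence against the protein sequence and the GC content field, something like the following Python sketch could be used. It assumes the standard codon assignments, which translation table 11 shares for elongation codons; the alternative start codons that table 11 permits are not handled, and this gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first 21 bases of the gene are used in the usage line.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard NCBI layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Strip the whitespace used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence listed above.
print(translate("ATGAAAAAAA TGATACCCAC C"))  # prints: MKKMIPT
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the protein sequence and the GC content reported in the record.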