Gene Dd1591_3669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDd1591_3669 
Symbol 
ID8117500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDickeya zeae Ech1591 
KingdomBacteria 
Replicon accessionNC_012912 
Strand
Start bp4156219 
End bp4157529 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content54% 
IMG OID644854043 
Productputative 5-methylcytosine restriction system component 
Protein accessionYP_003005956 
Protein GI251791235 
COG category[V] Defense mechanisms 
COG ID[COG4268] McrBC 5-methylcytosine restriction system component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGAGG TTATCTCGGT GTTTGAATAC GGCTGTGTCG GCGCGGCTCC GGTTCGGCTG 
GCTGATGTCG CCGCGGTTCC ACCGGCGGTA TTCGACTATC TGGAATCGCT GGCGCTGGAT
GAACAGGGCT GCCCGTTCCT GCGTCTGACA TCACGCAAAG GGCATCGGCT GATTCAGGTG
CAGAACTATG CCGGGGTGCT TGCCACGCCC TTTGGCGTGC AACTGGAGAT CTTGCCTAAA
ATTGGGCGGA CGTCCCCCCC TGAACAAGCA CGGCATGTGT TGCTGGCTAT GCTGGCGGTA
TTACCGGATT TTCGGCATAT CGAGACAGAG CAGGCGTTGG TGCAGGTGCA GCGGATGACG
CTGCTGGAAA TCTTCATCAG CCAGTTTTTG CAGAGCGTCA GCCAACTGAT CAGGCAAGGG
TTGCGCTCCG ATTATGTGAG CCAACAAGGC AACCTGCCGT TTATCAAAGG TAAGCTGTTG
CTGCCTGAGC AGTTGCGCCG CAATAATGTG AATCGGCATA AGTTCTGGGT TGAATATGAA
GACTATTTAC CGGACTGTCC GGCAAATAGG TTATTACATT CGGCCCTTAA TTTGGTCAGC
CAGTGGCGGT TGTCGTCGGA AAATCAGCGT GAATGCCGGA TGCTGCGGTT TGTATTTGAT
GGCATTCCAC CTAGCCGGGA TATCGACAGT GATATCAGCA GGCTGCGTGT GGACCGCAAT
ATGGCGCATT ATCAGGCACC GCTGGCTTGG GCGAAACTGA TTCTGACCGG GATGAGCCCG
CGAACGTCGG CGGGCAGCGA GGGGGCGATA TCGCTGTTAT TTCCGATGGA AGCCGTGTTT
GAGGCGTTTG TGGCGCAAAC GTTGTTGGAA GAGATTCCGC CCGACCAGCA TCTGAAAGCT
CAGGTGGCGG AGCAGACCTT GGTAAGTTAC GCGGGTAGGG CGCGGTTCAA ATTACAGCCT
GATTTATTGC TCCAGTCACG CCACCCTGCC TGCAATCTGG CGGTGTTGGA TACCAAATGG
AAGTTGATCC GTGAACGACA GTGGCTCCGC GATGGGCAAC AAGGGGACAG GCTCCGTGGT
TTGTCCGAAT CCGATTTTTA TCAGATGTTT GCTTATGGGC AGCGCTATCT GGCCGATAAG
GGTGATATGT ACCTGATTTA TCCCGAGCAC GATGAATTTA CCCAGCCGCT CCCATCCCCT
TTTATTTTTT CGGAAACGTT ACGGTTGTGG GTGGTGCCGT ATCGTATTTC GGCATTAGAC
GGGCAGAGAA TGCAGTGGCC ACATCGAGAA TACAAGGCAG CGGTAAATTA A
 
Protein sequence
MHEVISVFEY GCVGAAPVRL ADVAAVPPAV FDYLESLALD EQGCPFLRLT SRKGHRLIQV 
QNYAGVLATP FGVQLEILPK IGRTSPPEQA RHVLLAMLAV LPDFRHIETE QALVQVQRMT
LLEIFISQFL QSVSQLIRQG LRSDYVSQQG NLPFIKGKLL LPEQLRRNNV NRHKFWVEYE
DYLPDCPANR LLHSALNLVS QWRLSSENQR ECRMLRFVFD GIPPSRDIDS DISRLRVDRN
MAHYQAPLAW AKLILTGMSP RTSAGSEGAI SLLFPMEAVF EAFVAQTLLE EIPPDQHLKA
QVAEQTLVSY AGRARFKLQP DLLLQSRHPA CNLAVLDTKW KLIRERQWLR DGQQGDRLRG
LSESDFYQMF AYGQRYLADK GDMYLIYPEH DEFTQPLPSP FIFSETLRLW VVPYRISALD
GQRMQWPHRE YKAAVN