Gene Dd1591_3900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDd1591_3900 
Symbol 
ID8118865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDickeya zeae Ech1591 
KingdomBacteria 
Replicon accessionNC_012912 
Strand
Start bp4412118 
End bp4413449 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content57% 
IMG OID644854279 
Productproline dipeptidase 
Protein accessionYP_003006179 
Protein GI251791458 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.022198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACGC TGACTTCTTT GTATCATCAA CATGTGGCGA CTCTGCAGCA ACGCACGCAG 
GCGGTTCTGG CACGGCATAA TCTGGATGCC TTATTGATCC ACTCCGGTGA GTTGATGATG
GCGTTTCTGG ATGATCATGC TTATCCGTTC AAAGTTAACC CGCAGTTCAA AGCCTGGCTG
CCGGTGACGC AAGTGCCGAA CTGCTGGCTG TGGGTGGATG GAGTCAATAC GCCGAAGCTG
TGGTTCTACT CCCCCGTTGA TTACTGGCAT AACGTGGCGC CGGTGCCGGA CAGTTTTTGG
ACCACATCGC TGGACATCCA GGTACTGCGC AAGGCCGATG ACATCGTCCG GCAATTGCCG
GTTCAACGCC AGCGTGTCGC TTACATCGGT TCTGCGCCGC AACGGGCGTT GAATCTGGGC
GTGGCGTCGG AACACATCAA CCCGAAAGGC GTGCTGGATT ATCTGCATTA CTACCGCGCC
TACAAAACGG ATTACGAACT GGCCTGCCTG CGTGAAGCGC AGAAAACGGC GGTGGTCGGC
CACCACGCCG CATACGAAGC GTTCCAGTCC GGCATGAGCG AATTTGACAT CAATCTGGCG
TACCTGACCG CCACCGGTCA CCGTGATACC GATGTGCCTT ATGGCAATAT CGTCGCTCTC
AACGAGCACG CGGCGGTGCT GCACTATACT CAACTTGAAC ACCGGGTGCC GATGGAAATG
CGCAGTTTCC TGCTGGATGC CGGCGCAGAA TATAACGGCT ATGCGGCGGA CATTACCCGT
ACCTATGCCG CGCAGCATGA TAATGACTAT GCTGCGCTGG TAAAAGACCT GAACCGCGAG
CAACTGGCGC TGATAGATAC CCTGAAGGCC GGCGTGCGTT ATACCGACTA CCATTTGCAG
ATGCATCGCC GGGTGGCGGC GTTGCTTAAA CGTCATCAAC TGGTGACCGG GCTGAGCGAA
GAAGCGATGG TGGAACAGAG CGTGACCTCG CCGTTCCTGC CGCACGGTCT GGGCCATCCG
CTCGGTTTGC AGGTGCACGA CGTCGGCGGA TTTATGCAGG ACGACGCCGG CACGACGCTG
CCTGCGCCAT CAGCCCATCC CTACCTGCGC TGTACCCGGA TTCTGGAGCC GCGCATGGTG
CTGACTATCG AACCGGGTAT CTATTTCATC GATTCGTTGC TTGAGCCCTG GCGTCAGGGC
GAGCTACGCC AGCATTTCAA CTGGCAGAAG CTGGATGCGT TGCGTCCGTT CGGCGGTATT
CGTATTGAAG ACAATATCGT GGTTCATGAC AAACGCATCG AAAACCTGAC CCGCGCGCTT
GATCTGGCCT GA
 
Protein sequence
METLTSLYHQ HVATLQQRTQ AVLARHNLDA LLIHSGELMM AFLDDHAYPF KVNPQFKAWL 
PVTQVPNCWL WVDGVNTPKL WFYSPVDYWH NVAPVPDSFW TTSLDIQVLR KADDIVRQLP
VQRQRVAYIG SAPQRALNLG VASEHINPKG VLDYLHYYRA YKTDYELACL REAQKTAVVG
HHAAYEAFQS GMSEFDINLA YLTATGHRDT DVPYGNIVAL NEHAAVLHYT QLEHRVPMEM
RSFLLDAGAE YNGYAADITR TYAAQHDNDY AALVKDLNRE QLALIDTLKA GVRYTDYHLQ
MHRRVAALLK RHQLVTGLSE EAMVEQSVTS PFLPHGLGHP LGLQVHDVGG FMQDDAGTTL
PAPSAHPYLR CTRILEPRMV LTIEPGIYFI DSLLEPWRQG ELRQHFNWQK LDALRPFGGI
RIEDNIVVHD KRIENLTRAL DLA