Gene Dd1591_3993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDd1591_3993 
Symbol 
ID8119592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDickeya zeae Ech1591 
KingdomBacteria 
Replicon accessionNC_012912 
Strand
Start bp4511000 
End bp4512595 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content57% 
IMG OID644854372 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003006272 
Protein GI251791551 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTACCA GAAGACGCTT TCTTGCCGGT TGTGCTACCG TGCCTGTCTT GTCCTGGCTT 
AATTTGAACA CCGCGTTCGC CGATACGCCG CCATCGATGC TGGTGATGGC GATGCAGCTT
GATAACATGA CCAGTCTCGA CCCGCAGGAA GGGTTTGAGA CGGTGGGAAC CGAAATCATC
GGTAACCTGT ACCAACGTTT GGTGATGCCG AACCCAGCCA ATCCGCAAGA GGTGATCGGC
GATCTGGCCG CCAGTTGGGA AGTCGGCAAC GACAGCAAAA CCTTCACTTT CCATCTCAAT
CCGCAAGCCA AATTCGCCGA CGGCACACCG GTGACCGCCG ACGACGCCGC CTTCTCGTTA
CAACGCGCGG TTAAGCTGGA TAAAAGCCCG GCGTTCATCA TCAACCAGTT CGGTTTTACC
AAAGACAACG TGGAGCAGCA CATTACCGCG CCGGATGAAA AAACGCTGGT GATAAGCCTC
GACAAACCGG CGGCGGAAAC CTTCCTGCTG TATTGCCTGT CGGCCCCGGT GAGCAGCATC
GTACAGAAAA AAGCCGCGCT GGCTAACCAG CAAAATAACG ATCTGGGTAA CCAGTGGTTG
AAGCAGAATA GCGCCGGTTC CGGCCCTTTC TCGCTGGTGA GCTGGAAAGC CAGCGAAAGT
ATTATCCTGC AGAAAAACGA TCACTTCCCG GCGGATAACG CCTTTAAGCG CGTGCTGCTC
AAGCACATTG TCGACCCGTC CGCCCAGTTG CTGATGCTGC AAAAAGGGGA TGTAGATATC
GCCCGCAACC TGACCACCGA GCAAATTCGC CCGCTGGTGA ACGACAGTAA CTACCATCTG
GTGCGCCAGA GCATCGCCAG CGTGATGCTG CTGTCGTGCA ACACCGCCAA CGAGTTTCTG
AAAAAGCCGC AGGTGTGGCA GGCCATCAAA TGGGCGCTGG ACTATGACGG CATTCAGAAA
AATATTCTGC CGCTCACGCA CAAAGTCCAT CAGAGCTTCC TGCCGGGCGG CTTTCCGGCG
GCGCTGAACG ATACCCCGTT TCATATGGAT GTCGCCAAAG CCAAAGCGTT GCTGAAAGAC
GCCGGTTATC CGGATGGCTT CGACATTACG CTGGATCACT ACTCCGCCCA GCCGTACCCG
GATATCGCGC AGGCAGTCCA GACCCAATTG GGTGCCATCG GCATCCGGGT GAAACTGATT
GCGGCGGAAA ACCGTCAGGT ACTGACCAAA ATGCGTGCCC GCCAGCAGCA ACTGGCGCTG
ACCGCGTGGG GCGCTGACTA TTTCGACCCG AACTCCAACG CCGAAGCCTT CTGCATCAAC
ACCGACAACA GCGACGGCGC CCGCAACCGC ACGCTGGCGT GGCGCTGCAA CTGGTCGGAC
GAAAAATTCA ATCAGTTGAC CGAACAGGCG CTGCACGAGC AGGACCCGGC CAAACGCATC
GCGCTGTATG AAACTCTGCA ACGCAACCAC CGCGAGCAGA GCCCGTTCAC GCTGATGATG
CAGGATGAGA AAACGCTGGC TTGCCGCAAG AATCTCAGCG GCGTCACCAT GACGGTGTTG
AGCAAGGTGC CCTACCAGCA GGTGAAGAAA GCCTGA
 
Protein sequence
MVTRRRFLAG CATVPVLSWL NLNTAFADTP PSMLVMAMQL DNMTSLDPQE GFETVGTEII 
GNLYQRLVMP NPANPQEVIG DLAASWEVGN DSKTFTFHLN PQAKFADGTP VTADDAAFSL
QRAVKLDKSP AFIINQFGFT KDNVEQHITA PDEKTLVISL DKPAAETFLL YCLSAPVSSI
VQKKAALANQ QNNDLGNQWL KQNSAGSGPF SLVSWKASES IILQKNDHFP ADNAFKRVLL
KHIVDPSAQL LMLQKGDVDI ARNLTTEQIR PLVNDSNYHL VRQSIASVML LSCNTANEFL
KKPQVWQAIK WALDYDGIQK NILPLTHKVH QSFLPGGFPA ALNDTPFHMD VAKAKALLKD
AGYPDGFDIT LDHYSAQPYP DIAQAVQTQL GAIGIRVKLI AAENRQVLTK MRARQQQLAL
TAWGADYFDP NSNAEAFCIN TDNSDGARNR TLAWRCNWSD EKFNQLTEQA LHEQDPAKRI
ALYETLQRNH REQSPFTLMM QDEKTLACRK NLSGVTMTVL SKVPYQQVKK A