Gene Dd1591_3603 details


Gene Information

Locus tag: Dd1591_3603
Symbol:
ID: 8117434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dickeya zeae Ech1591
Kingdom: Bacteria
Replicon accession: NC_012912
Strand:
Start bp: 4082928
End bp: 4084556
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 57%
IMG OID: 644853977
Product: extracellular solute-binding protein family 5
Protein accession: YP_003005890
Protein GI: 251791169
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
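
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(4082928, 4084556)
print(length)                     # 1629
print(protein_length_aa(length))  # 542
```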


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGACAC ACTCTGTGGC AAACAAAATA TCACGACACG CCTTACTCAC CGCAGGGTTG 
CTGTCCCTGG CGGTAAGCGC ATCGCCTTTC GCCAACGCGG CAGACGCGAC CACAACCCCG
GTGCAGGGCG GTACGCTGAA TATCGGGCTC GGCAGCGACA CCCCGGTGAT CGACCCCTCG
ATTACCGCTT ACTCCGTGGC GGCGCTGGTC GCTCGCAACG TGGTAGATTC GCTGGTTGGT
CAGGCGGAAG ACAACCGTTT TACCCCCTGG CTGGCGGAAC GCTGGGAAAT CAACGACAAC
AACACCCGTT ATACTTTTCA CCTGCGTAAA GACGTCACGT TCAGTGACGG CACTAAACTG
GACGCAGCGG CGGTGAAATA CAATCTGGAC CGTATTCTCG ACCCGAAAAC CACCTCCAGT
TACGCGAAAT CCCTACTGGG GCCGATCGAT AACATCGCCA CGCCAGACGA CTACACGGTG
GTGATCAGCT ATAAGAGCCC GTTTGCCGCA CTGTTACAGG GGCTGAGCCT GCCGTATTTA
GGCATTCAGT CGTCGACGTA CCTGAAAAAT ACCCCGAATA CCAGCAACAC GCTGGTCGGC
TCCGGGCCGT TTATTCTGGA ATCGTTCGTC AAAGGCAGCG GTAGCCGCCT GAAAAAACGT
CCGGATTACC ACTGGGGGCC GGGTTATGCC GCGCATACCG GCCCGGCGTA TCTGGATAAA
ATCGAATTCA AATACCTGCC GGAATCGTCC GTGCGCCTTG GCGCATTGAG CAGCGGCCAG
GTACAGGCGA TTGACGCCGT GCCGCCGGCC AATGCCGCTG CACTGAAAAA AGATACCCGC
CTGGATGTGA TCACCCGAGA AAACCCCGGC GTTAACCGTG TGCTCTACCT GAACACCTCT
AAAGGCCCGT TCCAGGACGT CAACGTACGC CGTGCCTTCT TACATGCGGT GGATGCCGCC
TCGGCAACGA AAGTCGCGTT CTTCGGCACG CTGAAAGCCG CGGATAGCGT TCTCGGCCCG
TCCACGCTGT ATTACGACAA ATCCGCCGCC GCACTGGGGG GGTTTGACCT GAAAAAAGCC
AATCAGTTGC TGGACGAGGC CGGCTGGAAA ACCCAAGACG GCGAAGGTTA CCGCACCAAA
GACGGCAAAC GCCTGACCGT GCATTTCGTC TACAGTACCG GTTCGTCGGA AGCGGCGGAG
ATAACCCTGT TCCAGGCGGT ACAGTTCCAG GTGAAAAAAG CCGGTATCGA CATCCAGCTC
AATCCGGTGG ACAGCGGAGG CTTTACCAGC CGCACCAACG ATAACGACTA CGACATCGCT
TCTAACTACT TTGTACGCGC CGAACCGGAT ATTCTGCGCA CGGTGTTCGA CTCTAACTAC
ATTCCGCCCA ACGGCAACAA CTTCACCCGC ACCCACTTGC TGGATGACAA GCTGAGAAAA
GCCATCGGCG CCGGTGATGC CGAACGCCAG CAGTTGTATA GCGAAATCCA GCGTGAACTG
CTTGATCAGG CTTACGCGGT GCCGCTGTTC GTTCCAGCCT ACCAGTTGGG CCTGTCGAAG
AAAGTGCAGG GCATTAGCTG GGCCACCAAC GCCAAACCGA ACTTCTATGA TGTCTGGATT
AAACCGTAA
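
The record reports a GC content of 57%, presumably computed over the gene sequence. The following is a hedged sketch of how such a value could be recomputed from a sequence written in space-separated blocks of ten bases, as it is here; `gc_fraction` is a hypothetical helper, and the rounding used by the source database is not known.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence: 12 of the 20 bases are G or C.
print(gc_fraction("ATGGCGACAC ACTCTGTGGC"))  # 0.6
```

Passing the full gene sequence and multiplying by 100 gives a percentage that can be compared with the reported value.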
 
Protein sequence
MATHSVANKI SRHALLTAGL LSLAVSASPF ANAADATTTP VQGGTLNIGL GSDTPVIDPS 
ITAYSVAALV ARNVVDSLVG QAEDNRFTPW LAERWEINDN NTRYTFHLRK DVTFSDGTKL
DAAAVKYNLD RILDPKTTSS YAKSLLGPID NIATPDDYTV VISYKSPFAA LLQGLSLPYL
GIQSSTYLKN TPNTSNTLVG SGPFILESFV KGSGSRLKKR PDYHWGPGYA AHTGPAYLDK
IEFKYLPESS VRLGALSSGQ VQAIDAVPPA NAAALKKDTR LDVITRENPG VNRVLYLNTS
KGPFQDVNVR RAFLHAVDAA SATKVAFFGT LKAADSVLGP STLYYDKSAA ALGGFDLKKA
NQLLDEAGWK TQDGEGYRTK DGKRLTVHFV YSTGSSEAAE ITLFQAVQFQ VKKAGIDIQL
NPVDSGGFTS RTNDNDYDIA SNYFVRAEPD ILRTVFDSNY IPPNGNNFTR THLLDDKLRK
AIGAGDAERQ QLYSEIQREL LDQAYAVPLF VPAYQLGLSK KVQGISWATN AKPNFYDVWI
KP
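
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. Table 11 additionally permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop block spacing and newlines
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGGCGACAC ACTCTGTGGC AAACAAAATA TCACGACACG CCTTACTCAC CGCAGGGTTG"
print(translate_cds(first_line))   # MATHSVANKISRHALLTAGL
print(translate_cds("AAACCGTAA"))  # KP
```

The two calls translate the first and last lines of the gene sequence, which correspond to the start and end of the protein sequence shown above.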