Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dd1591_3746 |
Symbol | |
ID | 8120745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya zeae Ech1591 |
Kingdom | Bacteria |
Replicon accession | NC_012912 |
Strand | - |
Start bp | 4236082 |
End bp | 4237623 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644854114 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003006026 |
Protein GI | 251791305 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGTC GTCATTTTAT TAAAAGTGCC TGCGCCTTAT CCGTTGCCGC AGCCGCTACC GGCTGGCCGC TGAGCGCCGC CTGGGGCGCC GAAGCCGCCG AACAACCGAA AAAAGGCGGT CACCTGATTG TCGGGGTGGA TAACGCCTCC AGCACCGACC GTCTTGACCC GGCGTTTTGG TTTGAAACCT ATATGTATTT CGTGGGCTCC CAACTGTTTA ATAACCTGCT GGAGCTGGAC GAAAAAGGCG AGCTGGTGCC GTCGCTGGCC GAATCCTGGG ACAGCAAGGA CGGCGGGCGA ACCTGGGTGT TGAATATCCG TAAAGGCGTG CAGTTCCATG ATGGCCGCAC GCTCAGCGCC AAAGACGTTA TCTATTCGCT TAACCATCAC CGCGGCGAGC AGTCCAGCTC GTCGGTGAAA GGGTATCTGG ACCCGGTGGT GGCGATGGAT GCGACCGACA GCCATCAAGT GACTATCCGC CTGAGCGAGC CGAACGTGGA GTTCGTCGCG CTGCTGAGCG ACGTACACTT CGCCATTACC CCGGAAAACG AGAACTTCGA CAAAGGCATC GGCACCGGCG CTTTTATTCT GGAGAGCTTC CAGCCGGGTG TGCGCACGTT GGTGAAACGC AACCCGAATC ACTGGAACAG CGCGCGCGGC CATGTCGATT CGGTCGAAAC GCTGGCGATG AACGACTCTA CCGCCCGGGT GGCGGCGTTG GTGAGCGGGT CGGCGCACAT TATCAACCGC GTGAACCCGC GTATCGTCGG GCGCATTCAG AACATGCCGA CCTTGCAACT GCTGCGTTCG CGCGACAGCC AGATTTTTAC CTTCCCGGGT TTGAGCAGTG TGGCGCCGTT TAACAACGAA GACGGCCGAC TGGCGCTGAA ATACGCCATC GATCGCCAGC AGATTATCGA TACCGTGCTG GGTGGTTACG CCAGCGTGGC GAACGACAAC CCTATTTTCC CGTCCAACCG CTATTTCGCC AAAGACATTC CCCAGCGTCC GTACGACCCG GAGAAAGCCA AATGGCACTG GCAGAAAGCC GGATTCAGCG GTCCGCTGAC GTTGTCCGTC GCCGATGCCG GTTTCCCCGG CGCGGTAGAT GCCGGTCAAC TGTATCAGGC GTCGGCGCAG AAAGCCGGTA TCCCGCTGAA TGTGGAGCGT GTGCCGGATG ATGCATTCTG GGACAATGTA TGGATGAAAA AACCGTTTGT TTCCTCTAAC TGGTCGGTAC GCCCGACTGC CGATGCGCTG CTGTCGCTGG TGTTCACCAG CCAGGCGCCG TGGAACGAAT CCGGCTGGAA GAATGATGCG TTCGACCAAC TGGTACGTGC GGCACGCGGC GAAGTGAACG AAGAGAAACG CCGCCAAATC TACCATGACA TTCAGGTGAT GCTGGTGGAT CAGAGCAGCG AAATCATCCC GCTGTATGCT GATGCGCTCG ACGCCTGCAG TACCAAAGTG AAAGGGCTGA ATGCCATTCC GGGCTTCCCG TTGAGCGGCA ACCGCGCAGC GGAAAAAGTG TGGCTGGCCT GA
|
Protein sequence | MNRRHFIKSA CALSVAAAAT GWPLSAAWGA EAAEQPKKGG HLIVGVDNAS STDRLDPAFW FETYMYFVGS QLFNNLLELD EKGELVPSLA ESWDSKDGGR TWVLNIRKGV QFHDGRTLSA KDVIYSLNHH RGEQSSSSVK GYLDPVVAMD ATDSHQVTIR LSEPNVEFVA LLSDVHFAIT PENENFDKGI GTGAFILESF QPGVRTLVKR NPNHWNSARG HVDSVETLAM NDSTARVAAL VSGSAHIINR VNPRIVGRIQ NMPTLQLLRS RDSQIFTFPG LSSVAPFNNE DGRLALKYAI DRQQIIDTVL GGYASVANDN PIFPSNRYFA KDIPQRPYDP EKAKWHWQKA GFSGPLTLSV ADAGFPGAVD AGQLYQASAQ KAGIPLNVER VPDDAFWDNV WMKKPFVSSN WSVRPTADAL LSLVFTSQAP WNESGWKNDA FDQLVRAARG EVNEEKRRQI YHDIQVMLVD QSSEIIPLYA DALDACSTKV KGLNAIPGFP LSGNRAAEKV WLA
|
| |