Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dd703_3443 |
Symbol | |
ID | 8089397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dickeya dadantii Ech703 |
Kingdom | Bacteria |
Replicon accession | NC_012880 |
Strand | + |
Start bp | 4011482 |
End bp | 4012642 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644837521 |
Product | protein of unknown function DUF1501 |
Protein accession | YP_002989025 |
Protein GI | 242240844 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0821691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTACCC GACGTGGTTT TATTCAACTT GCCGCCGCAG GCGCGGCAAT GCTCATCGCG CCGAGGATCG TCTTCGCGCG CGCCGCCACC GACAGCCGTT TCATTTTTAT CATTCAGCGC GGCGCCGCCG ACGGGCTGAA CATCGTCATT CCCTATGCCG ATCCCGGTTA TACCGCGCTG CGCGGCGAAC TGGCGATAGA TACCGCCACG GCGATAAAAC TCGACGGTAC ATTCGCGCTG CACCCGTCGC TCAGTCAGAT CGGCAACCTG TATGCACAAC GCGAAGCGCT GTTCATACAC GCTATCGCCT CGCCCTATCG CGACCGTTCG CATTTCGACG GCCAGAATGT GCTGGAAACC GGCGGCTCCG CGCCGTTTCA AATTCGGGAC GGCTGGCTCA ACCGGCTGGT CGGCATGATA CACGGCGCCA CCCCCACGCC GCATGAAAAC GCCATTGCTT TCGCCCCGAC GATACCGCTG GCGCTGCGCG GCACAACGGA CATCGCTTCC TACGCCCCGT CGGGGTTGCC CCGCGCGCCG GAGGATTTGC TGATGCGCGT ATCGCAGCTC TACGCGGAAG ATGCGCAGTT GCATGCGTTG TGGGAATCGG CGCTGGCGAC CCGCGGTCTG GTGGGTAACA GCGAACCGCG TCAGGATCCG GCCAGCATCG GTAAAATGGC GGCGAATATT TTGTCCCGCG CCGACGGGCC GCGCATCGCC ATGCTGGAAA CCAGCGGCTG GGATACGCAC AGCGCCCAGA CGCCCCGGCT GGCTTCGCAA CTCAAAGCGC TGGATATGTT GATCGCCGCC CTACGCGACG GACTCGGCCC GGTATGGAAC AACACGGTGA TACTGGTCGC CACCGAGTTC GGCCGCACCG TCGCGACCAA TGGCACCGAC GGCACCGATC ACGGCACCGC CTCGGCGGCA ATGGTCATCG GCGGCGCGGT CTCGGGCGGA CGGATCATGG CTGACTGGCC GGGACTACGA CCGGGCGATC TGTACGAATC GCGCGATCTG AAACCCACCG CCTCGCTGGA CGCACTGATC GCCGGCCTCG CCAGCGAAAG TTTTCACCTC GATCCTGGGC GCATCACCCG AACCTTATTC GCCCGCTCGC CCGGTATCAC GCCGATGACG GGATTGCTGC GTTCGTCGTA G
|
Protein sequence | MITRRGFIQL AAAGAAMLIA PRIVFARAAT DSRFIFIIQR GAADGLNIVI PYADPGYTAL RGELAIDTAT AIKLDGTFAL HPSLSQIGNL YAQREALFIH AIASPYRDRS HFDGQNVLET GGSAPFQIRD GWLNRLVGMI HGATPTPHEN AIAFAPTIPL ALRGTTDIAS YAPSGLPRAP EDLLMRVSQL YAEDAQLHAL WESALATRGL VGNSEPRQDP ASIGKMAANI LSRADGPRIA MLETSGWDTH SAQTPRLASQ LKALDMLIAA LRDGLGPVWN NTVILVATEF GRTVATNGTD GTDHGTASAA MVIGGAVSGG RIMADWPGLR PGDLYESRDL KPTASLDALI AGLASESFHL DPGRITRTLF ARSPGITPMT GLLRSS
|
| |