Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3384 |
Symbol | epsH |
ID | 5712442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3560203 |
End bp | 3561804 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641269313 |
Product | transmembrane exosortase |
Protein accession | YP_001534718 |
Protein GI | 159045924 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02602] eight transmembrane protein EpsH (proposed exosortase) [TIGR02914] EpsI family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACGT CTACGGATTT TCTAGGTCGT GGCCCATCTG GCCTGAAAGT AAATTTTATC GGGCTGACGT CCTTTGTGCT GCTTCTGGTG GTATCGCTGC CTTTGTTCTG GTCGGGGTTC GTGTCCCTCG CACGGGCGTG GTCGACACCC GAGTATAGCC ATGGTCCGCT GATCCCTCTG ATTTCGCTCT ATCTGTTCTT GCGCGAGCTG CGGGACAAGC CCGCAGCCAC ACCGGGCGAG GCTGTGAACC GCTGGCCCGG TGTCCTTGTC ATCGGACTGG GTCTTCTGAT TGCGCTTGCT GGAAACCTCT CCCGGATTCC GGACATCATC ACTTACGGGT TCATCATATG GACCGGCGGG GTCGTGCTGA CGGTGTTCGG TTGGAAACAG GGTATCAGGC ACCAGCTGCC TGTGTTCCAC CTAATCTTCA TGCTGCCGTT GCCGCAGTTC ATGTACTGGC ACATGACCAT ATTTCTGCAG GGTATATCTT CGGTGGTTGG GGTCTGGTTC GTGCAGTTGG CGGGCGTGCC CGTGTTCCTC GAAGGCAACA TTATCGACCT CGGGGTCTAC AAGCTGCAGG TTGCCGAGGC GTGTTCCGGT TTGCGGTATC TCTTCCCCAT CTTGTCCTTC TCCTACCTGT TCGCGATCCT GTACCGCGGT CCTTTCTGGC ACAAGATCGT GCTGCTGGTC ATGGCGGCAC CGCTCGCAGT GTTCATGAAC TCCGTTCGGA TCGGGATCAT CGGCATCATG GTCAATTCCT ACGGCATCCA GCACGCCGAG GGCTTCATGC ACTTCTTCGA GGGCTGGGTG ATCTTTGGAT CCTGCATCGC GATCCTCTTC CTGACTGCAA TTGCCCTGCA GCGTCTGACA CCGAACCCCC TGCCGTTGAG CGAAGCGATC GACCTTGACT TCAACGGGCT TGGAGGGATC TCCGCCCGGA TCACGCAGAT CAGCAATTCG ACCCAGATGA TGGTGGCGAC CGGCATCACG GTGGTGTTCT CGGCCCTGCT TCTGGCGCTC AACCCAGGCG CAGGCGCACC CGTTGAACGC GATCCATTCG CGATCTTCCC GCGCACGATA GAAGACTGGC GGGGGGTGCA GGTGCCGCTT GACACTCAAG TCGAGCAAGT CTTGGGCGCC GACGATTATA TAAATGCCAC TTACCAGAAC AGGGTCACGG ACACCGCGAT TAACTTCTTC GTCGCGTTCT ATAACAACCA AACCGAAGGG AGCGGTATTC ATTCGCCGGA GGTTTGTTTG CCCGCGGGTG GCTGGGAGGT GTTTTCCCTG GAGCCATATG AAGTCTCGTT CCCGGATACG GAATATGGCA CGTTCGAACT GAACCGCGCC GTCATCCAGA AGGGCCTCGT GAAGCAGCTT GTCTACTACT GGTTTGAGCA GCGCGGGAAG CGCCTGACGA ATGATTATGT GGCGAAGGCC TACGTGGTCT ACGATGGGCT GATGCACGGT CGGACCGAGG GGGCGATGGT CCGTTTCGTC ACCCCGATTT ACGGAGATGA AGACGAGGCT CTGGCGGATC AGCGTTTGCA AGAATTCATG GAAGAGGCAC TGGTCAAGCT GCCGCGCTTC GTTCCCCTTT GA
|
Protein sequence | MSTSTDFLGR GPSGLKVNFI GLTSFVLLLV VSLPLFWSGF VSLARAWSTP EYSHGPLIPL ISLYLFLREL RDKPAATPGE AVNRWPGVLV IGLGLLIALA GNLSRIPDII TYGFIIWTGG VVLTVFGWKQ GIRHQLPVFH LIFMLPLPQF MYWHMTIFLQ GISSVVGVWF VQLAGVPVFL EGNIIDLGVY KLQVAEACSG LRYLFPILSF SYLFAILYRG PFWHKIVLLV MAAPLAVFMN SVRIGIIGIM VNSYGIQHAE GFMHFFEGWV IFGSCIAILF LTAIALQRLT PNPLPLSEAI DLDFNGLGGI SARITQISNS TQMMVATGIT VVFSALLLAL NPGAGAPVER DPFAIFPRTI EDWRGVQVPL DTQVEQVLGA DDYINATYQN RVTDTAINFF VAFYNNQTEG SGIHSPEVCL PAGGWEVFSL EPYEVSFPDT EYGTFELNRA VIQKGLVKQL VYYWFEQRGK RLTNDYVAKA YVVYDGLMHG RTEGAMVRFV TPIYGDEDEA LADQRLQEFM EEALVKLPRF VPL
|
| |