Gene Dshi_3384 details

Gene Information

Locus tag: Dshi_3384
Symbol: epsH
ID: 5712442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3560203
End bp: 3561804
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 58%
IMG OID: 641269313
Product: transmembrane exosortase
Protein accession: YP_001534718
Protein GI: 159045924
COG category:
COG ID:
TIGRFAM ID: [TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
            [TIGR02914] EpsI family protein
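
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes a terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3560203
end_bp = 3561804
gene_length_bp = 1602
protein_length_aa = 533


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is assumed to be a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions the record is internally consistent: 3561804 - 3560203 + 1 = 1602 bp, and 1602 / 3 - 1 = 533 aa.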


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGACGT CTACGGATTT TCTAGGTCGT GGCCCATCTG GCCTGAAAGT AAATTTTATC 
GGGCTGACGT CCTTTGTGCT GCTTCTGGTG GTATCGCTGC CTTTGTTCTG GTCGGGGTTC
GTGTCCCTCG CACGGGCGTG GTCGACACCC GAGTATAGCC ATGGTCCGCT GATCCCTCTG
ATTTCGCTCT ATCTGTTCTT GCGCGAGCTG CGGGACAAGC CCGCAGCCAC ACCGGGCGAG
GCTGTGAACC GCTGGCCCGG TGTCCTTGTC ATCGGACTGG GTCTTCTGAT TGCGCTTGCT
GGAAACCTCT CCCGGATTCC GGACATCATC ACTTACGGGT TCATCATATG GACCGGCGGG
GTCGTGCTGA CGGTGTTCGG TTGGAAACAG GGTATCAGGC ACCAGCTGCC TGTGTTCCAC
CTAATCTTCA TGCTGCCGTT GCCGCAGTTC ATGTACTGGC ACATGACCAT ATTTCTGCAG
GGTATATCTT CGGTGGTTGG GGTCTGGTTC GTGCAGTTGG CGGGCGTGCC CGTGTTCCTC
GAAGGCAACA TTATCGACCT CGGGGTCTAC AAGCTGCAGG TTGCCGAGGC GTGTTCCGGT
TTGCGGTATC TCTTCCCCAT CTTGTCCTTC TCCTACCTGT TCGCGATCCT GTACCGCGGT
CCTTTCTGGC ACAAGATCGT GCTGCTGGTC ATGGCGGCAC CGCTCGCAGT GTTCATGAAC
TCCGTTCGGA TCGGGATCAT CGGCATCATG GTCAATTCCT ACGGCATCCA GCACGCCGAG
GGCTTCATGC ACTTCTTCGA GGGCTGGGTG ATCTTTGGAT CCTGCATCGC GATCCTCTTC
CTGACTGCAA TTGCCCTGCA GCGTCTGACA CCGAACCCCC TGCCGTTGAG CGAAGCGATC
GACCTTGACT TCAACGGGCT TGGAGGGATC TCCGCCCGGA TCACGCAGAT CAGCAATTCG
ACCCAGATGA TGGTGGCGAC CGGCATCACG GTGGTGTTCT CGGCCCTGCT TCTGGCGCTC
AACCCAGGCG CAGGCGCACC CGTTGAACGC GATCCATTCG CGATCTTCCC GCGCACGATA
GAAGACTGGC GGGGGGTGCA GGTGCCGCTT GACACTCAAG TCGAGCAAGT CTTGGGCGCC
GACGATTATA TAAATGCCAC TTACCAGAAC AGGGTCACGG ACACCGCGAT TAACTTCTTC
GTCGCGTTCT ATAACAACCA AACCGAAGGG AGCGGTATTC ATTCGCCGGA GGTTTGTTTG
CCCGCGGGTG GCTGGGAGGT GTTTTCCCTG GAGCCATATG AAGTCTCGTT CCCGGATACG
GAATATGGCA CGTTCGAACT GAACCGCGCC GTCATCCAGA AGGGCCTCGT GAAGCAGCTT
GTCTACTACT GGTTTGAGCA GCGCGGGAAG CGCCTGACGA ATGATTATGT GGCGAAGGCC
TACGTGGTCT ACGATGGGCT GATGCACGGT CGGACCGAGG GGGCGATGGT CCGTTTCGTC
ACCCCGATTT ACGGAGATGA AGACGAGGCT CTGGCGGATC AGCGTTTGCA AGAATTCATG
GAAGAGGCAC TGGTCAAGCT GCCGCGCTTC GTTCCCCTTT GA
 
Protein sequence
MSTSTDFLGR GPSGLKVNFI GLTSFVLLLV VSLPLFWSGF VSLARAWSTP EYSHGPLIPL 
ISLYLFLREL RDKPAATPGE AVNRWPGVLV IGLGLLIALA GNLSRIPDII TYGFIIWTGG
VVLTVFGWKQ GIRHQLPVFH LIFMLPLPQF MYWHMTIFLQ GISSVVGVWF VQLAGVPVFL
EGNIIDLGVY KLQVAEACSG LRYLFPILSF SYLFAILYRG PFWHKIVLLV MAAPLAVFMN
SVRIGIIGIM VNSYGIQHAE GFMHFFEGWV IFGSCIAILF LTAIALQRLT PNPLPLSEAI
DLDFNGLGGI SARITQISNS TQMMVATGIT VVFSALLLAL NPGAGAPVER DPFAIFPRTI
EDWRGVQVPL DTQVEQVLGA DDYINATYQN RVTDTAINFF VAFYNNQTEG SGIHSPEVCL
PAGGWEVFSL EPYEVSFPDT EYGTFELNRA VIQKGLVKQL VYYWFEQRGK RLTNDYVAKA
YVVYDGLMHG RTEGAMVRFV TPIYGDEDEA LADQRLQEFM EEALVKLPRF VPL
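
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: `translate` and `gc_fraction` are hypothetical helper names, and the codon table uses the standard amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons; this gene starts with ATG, so that difference does not matter here).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGTCGACGT CTACGGATTT TCTAGGTCGT"))  # prints: MSTSTDFLGR
```

Passing the full gene sequence to `translate` should give a string comparable with the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.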