Gene Dshi_0897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0897 
Symbol 
ID5710587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp915495 
End bp918620 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content67% 
IMG OID641266807 
Productvirulence protein SrfB 
Protein accessionYP_001532243 
Protein GI159043449 
COG category[S] Function unknown 
COG ID[COG4457] Uncharacterized protein conserved in bacteria, putative virulence factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.112933 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.362744 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCAAG ACCGCGCCAA GCGGCTGAGA CGGCTGGTAT CCTTGGGCGA CGAGATCACG 
CTGGTGCCCT ATTCTGGCAT CCAGATCCTC GATTTCGGGT TCGATATCAA CGCGCTGGAT
ATCCGGTCCT TTCGTTTCAT CGAACGCCCC GAGGGCGCGG CCGAGGGCCG TCATCGCACG
CTCTATCCCC TGACCGGGGA GGCTGAGCGC GATGCGCCGA TCCTCGCGGC CACCACGCCG
GAGGATGACG AGTATTCCGT CCGGCCCCTC GCCGCACTGG AGCCGTTTCT GGAGAAATGG
GTGCCGATCC CGGTGCTGCG GTTGAAAAAC CAGCGCGGGG CAGGGGGCGA GGAGCTTTAC
GATCCCGGCC CGTCCTCCTG GGCGCGGCTG CGGACGGTGG AATTGCCCCA GCCCGATCCG
GAGACCGGCC ATACCCACCG GGTGCAGATC GCGCTCGACA CCGCGCTCAG CGACCAGGAC
CAGAGCGCCC ACTATGTCGC CCCCGAACGC GCCGACAGCG AGAAGCCGCG CGAGTTCCGG
CTGGTCTCTG ACCCCGGTGC GATGAGCTGG TTCCTGCAGC GGCTGGAGGC GGACGAGGAC
GGCAATGCGG TCGATCTGCA GCTCTGGGTG TCGGACTGGC TCAAGGAGAT GTTCCTCGAC
TTCAAACGGG CCGAACGCCC CGGGCGATCG ATCTCGGAAG AAAACCTGCC GCATATGTTC
GAGCATTGGG CGCGCTACCT GTCGTACCTT CAGGTGATTC AGCGCGCCGT GGCCCCGCCC
AAGATGCGCT TTGCCAACAC CGTCGCGCCG CGCGATGCGG TCGCCCCGGT CGAGGTCGAC
CTGGTGCTCG ACATCGGCAA CTCGCGGACC TGCGGCATTC TGATCGAGCG CTTCCCGGGC
GAGACGCGGG TCGACCTGAC CCGGTCCTTC CCGCTGGAGA TCCGCGACCT GTCTCGTCCG
GAGTTCCATT ATTCGGGCCT CTTCGAGAGC CGGGTGGAGT TCGCCGACCT GCGCTTCGGC
GACGAGCGCT ACGCGTCGCG GTCGGGTCGG CGCAACGCGT TCATGTGGCC CAGCTTCGTG
CGCATGGGGC CCGAAGCCGT GCGGCTCGTG CAGGCCGAGG AGGGGACAGA GACCCTGTCG
GGCCTGTCCT CGCCCAAGCG GTACCTTTGG GATGATGACG CGGTGTTGCA GGACTGGCGG
TTCCAGAACC ACCACGACCC CAACAACCTG CCGCGCCCGG TGCGCGCGGC CATGCGGCAC
CTGAACGAGG CGGGCGACGT GCTGGCCCAG GTCAAGACCG AGATCGGGCT CAACCTGCGC
AAGCCGAAAA AGACCACGCC GCTGACCCCG GCGATCCGGC CGCGGTTCTC ACGCTCCTCG
CTTTTCGGGT TCATGCTGGC CGAGGTCATT GCCCATGCCA TGGTGCAGAT CAACGATCCG
GCCTCGCGCT CGCGCCGGTC GCAGTCTGAC CTGCCGCGGC GGCTCAACCG CGTGATCCTG
TCCCTGCCCA CGGCGACCTC GGTGCAGGAA CAGGCGATGA TCCGGTCGCG GGTCTCGGGC
GCGCTGACCC TGGTCAAGGA GATGCTCGGC ACCAAGGACG GCACCAGCAC CATCGCGGTC
GAGGGCAAGC CCGAACTGCT GGTGGATTGG GACGAGGCGA GCTGCACCCA GCTGGTCTAT
CTTTATTCCG AGCTGACCCA GAAATTCGAT GGCCGGATCG ACACGTTTCT CGACCTCAAG
GGACAGCCAC GTCCGGACCC GGCAGGCGGC GAGAGCCCGT CCCTGCGGCT GGCCTGCATC
GACGTGGGCG GGGGGACCAC GGACCTGATG GTCACCACCT ATCGCGGCGA GGACAACCGG
GTGCTGCATC CCGAGCAGAC CTTCCGCGAA GGGTTCCGCG TGGCGGGCGA CGACCTGGTG
CACCGGGTGA TCAGCGCCAT CGTCTTGCCG CGGCTGCAGG ATTCCATCGC GCAGGCGGGC
GGGCAGTTCG TGGCCGAGCG GATGCGGGAG CTTTTCGGCG GCGATATCGG CGGGCAGGAA
CAGCAGACCG TGCAACGGCG AAGGCAGTTT TCCATCCGGG TGCTGGTGCC GCTGGCCGAG
GCGATCCTGT CGGCCTGCGA GGATGCCGAG GAGGCCGACC GCATCGACAT CCCCGTGGCG
GATGTGCTGG GCCTCGTGCC CACGCCGGTT GGCGAAGAAG GCGATGAGGA AGGCCACGAA
GACGCGTCGC CGCAGGTGAC CGACGAGATC CTCGACTATC TCGAAAAGCC CGCGACGCAG
CTGGGCGCCG AGGGCTGGCG GCTTGCGGAC ATGGTCCTGA GCGCCAGCCG CGAAGACCTC
GACGCCATCG CGCGGGAGGT GTTCCAGAAG GTGCTCGGCA ACATGTGCGA GGTGATCGAC
CATCTCGGCT GCGATGTGGT GTTGCTGACG GGCCGTCCCT CGCGGCTGCC GGCGGTGCGC
GCCATCGTCG AGGAAATGCT CGTGGTCCCG CCCCATCGCC TGATCTCGAT GCACCGCTAC
AAGACCGGCA ACTGGTACCC GTTCCGCGAT CCGGTCAGCC AGCGGGTGGG CGATCCGAAA
TCCACCGTGG CCGTGGGCGG GATGCTGATT GCACTGTCCG AGAACCGCAT CCCGAACTTC
AAGGTCACCA CCGGCGCGTT CCAGATGAAA TCCACCGCCC GTTTCGTGGG CGAGATGGAC
ACGAACGGCC AGATCCCCGA GGGGCGCATG CTGTTCGAGG ATCTCGATCT GGATGCCAGG
AAATCCGCGC AGGATCCCAC GGCGATCGTC CGCATGCACT CGCCGGTCTA TATCGGCGCG
CGCCAGTTGC CGCTGGAGCG CTGGACGACC ACGCCGCTCT ACCGGCTCGA CTTCGCGAAT
GACAGCATCG CGGGCAAGAT CAAGCTGCCG GTTAAGGTCG AACTGGTCCG CGAGGACGAT
GATTTCGACG AGGCCGAGAC CAGCCTGGAA AAACTGCGCG CCGAACGGGT GCGCGAGGTG
TTCCGGGTGG ATGCGGCCGA GGATGCCGAG GGCACGATGA TCAAGAACGA CGACGTGGTG
CTGTCGCTGC ACACGCTCGG GTTCGAGGAT GAATACTGGC TCGATACCGG CGTGTTCCGG
ATCTGA
 
Protein sequence
MLQDRAKRLR RLVSLGDEIT LVPYSGIQIL DFGFDINALD IRSFRFIERP EGAAEGRHRT 
LYPLTGEAER DAPILAATTP EDDEYSVRPL AALEPFLEKW VPIPVLRLKN QRGAGGEELY
DPGPSSWARL RTVELPQPDP ETGHTHRVQI ALDTALSDQD QSAHYVAPER ADSEKPREFR
LVSDPGAMSW FLQRLEADED GNAVDLQLWV SDWLKEMFLD FKRAERPGRS ISEENLPHMF
EHWARYLSYL QVIQRAVAPP KMRFANTVAP RDAVAPVEVD LVLDIGNSRT CGILIERFPG
ETRVDLTRSF PLEIRDLSRP EFHYSGLFES RVEFADLRFG DERYASRSGR RNAFMWPSFV
RMGPEAVRLV QAEEGTETLS GLSSPKRYLW DDDAVLQDWR FQNHHDPNNL PRPVRAAMRH
LNEAGDVLAQ VKTEIGLNLR KPKKTTPLTP AIRPRFSRSS LFGFMLAEVI AHAMVQINDP
ASRSRRSQSD LPRRLNRVIL SLPTATSVQE QAMIRSRVSG ALTLVKEMLG TKDGTSTIAV
EGKPELLVDW DEASCTQLVY LYSELTQKFD GRIDTFLDLK GQPRPDPAGG ESPSLRLACI
DVGGGTTDLM VTTYRGEDNR VLHPEQTFRE GFRVAGDDLV HRVISAIVLP RLQDSIAQAG
GQFVAERMRE LFGGDIGGQE QQTVQRRRQF SIRVLVPLAE AILSACEDAE EADRIDIPVA
DVLGLVPTPV GEEGDEEGHE DASPQVTDEI LDYLEKPATQ LGAEGWRLAD MVLSASREDL
DAIAREVFQK VLGNMCEVID HLGCDVVLLT GRPSRLPAVR AIVEEMLVVP PHRLISMHRY
KTGNWYPFRD PVSQRVGDPK STVAVGGMLI ALSENRIPNF KVTTGAFQMK STARFVGEMD
TNGQIPEGRM LFEDLDLDAR KSAQDPTAIV RMHSPVYIGA RQLPLERWTT TPLYRLDFAN
DSIAGKIKLP VKVELVREDD DFDEAETSLE KLRAERVREV FRVDAAEDAE GTMIKNDDVV
LSLHTLGFED EYWLDTGVFR I