Gene Dshi_1919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1919 
Symbol 
ID5712912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2002691 
End bp2004421 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content48% 
IMG OID641267844 
Producthypothetical protein 
Protein accessionYP_001533262 
Protein GI159044468 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000840766 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value7.47349e-16 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGCAGTTA ACTCGCAGTT CCTGTTTTTC TATATCGATG AACTTGTTAA ACACCACTGG 
GTAGACTGGT TGGAAGGCCC GTTGGCCAAA GTGCCATCCG AAGAGCACAG AGGCGAGAGA
GGATTCCAGC TGGCGGTCGG GGACACTGGA GCAGGTCCTC GAACTCGTGC TGGAGAAGCT
TATCAACTCG GTATTCTCTC CGGTCGCATT GTTCGTCCTC GTATTGCGGG GGCGTCTTTC
CCATCTGGTT GGCAGGCGGT GATTGAGGGT CGCGTTCCCT CCACTAAGCT TGGCTACTAT
TCCTATGTCA TCGGAAGGGC GCTTCGCGAG AATCCTTCTG ATCCGAATCT TGATCGAGCA
CTAGACTGCC TCGCAAGTAT TCTTATCCTA CACATAGGCT TTCAATCTGA TGGTCGCTGG
TTCAATACAG TACTGCTTTT CCAGTATGCC AATAATCAAC TCTCAAGCTA CTTGGGACGA
CAGCTGAACG ATGATGAGGT CAGATATCTA GCTGCTGCGG TAATCAATAC ACTCGACAGT
GAGGAGAATG ATCTACTTTT GGATCGCTAT GTTGCTGACC TACCAATAGA GAGCCTCCGC
TTCGATTTGA ACGATGGAAA GTCTTTGCAT TTTGACCTAA ACGGACTTCC CGATATTGTT
TCAGACAAGA AAGTAGGGAT AAACGATTTT CAGAAAACTA AAGCAGCTCT CTTCTGGGCG
GATCCTTCTC GCGGAATCTA TATGCGCGCG GCGAGATCGC GAGGAATTCG CCTAATTCAA
GAATTGATCA AAGTAGATCA GAGGCTTGCT CAATCAGGCG AGAGCAATTT ACTCAACGAA
GTTTTGGATT CCATTCAAAA CAACGAAAAT AATCCGGGCA TTTTTTCGGT TGCAGAAGAC
GCATTTACAG ATTTTCCGGG TCTGCTGCGC GAATGGCAAG ATCTTGAGGC CGAGCTTTTT
GACGCAACCA ACAAAGCCGT TGAAGCGGTC GACTTGGACA CAGCGCCCCC GATCTGGCTA
GCTTCGCAGC ATGAGATACG TGGGTCGGTG CCAGCTGGGT CCGACGACGA AAGCGGAAGC
GATGAAGTAG ATGAGGTTGA GGCAGAAACT GCCGCAATTG AAGGCTCAAC TACTGCTACT
GACGAAGTTT CCATAGCTAG CGGTACTCAA GAAAAGAAAC CAAGCAAGAA AATTGAATTG
CCGACGAAAG AAGCTCTTCG AGAATTTATC GAGAGCGAGC TGGACGAAAT TTCGGAAGCT
CCCCCAGTGA CTGCTGCACC TACAGAAGTC TCAGGAAAAA AAGGAAAGCC TCGAGCGACA
CAAACAGACT TTGCTGCGAA AGAAGCGCGA AACCGCAAGC TCGGTGAAGC AGGCGAGTAC
TTTGTATTCC AATATGAAGT TATGAAACTC ACCGCCGCGG GCAGAGTGGA TCTGGCCAAG
CGCGTCAAGT GGGTGTCAAA GGATATAGGC GATGGCCTTG GCTATGATAT TCGATCTTTC
GATCAAGATG GCAATGAGGT TTTTCTTGAG GTGAAGACAA CGAATAGCGG AAGAGCAACA
CCATTTTTTG TATCTAACAA CGAAGTTGCT GTTTCGGAAG AAAAAGGAGA CTCCTACCGT
CTAGTAAGAG TGTTTAATTT TTCGAAGAAA CCGAGGTTCT TTTCGTTAAC AGGAAGCTTG
TCCGAAGTGC TTCAGCTCGA AGCAACGTCA TATCGAGCTC GAGTGGTATA G
 
Protein sequence
MAVNSQFLFF YIDELVKHHW VDWLEGPLAK VPSEEHRGER GFQLAVGDTG AGPRTRAGEA 
YQLGILSGRI VRPRIAGASF PSGWQAVIEG RVPSTKLGYY SYVIGRALRE NPSDPNLDRA
LDCLASILIL HIGFQSDGRW FNTVLLFQYA NNQLSSYLGR QLNDDEVRYL AAAVINTLDS
EENDLLLDRY VADLPIESLR FDLNDGKSLH FDLNGLPDIV SDKKVGINDF QKTKAALFWA
DPSRGIYMRA ARSRGIRLIQ ELIKVDQRLA QSGESNLLNE VLDSIQNNEN NPGIFSVAED
AFTDFPGLLR EWQDLEAELF DATNKAVEAV DLDTAPPIWL ASQHEIRGSV PAGSDDESGS
DEVDEVEAET AAIEGSTTAT DEVSIASGTQ EKKPSKKIEL PTKEALREFI ESELDEISEA
PPVTAAPTEV SGKKGKPRAT QTDFAAKEAR NRKLGEAGEY FVFQYEVMKL TAAGRVDLAK
RVKWVSKDIG DGLGYDIRSF DQDGNEVFLE VKTTNSGRAT PFFVSNNEVA VSEEKGDSYR
LVRVFNFSKK PRFFSLTGSL SEVLQLEATS YRARVV