Gene Dshi_3854 details

Gene Information

Locus tag: Dshi_3854
Symbol:
ID: 5714383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009956
Strand:
Start bp: 59311
End bp: 60357
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 68%
IMG OID: 641276767
Product: hypothetical protein
Protein accession: YP_001542063
Protein GI: 159046392
COG category: [L] Replication, recombination and repair
COG ID: [COG5534] Plasmid replication initiator protein
TIGRFAM ID:
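
The length fields above follow from the coordinates. As a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon (neither is stated in the record), the gene and protein lengths can be checked like this. The variable names are illustrative only.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 59311
end_bp = 60357

# An inclusive range contains (end - start + 1) positions.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop codon
# and therefore contributes no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1047 348
```

Both results agree with the Gene Length and Protein Length fields. The calculation does not depend on the strand, which the record leaves empty.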


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.677421
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACATCGG ATCTGAGTAT CTCCAACGCC GGGCGCGGGC TTGCGCCGGA TCGGTATCGT 
CAGGCGGATT TCTTCGTCTG CGATATCTTC GACGCGATCC CCAAGGATGA TCTCGCGACC
ATGGAGCACC CGGTGTTCAG CCTCGCTACC CGGCCGGACA GGCGGGTGCT GTCCTATGCC
CATAACGGGG TGGAGATCGA GGTGACCCCG AGCGTCAAGG GGCTGGCCAC GATCCACGAC
AAGGACATCC TGATCTTCTG CATCAGCCAG CTGATGGCGG CGCTGAACGC GGGGCGCGCG
GTGAGCCGGA CGCTCCAGAT CAAGGCCCAT GATTTGCTGG TGGCGACGAA CCGCGAGACA
TCCGGAGACG CGTATCGGCG GCTGCGCGAG GCGTTCGAGC GGCTGGCAGG CACGCGGATC
ACCACCAACC TGACCACCGG AGGGCAGGAG GTGACGCGCG GCTTCGGGCT GATCGAGAGC
TGGGAGATCG TGCGCAAGGC GCGCGGCGGG CGGATGGTGA GCGTGAGCGT GACGCTGTCG
GAATGGCTCT TCAATGCGGT GGTCAGCAAA TCGGTGCTGA CGCTGAGCCG GGATTACTTC
CGGCTGCGCA AGCCCCTGGA GCGGCGGATC TACGAGTTGG CGCGCAAGCA TTGTGGCCGG
CAGGCGCGCT GGGTGGTGTC GGTGGACCTG CTCTTGAAGA AGTCGGGCTC GGCCTCCCCG
CGCCGGGTGT TTCGCAAGAT GCTGCGCGAC ATGATCGCCG CCGATCATCT GCCCGATTAC
GAGATGGTCG AGGTGCCGGG CGACAAGATC GCGTTTGCCT TGCGCGGCGG GCTGGTGGAG
GACGCGGGCC CCGGGTTGGG TGCGGCCCTG CCGCCCCTGC GGGCCGAGAC CCTGGAGGAG
GCCCGGGCGC TGGCACCCGG CTGGGATGTC TACGGGCTGG AGGCGGACTG GCGCGCGTAC
TGGGCCGGAT CGGGCCGTCC GCGCCTGCGC AGCGCCGACA AGGCGTTCCT GGGCTTCGTG
CGGGCGCGCA TAGGGGCGGA AGGGTAG
 
Protein sequence
MTSDLSISNA GRGLAPDRYR QADFFVCDIF DAIPKDDLAT MEHPVFSLAT RPDRRVLSYA 
HNGVEIEVTP SVKGLATIHD KDILIFCISQ LMAALNAGRA VSRTLQIKAH DLLVATNRET
SGDAYRRLRE AFERLAGTRI TTNLTTGGQE VTRGFGLIES WEIVRKARGG RMVSVSVTLS
EWLFNAVVSK SVLTLSRDYF RLRKPLERRI YELARKHCGR QARWVVSVDL LLKKSGSASP
RRVFRKMLRD MIAADHLPDY EMVEVPGDKI AFALRGGLVE DAGPGLGAAL PPLRAETLEE
ARALAPGWDV YGLEADWRAY WAGSGRPRLR SADKAFLGFV RARIGAEG
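
The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the codon assignments of NCBI translation table 11 (the table named in the record) and that the initiating GTG codon is read as methionine, which is consistent with the protein starting with M. The helper names `translate` and `gc_fraction` are hypothetical, not part of any database tool, and only the first and last lines of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The amino acid
# assignments of table 11 match the standard code; only the set of
# permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, first_codon_is_start=True):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    # Assumption: an alternative start codon such as GTG is read as M.
    if first_codon_is_start and protein:
        protein[0] = "M"
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = "".join(dna.split()).upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "GTGACATCGG ATCTGAGTAT CTCCAACGCC GGGCGCGGGC TTGCGCCGGA TCGGTATCGT"
last_line = "CGGGCGCGCA TAGGGGCGGA AGGGTAG"

print(translate(first_line))                             # MTSDLSISNAGRGLAPDRYR
print(translate(last_line, first_codon_is_start=False))  # RARIGAEG
```

The two outputs match the first 20 and last 8 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.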