Gene Dshi_2794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2794 
Symbol 
ID5713694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2953140 
End bp2954270 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content68% 
IMG OID641268720 
Producthypothetical protein 
Protein accessionYP_001534128 
Protein GI159045334 
COG category[S] Function unknown 
COG ID[COG4260] Putative virion core protein (lumpy skin disease virus) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.10969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.809001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATCT TCGATTTCCT CTCCGGTCAG TTCATCGACG TGATCCACTG GACCGACGAC 
ACCCGCGACA CGATGGTGTG GCGGTTCGAG CGGCAGGGCC ACGAGATCAA GTACGGCGCC
AAGCTGACCG TGCGCGAGGG CCAGTCGGCG GTGTTCGTGC ACGAGGGGCA GCTGGCGGAT
GTGTTCACGC CGGGGCTCTA CATGCTCGAG ACCAACAACA TGCCGATCAT GACCAGTCTG
CAGCACTGGG ATCACGGCTT TCGCAGCCCG TTCAAGTCCG AGATCTATTT CGTCAACACG
ACGCGGTTCA ACAACCTCAA ATGGGGCACC AAGAACCCGA TCATGCTGCG CGATCCGGAG
TTCGGGCCGG TGCGGATCCG GGCCTTCGGG ACCTATTCGG TGCGCGTGGT GGACCCGGCG
CGGTTCCTGT CGGAGATCGT GGGCACCGAT GGCGAGTTCA CCATGGATGA GATCAGCTTC
CAGATCCGCA ACATCATCGT GCAGGAATTC AGCCGGGTGA TCGCGGGCGC GGGCATTCCG
GTGCTGGACA TGGCCGCCAA CACCGCCGAG CTGGGCAAGG GGGTGGCCAC GGCGATCTCC
GAGACCATCG CGGGCTACGG GCTGTCCCTG CCGGAGCTTT ACATCGAGAA CATCTCGCTA
CCCCCGGCGG TGGAGACGGC GCTCGACAAG CGGACCTCGA TGGGTCTGGT GGGCGATCTG
GGCAAGTTCA CGCAATACTC GGCCGCCGAG GCGATGACGG CCGCCGGCAA GGCCGGGGGC
GACAGTGGCA TGGGCGCGGG CCTGGGGGCC GGCATGGGCA TGGCCATGGG CGCGCAGATG
GCCCAGGCCG GGCCCTGGGG CGCGCGCCCT GCCCCGGCAC CTGCGGCGGC CACCCCGGTC
GCGCCGCCGC CCCCGCCGGT GGAGCATGTC TGGCACATCG CCGAGGGCGG CCAGACCAAG
GGCCCGTTTT CCAAGGCCGA TCTGGGGCGC ATGGCCGCCG AGGGCGGGCT GACCCGCCAG
ACCCATGTCT GGACGCCCGG GCAGGACGGC TGGATGCGGG CAGAGGATGT CACCGAGCTG
GCGCAGCTTT TCACCATTCT GCCGCCCCCG CCCCCGCCGC CGGGCGCGTA A
 
Protein sequence
MSIFDFLSGQ FIDVIHWTDD TRDTMVWRFE RQGHEIKYGA KLTVREGQSA VFVHEGQLAD 
VFTPGLYMLE TNNMPIMTSL QHWDHGFRSP FKSEIYFVNT TRFNNLKWGT KNPIMLRDPE
FGPVRIRAFG TYSVRVVDPA RFLSEIVGTD GEFTMDEISF QIRNIIVQEF SRVIAGAGIP
VLDMAANTAE LGKGVATAIS ETIAGYGLSL PELYIENISL PPAVETALDK RTSMGLVGDL
GKFTQYSAAE AMTAAGKAGG DSGMGAGLGA GMGMAMGAQM AQAGPWGARP APAPAAATPV
APPPPPVEHV WHIAEGGQTK GPFSKADLGR MAAEGGLTRQ THVWTPGQDG WMRAEDVTEL
AQLFTILPPP PPPPGA