Gene Dshi_3862 details

Gene Information

Locus tag: Dshi_3862
Symbol:
ID: 5714391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009956
Strand:
Start bp: 70763
End bp: 72700
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 63%
IMG OID: 641276775
Product: hypothetical protein
Protein accession: YP_001542071
Protein GI: 159046400
COG category:
COG ID:
TIGRFAM ID:
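The length fields are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the protein length is taken to exclude the stop codon. Both conventions are assumptions here, since the record does not state them. A minimal sketch, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codons in an unspliced CDS, minus one for the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(70763, 72700)
print(length, protein_length_aa(length))  # 1938 645
```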


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.242184
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATCCT ATATCATGAC CGCCGCGCAA CAGGCGCCGG ATGGTGATGC CAAGGACCTG 
AAACAGGCTC TTCAAAAGGC TTTTGAGAGG CATGCGGGCC GTCTTCCCGC TGATGGATCA
GAGGTGATGT CCTCGATCAA CGCCTCGGTG GCGGTCCGGC TCTTTGCCTT GATGTACATG
CGCGGTGTGC AGCATCGCCT GATGCCGGAT AACGAGAGTG CGGTCCTGTG CGAGGCTTTG
TGCAAGACGG TCTGCGAGGT CGAGATCACC GATAAGTTCG ACGACATGTT CGTGTCGAAC
TTCGTGCTGG GGCTGATCGT GTTTCTCGAT CTTGTGCAGG ATGATTTGCA CCCTGACACG
CGGGCCGGCC TGGTTGCTAA GATCGCCGAA TGCCGGGACT GGTTGTCCGA GGCGCGTCAT
CGCAAGGTAT TCGGCACGCG CGAGACCGAG GGCACCTATG CCTGGAATCA CTCCGCCTGC
GCGGCGGCGG GCCTGGCGCT GAGCGTGATC TGGACCCGGG ATAGCCAGGC GGACTGGACC
GACACAGACT TTCACGATGT GGATTTCGGG CTGCGCCGGA TCGAAGACTA TTTCCTGCAC
GGCATCCGCG AAACAGGGGT CCCCTATGAG GGGTTCTATT ACTGTGGCGC GGTGTTTCGG
GTGCTTGGCC CCTTCGACAT TCTGGTGCGC AAGGACGCCG AGGTAGAACG CCGCTATCGG
CGCATCCGGG ATCGTCACAA GCGTAAGCTT GGCCAGTTGC TGGACTGGTA CGAGAGTGGC
ACCATCGTCA ACCAACGCGC GCTGCTCAGC TACAATCATT CGCTTTATGA CGCTCACCCG
GCGGTGAACG GTTTTCTGAC CTTCTTTCGG TCCGAGTTCG AGGTAAAGGC GGGCCGCATG
TGGTCTCGGC TGATGGCGAA AGGCACGTCC CTGCAATTCG TCGAGCGCAG CCGGGACTGG
GGCGACAACA CTCTGCACGA GGCTTTGTTG TTCCTGCATC CCAAAGGCTA TTCAGCACCG
CGTAACAAGG TGCAGACCCT TCTGTCGCGG ACCGAAGGCT ATGGGCTGCT GGTGTCGGAG
GACGCCTCCA GCCGACTGTT CGTGAAGGCC AGCAAACTGC TGATCGGGCC GCATAACCAG
TCTGATGCGG GCCATGTCAG TTGGGTCTGT AATGGTGATG CGGTGTTGAT CGACGCCGGG
CCGGGCCGCA AGGTGCGCGA TGCGTCGAAG AAGTGGGCGG AGTATTCCAA GGGCACCTAC
CGGACCGAAG GCAGCGGCGC TTCGTCCTAT GGCCATAACG CCGTGCTGAT CGACGGGCGG
GGGCAGTTGC CCTCTGGCGA GGGGGACGGC ATCGAGGGGC GCCTAAGCTA TGTCCGTCAA
ACCGAAGATT TCTGGTTGCT CGGGACCGAT GCCAGGGCGG CCTACAATAA GGATGAGTAT
AACCCCGTCC AGGTGGCCGA TAGGCATGTG GCGTTTTCCA AGGCGGCGGG CGCCTATCTG
ATCCTGATCG ACCGGGTGGT ACCGCAGGCG CCGGGGACCC ATCGGTTCCA GCGGCTTCTG
CAGTTTGCAG ACCCCGCGCA GGTGGTTGAG GAGGATGGTC CCGGCCGAAT GGCGGTGACC
AGCGGGGGCA CGGTCTATGA CCTGTGGACC CTGAGCCCGA CCGGGCCCCT GAGCACGGTC
TACGAGGAAG AGAAATTCCA GATGCCGATC AAGACCCGGG GCGTGCTGGC CCATGGGGTC
GAGGCCGAGG ACCTGTGGAT GTATACGGTC CTCGCCGCCC GCGGGAGTGC CGGGGTGCCG
ACGGATGTGT CCCTGCGCCC GGCCGAGGAC GCGGCCTTCG GGGCCGCGCT GCAGCTGGTG
CTGGAGGGGG GCGACACCCG GCTCCTGGCC CTGTCGCGGA CGACGGGGGA GCTTGAGCGG
ATTTCCGACG ACGGCTGA
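
The gene sequence is printed in groups of 10 bases, so the whitespace has to be removed before the sequence is used. The sketch below joins the groups and computes a GC percentage, which can be compared with the GC content field once the full sequence is passed in. The function names are hypothetical, and only the first printed line is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_line = "ATGACATCCT ATATCATGAC CGCCGCGCAA CAGGCGCCGG ATGGTGATGC CAAGGACCTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```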
 
Protein sequence
MTSYIMTAAQ QAPDGDAKDL KQALQKAFER HAGRLPADGS EVMSSINASV AVRLFALMYM 
RGVQHRLMPD NESAVLCEAL CKTVCEVEIT DKFDDMFVSN FVLGLIVFLD LVQDDLHPDT
RAGLVAKIAE CRDWLSEARH RKVFGTRETE GTYAWNHSAC AAAGLALSVI WTRDSQADWT
DTDFHDVDFG LRRIEDYFLH GIRETGVPYE GFYYCGAVFR VLGPFDILVR KDAEVERRYR
RIRDRHKRKL GQLLDWYESG TIVNQRALLS YNHSLYDAHP AVNGFLTFFR SEFEVKAGRM
WSRLMAKGTS LQFVERSRDW GDNTLHEALL FLHPKGYSAP RNKVQTLLSR TEGYGLLVSE
DASSRLFVKA SKLLIGPHNQ SDAGHVSWVC NGDAVLIDAG PGRKVRDASK KWAEYSKGTY
RTEGSGASSY GHNAVLIDGR GQLPSGEGDG IEGRLSYVRQ TEDFWLLGTD ARAAYNKDEY
NPVQVADRHV AFSKAAGAYL ILIDRVVPQA PGTHRFQRLL QFADPAQVVE EDGPGRMAVT
SGGTVYDLWT LSPTGPLSTV YEEEKFQMPI KTRGVLAHGV EAEDLWMYTV LAARGSAGVP
TDVSLRPAED AAFGAALQLV LEGGDTRLLA LSRTTGELER ISDDG
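
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method: it builds the codon table from the standard 64-letter amino acid string in TCAG order, whose codon assignments translation table 11 shares (the tables differ in which codons may act as starts). It assumes an unambiguous sequence (A, C, G, T only) that starts with ATG, as this gene does.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop the 10-base grouping
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGACATCCT ATATCATGAC CGCCGCGCAA"))  # MTSYIMTAAQ
print(translate("ATTTCCGACG ACGGCTGA"))  # ISDDG
```

The two calls use the first 30 and last 18 bases of the gene sequence and reproduce the first 10 and last 5 residues of the protein sequence.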