Gene Dshi_3242 details

Gene Information

Locus tag: Dshi_3242
Symbol:
ID: 5712299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3410436
End bp: 3411863
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 64%
IMG OID: 641269170
Product: protein of unknown function DUF404
Protein accession: YP_001534576
Protein GI: 159045782
COG category: [S] Function unknown
COG ID: [COG2308] Uncharacterized conserved protein
TIGRFAM ID:
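
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record; the 1-based inclusive convention is an assumption.
start_bp = 3410436
end_bp = 3411863
gene_length_bp = 1428
protein_length_aa = 475

# With inclusive coordinates the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print(span, codons, coding_codons)  # 1428 476 475
```

Under these assumptions the span matches the gene length, and the codon count minus the stop codon matches the protein length.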


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.657698
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATAA AGCAGCCCTT TGACGAGATG ACGGGCATGG GGGACGGCGC GATCCGGTCC 
CCGTATGCAG ATTTTGACAC CTGGTTCAGC GGCGAGGACC CCGCCCGCCT GCGCAAGAAG
GCCCGCGAAG CCGAAGACGT TTTCCGGCTG ACGGGGATCA CCTTCAACGT CTATGGGCGC
GAAGAGGCGG CCGAACGGCT CATTCCCTTC GACATCGTGC CGCGCATCAT CTCGGCCCGG
GAATGGACCC GGCTGGCCAA GGGGATCGAG CAGCGGGTGC GCGCGATCAA CGCCTTTCTG
CACGACATCT ATCACCGGCA GGAGATCATC CGCGCGGGCC GCATACCCGC CGAGATGATC
GCCAATAACG AGGCGTTCCT GCCCCAGATG ATCGGCATGA CGCCGCCGGG CAATGTCTAT
ACCCATATCG TCGGCATCGA CCTGGTGCGC ACCGGCGAGG ACGAGTTCTA CGTGCTGGAG
GACAATGCGC GTACGCCTTC GGGGGTCTCC TACATGCTGG AGAACCGGGA GACGATGCTG
CAGATGTTCC CGGAGCTGTT CGCGCGCAAC CGGGTGCGGT CGGTCAGCGA GTACCCGCAA
AACCTGCGCC GGTCGTTGAG CGACTGTTTC CCGCCGGCCT GTACCGGCAA GCCGGTGGTG
GGGGTTCTGA CCCCGGGGAT CCACAACTCG GCCTATTACG AGCATGCGTT TCTCGCCGAC
AAGATGGGCG CGGCACTGGT GGAGGGGCAT GACCTGAAGG TGGTGGACGG GCGCGTGGCG
ATGCGGACCA CCCGCGGCTT CACCCCGATC GACGTGCTCT ACCGGCGGGT GGATGACGAT
TTCCTCGACC CGATGAATTT CAGGCCCGAG AGCCTGCTGG GCGTGCCGGG TATCATGGAT
GTCTACCGCG CGGGCGGGAT CACCATCGCC AACGCCCCGG GCACCGGGAT CGCGGACGAC
AAGGCGATCT ATTCCTACAT GCCGGAGATC GTCGAGTTCT ATACCGGCGA GCAGGCGATC
CTGAAGAACG TGCCGACCCA TCGCTGCAAC GACCCCGACA CGCTGGCCTA TGTTCTGGAC
AATCTGGCCG ACTTGGTGGT CAAGGAGGTG CATGGCTCGG GCGGCTACGG GATGCTGGTG
GGGCCTGCGG CCTCGAAAAA GGAGATCGCC GCCTTCCGCG AGAAGCTGAT CGCCAAGCCC
GACAGCTATA TCGCCCAGCC GACGCTGAGC CTGAGCACGG TGCCGATTTT CGCGCGTTCG
GGGCTGGCGC CGCGGCATGT GGATTTGCGG CCCTTCGTGC TGGTCTCGCC AAAGAAGATC
CATATCACGC CCGGCGGGCT GACGCGGGTG GCGTTGCAGA AGGGGTCGCT GGTGGTCAAT
TCGAGCCAGG GAGGCGGCAC CAAGGACACC TGGGTGCTGG AGGAGTAG
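
GC content is the fraction of bases that are G or C. The following is a minimal sketch of that calculation, not the method the database itself uses; `gc_fraction` is a hypothetical helper name, and the example input is only the first 60-base row of the gene sequence, so its value need not equal the 64% reported for the record.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# First row of the gene sequence, with the spaces from the listing left in.
first_block = "ATGAAGATAA AGCAGCCCTT TGACGAGATG ACGGGCATGG GGGACGGCGC GATCCGGTCC"
print(f"{gc_fraction(first_block):.0%}")  # 60%
```

Passing the full gene sequence to the same function gives the whole-gene value.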
 
Protein sequence
MKIKQPFDEM TGMGDGAIRS PYADFDTWFS GEDPARLRKK AREAEDVFRL TGITFNVYGR 
EEAAERLIPF DIVPRIISAR EWTRLAKGIE QRVRAINAFL HDIYHRQEII RAGRIPAEMI
ANNEAFLPQM IGMTPPGNVY THIVGIDLVR TGEDEFYVLE DNARTPSGVS YMLENRETML
QMFPELFARN RVRSVSEYPQ NLRRSLSDCF PPACTGKPVV GVLTPGIHNS AYYEHAFLAD
KMGAALVEGH DLKVVDGRVA MRTTRGFTPI DVLYRRVDDD FLDPMNFRPE SLLGVPGIMD
VYRAGGITIA NAPGTGIADD KAIYSYMPEI VEFYTGEQAI LKNVPTHRCN DPDTLAYVLD
NLADLVVKEV HGSGGYGMLV GPAASKKEIA AFREKLIAKP DSYIAQPTLS LSTVPIFARS
GLAPRHVDLR PFVLVSPKKI HITPGGLTRV ALQKGSLVVN SSQGGGTKDT WVLEE
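
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is illustrative only: `translate` is a hypothetical helper, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). Only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last rows of the gene sequence; both start on a codon boundary.
first_block = "ATGAAGATAA AGCAGCCCTT TGACGAGATG ACGGGCATGG GGGACGGCGC GATCCGGTCC"
last_block = "TCGAGCCAGG GAGGCGGCAC CAAGGACACC TGGGTGCTGG AGGAGTAG"

print(translate(first_block))  # MKIKQPFDEMTGMGDGAIRS
print(translate(last_block))   # SSQGGGTKDTWVLEE
```

The two outputs match the start and end of the listed protein sequence, and the final TAG codon is the stop.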