Gene Dshi_2050 details


Gene Information

Locus tag: Dshi_2050
Symbol:
ID: 5713045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2170309
End bp: 2171463
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 67%
IMG OID: 641267972
Product: hypothetical protein
Protein accession: YP_001533388
Protein GI: 159044594
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0767] ABC-type transport system involved in resistance to organic solvents, permease component
TIGRFAM ID: [TIGR00056] conserved hypothetical integral membrane protein
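
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2170309, 2171463)
print(length)                     # 1155
print(protein_length_aa(length))  # 384
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.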


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.449343
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCGCAA CGGATGCCGC AGAATATGCA GAAGGACGCG GGGTCGATGC GCGCCCCGCG 
CCCGACTTGC GCTTGTCGCA GGATGAAGCC GGGGTCGCAG GCGCGCTGAG TGGCGATCTG
ACGATCTACG GGGTGGCCGA TCTGCAGCGG CAGCTTGCCG CGCGCCCGGC CGGGTCGCTG
ACCCTGGATC TGTCCGGCAT CGGGCGGATC GATACGGCGG GCGCCTGGCT GCTGGCAGAA
CTGGCGCGGG GCGAGGGGGT CCGCCTTGTG GGGGCGCCCG ACAAGGTGGC GCGGCTGATC
GCCAACGTGG CCAAGGCCGA GCCCGAACAC CCGGAGCGGA CAGAGACGCC GCCGACCCTG
ACCGATCGCT TGGAGCGGCT CGGGCGGCAG GTGGTGGAGG GGACGAAATT CCTCGGCGGG
CTGACCGGGA TGCTGGGCCT CGTGCTGGCC CGGTTCGGTC GGGCCCTGCG GCATCCGCGG
GAGTTCCGGA TGACCGCGCT GGTGCATCAT TGCGAGGAGG TGGGGCTCAG GGCCGTGCCC
ATCGTGGCGC TGATGGCCTT TCTGATCGGT ATCGTTCTTG CCTTCCAGGG CGCGAGCCAG
CTGCGGCAAT TCGGGGCCGA GGTCTTCGTC GTCGACCTGA TCTCGATCTC GATTCTGCGA
GAGCTTGGCA TTCTGCTGAC GGCGATCATC GTGGCCGGGC GGACGGCCTC ATCGTTTACC
GCGGCCATCG GGTCGATGAA GATGCGCGAA GAGATCGATG CGATGCGGAC GCTGGGCCTC
GACCCCGCGA TGCTGCTGTT CCTGCCGCGG GTACTGGCGC TGCTGATCAT GTTGCCGATC
CTGGGGCTGA TCGCCAACCT GTCGGGGTTG CTGGGCGGGG CGCTGATGTC CTGGATCGAG
CTGGGCATCT CGCCCGCGAT GTTCCAGACC CGGCTGATCG AGGGGACCGA TATCAATCAT
GCGGTGGTCG GGCTCGTCAA GGCGCCGTTC TTCGCCATCC TGATCGGGGT GGTTGGCTGT
CATGCGGGGA TGCAGGTGGA GGGCAACGCC GAATCCCTTG GCCGGATGAC CTCGGGTGCG
GTGGTGACCG CGATCTTCGC CGTGATCGTG ACCGATGCGG CGTTTTCGAT TTTCTTTGCG
CAGATGGGGA TCTGA
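
The GC content reported for this gene (67%) is the fraction of G and C bases in the gene sequence. As a minimal sketch of that calculation, the hypothetical helper below strips the spaces and line breaks used in the listing above before counting; it is illustrated on the first ten-base block only.

```python
def gc_content(sequence):
    """Fraction of G/C bases in a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First ten-base block of the gene sequence: 4 G + 3 C out of 10 bases.
first_block = "GTGGCCGCAA"
print(f"{gc_content(first_block):.0%}")  # 70%
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field of the record.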
 
Protein sequence
MAATDAAEYA EGRGVDARPA PDLRLSQDEA GVAGALSGDL TIYGVADLQR QLAARPAGSL 
TLDLSGIGRI DTAGAWLLAE LARGEGVRLV GAPDKVARLI ANVAKAEPEH PERTETPPTL
TDRLERLGRQ VVEGTKFLGG LTGMLGLVLA RFGRALRHPR EFRMTALVHH CEEVGLRAVP
IVALMAFLIG IVLAFQGASQ LRQFGAEVFV VDLISISILR ELGILLTAII VAGRTASSFT
AAIGSMKMRE EIDAMRTLGL DPAMLLFLPR VLALLIMLPI LGLIANLSGL LGGALMSWIE
LGISPAMFQT RLIEGTDINH AVVGLVKAPF FAILIGVVGC HAGMQVEGNA ESLGRMTSGA
VVTAIFAVIV TDAAFSIFFA QMGI
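
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). Note that the gene begins with GTG, which is read as methionine (M) when it acts as a start codon. The sketch below is an illustration, not the database's own pipeline: the function name is hypothetical, the codon table is the standard amino-acid assignment shared by tables 1 and 11, and the start-codon set is a deliberately small, assumed subset of the alternative starts that table 11 allows.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: illustrative subset of table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(sequence):
    """Translate a CDS; a recognised first codon becomes M, and '*' ends translation."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("GTGGCCGCAA CGGATGCCGC AGAATATGCA"))  # MAATDAAEYA
```

The output matches the first ten residues of the protein sequence listed above.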