Gene Dshi_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2044 
Symbol 
ID5713039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2162631 
End bp2164097 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content67% 
IMG OID641267967 
ProductHTH-type transcriptional regulator 
Protein accessionYP_001533383 
Protein GI159044589 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.411969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATCA GACCCGAGTC CCTCATCTTC GATCAGGACG GCGAAGGCAC CCGCCAGCAC 
CGCATCAAGC GGCAGGTGAT CGACGGTATC CTCAGCGGCC GGTTCAAGCC GGGGGACAAG
ATGCCCTCCA GCCGTGGCCT CGCCCGCCAG CTCGGCGTCA GCCGGATCAC CGTGACCATC
GCCTACACCG ACCTCGTGGC CGACGACTAC CTCGTGGCGC GCGGGCGATC CGGGTACTTC
GTCTCCGACA GCGCCCCCAG CGCGCCCAGC CTGGTGCAGG CCTGCCAGAA CGGCGAGAGC
ATCGTGGACT GGACGCGCCT GATGTCCCAC CGGGAACCGC CGCCCGAGGA GATCGCCCGC
CCCTTCGACT GGCACAGCTT CCCCTACCCG TTCATCTACG GACAGACGGA TCCCCAGCTT
TTCGATCATC GTAATTGGCG GCTCTGCGCG CTCGAGGCGC TCGGCCTGCG GGAATTCGAG
AGCCTGACCG CCGACCATTA TGAGCGGGAC GATCCGAAAC TCGTGGAGTA TATCCAACGC
AACATCCTGC CCCGGCGGGG GATCGCGGCG CGCCCCAACC AGATCCTGAT CACCATGGGC
GCGCAGAACG CGCTCTGGCT CTGTGCGCAA CTTCTGCTCA CCCAGCGGCG CAAGGCGGTG
CTGGAGAACC CCGGCTACCC CAGCCTGCGC CAGATCCTCG GCGCCACACG CTGCCATACC
CAAAGCGTCG ATGTGGACGC AGACGGTCTG GCCCCCGAAA CGCTGCCCGA CGCGCTCGAC
GTGCTCTTCA CCACCGTCAG CCACCAATGC CCCACCAACG CCACCATGCC ACTGGCCCGG
CGCAAGGCGC TGCTGTCCCT GGCCGCCGAA CGCGGCTTCG TGGTGGTGGA GGACGAGTAC
GAGTTCGAGC TGGCTTTCGG GCGCACTGCA ACACCGTCGC TCAAGTCGTT CGACACGCAT
GGCACGGTGA TCTATGTCGG CTCGTTCTCG AAGTCCCTGT TTCCGGGGCT TCGGCTGGGT
TTCATGGTCG CCCCGCACCC CTTCATCGCC GAGGCCCGCC GGTTGCGCGG CACCGTGCTC
CGCCATCCGC CGGGATTGAT TCAGCGGACC ACGGCGAACT TCCTGTCGCG CGGGCATTTC
GATGCGCAGA TCAACCGGAT GCGCAAGGCC TACGAGGTGC GCCGCCGCGC CATGGAAACG
GCGATCGCCG AGACCGGGCT GCAAGTCGCC TCCCAACCCG CCAATGGCGG ATCGAGCCTG
TGGATGCGGG CGCCGGACGG CGTGGATACC GACCTGCTGG CCCGGCGGCT CCGGTCCAAG
GGGGTGGTGA TCGAACCGGG CGCGGCCTTC TTCGACCCCA CCCGCCCTCA GCGCAACTTT
TACCGGCTCG CCTATTCGTC GATCGAGGTG GCCCGCATTC CCCAGGGCAT TCGGCTGATC
GCCGCTGCGC TGGCCGATCT GGACTGA
 
Protein sequence
MDIRPESLIF DQDGEGTRQH RIKRQVIDGI LSGRFKPGDK MPSSRGLARQ LGVSRITVTI 
AYTDLVADDY LVARGRSGYF VSDSAPSAPS LVQACQNGES IVDWTRLMSH REPPPEEIAR
PFDWHSFPYP FIYGQTDPQL FDHRNWRLCA LEALGLREFE SLTADHYERD DPKLVEYIQR
NILPRRGIAA RPNQILITMG AQNALWLCAQ LLLTQRRKAV LENPGYPSLR QILGATRCHT
QSVDVDADGL APETLPDALD VLFTTVSHQC PTNATMPLAR RKALLSLAAE RGFVVVEDEY
EFELAFGRTA TPSLKSFDTH GTVIYVGSFS KSLFPGLRLG FMVAPHPFIA EARRLRGTVL
RHPPGLIQRT TANFLSRGHF DAQINRMRKA YEVRRRAMET AIAETGLQVA SQPANGGSSL
WMRAPDGVDT DLLARRLRSK GVVIEPGAAF FDPTRPQRNF YRLAYSSIEV ARIPQGIRLI
AAALADLD