Gene Dshi_2601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2601 
Symbol 
ID5713499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2765180 
End bp2766208 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content58% 
IMG OID641268525 
Productputative transcriptional regulator 
Protein accessionYP_001533935 
Protein GI159045141 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0251856 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCTG AGAGCACGCC ACAACTGTCA ACAGTTTCCA GTCAGTTCAT TGCGGACTGG 
CTGCAGGCGC TGCGACCCCT GTGCAGCGTG GATCATTTTC GATCTCTCCT GAAACGCGCC
GCACTTCAAG ACGATTGCGA CGCCCCGACC GGCAGGGTGA CGCTTGATCA AGTTGTTTGT
CTCTATCAAC TGGCAGCTGT CGAGACCGAG GATGAGATGA TGGGCCTTTG GTCCCGCCCC
ATCCGCCAAC GCGCACTGCA ACACTTGCTG ACGTCGATCC GCGAAGCGAC AACCCTTTCC
TCAGCACTCT ACCGGTTCTC GACTTTCTGG AACCTGCTGC TTGATGATTA CCAATTCTCG
TTTGCGGAAG CGGATAATGT GTCGGTGTTG AAGCTTGTGC CGCAGACCGA TCAAGCCGCT
CAGCGGTTCG GCCACATGCT GATCCTAAAA CTGGCACATG GACTGATTTC CTGGCTGGCC
CGATATGAAG TCCCCGTCAA AGCGGTGGGT TTCGCCTTCG AGGCGCCAGC CTTCGAGGAA
GACTACGCAG TGATCTTTCC TGCGCCCGTG CGGTTCGCTC AGTCGGCGAC GTCGATTGCA
TTTGGTCCTG GCGTCTTGGG TCCGGTTCAG GTGCGCAGTT CGGCAGACTT GACTCTGTTT
TTGGAAAACG CGCCCCGGGA TTGGATATTT ACCCAGTCCC AAGTGCATAC CCAGTCCTTG
CGGGTCCGCA CCTATCTGAG CCAGACGGGT TGGGACAGTG CAAACCTGAC AGAGGCCGCG
GCCGCGATGC ACGTGACCCC GCGCACACTC ATCCGCAAGT TGGAGGCCGA CGGCACATCG
TTTCAGGCCA TCAAGGACGC CTTACGTCGC GACATTGCGA TCCGTCATCT GCAAACAGGG
CAGCATAGTG TCGAAGCCAT CGCCCATGAG GTGGGGTTTT CGTCGGCGGC TAACTTCCAC
AAAGCGTTTC AAAGGTGGAC TGGCAATACA CCCAGCTCAT ATCGACGCAA GCCTTACTCC
GGCGTGTAA
 
Protein sequence
MPPESTPQLS TVSSQFIADW LQALRPLCSV DHFRSLLKRA ALQDDCDAPT GRVTLDQVVC 
LYQLAAVETE DEMMGLWSRP IRQRALQHLL TSIREATTLS SALYRFSTFW NLLLDDYQFS
FAEADNVSVL KLVPQTDQAA QRFGHMLILK LAHGLISWLA RYEVPVKAVG FAFEAPAFEE
DYAVIFPAPV RFAQSATSIA FGPGVLGPVQ VRSSADLTLF LENAPRDWIF TQSQVHTQSL
RVRTYLSQTG WDSANLTEAA AAMHVTPRTL IRKLEADGTS FQAIKDALRR DIAIRHLQTG
QHSVEAIAHE VGFSSAANFH KAFQRWTGNT PSSYRRKPYS GV