Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2601 |
Symbol | |
ID | 5713499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2765180 |
End bp | 2766208 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641268525 |
Product | putative transcriptional regulator |
Protein accession | YP_001533935 |
Protein GI | 159045141 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0251856 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCTG AGAGCACGCC ACAACTGTCA ACAGTTTCCA GTCAGTTCAT TGCGGACTGG CTGCAGGCGC TGCGACCCCT GTGCAGCGTG GATCATTTTC GATCTCTCCT GAAACGCGCC GCACTTCAAG ACGATTGCGA CGCCCCGACC GGCAGGGTGA CGCTTGATCA AGTTGTTTGT CTCTATCAAC TGGCAGCTGT CGAGACCGAG GATGAGATGA TGGGCCTTTG GTCCCGCCCC ATCCGCCAAC GCGCACTGCA ACACTTGCTG ACGTCGATCC GCGAAGCGAC AACCCTTTCC TCAGCACTCT ACCGGTTCTC GACTTTCTGG AACCTGCTGC TTGATGATTA CCAATTCTCG TTTGCGGAAG CGGATAATGT GTCGGTGTTG AAGCTTGTGC CGCAGACCGA TCAAGCCGCT CAGCGGTTCG GCCACATGCT GATCCTAAAA CTGGCACATG GACTGATTTC CTGGCTGGCC CGATATGAAG TCCCCGTCAA AGCGGTGGGT TTCGCCTTCG AGGCGCCAGC CTTCGAGGAA GACTACGCAG TGATCTTTCC TGCGCCCGTG CGGTTCGCTC AGTCGGCGAC GTCGATTGCA TTTGGTCCTG GCGTCTTGGG TCCGGTTCAG GTGCGCAGTT CGGCAGACTT GACTCTGTTT TTGGAAAACG CGCCCCGGGA TTGGATATTT ACCCAGTCCC AAGTGCATAC CCAGTCCTTG CGGGTCCGCA CCTATCTGAG CCAGACGGGT TGGGACAGTG CAAACCTGAC AGAGGCCGCG GCCGCGATGC ACGTGACCCC GCGCACACTC ATCCGCAAGT TGGAGGCCGA CGGCACATCG TTTCAGGCCA TCAAGGACGC CTTACGTCGC GACATTGCGA TCCGTCATCT GCAAACAGGG CAGCATAGTG TCGAAGCCAT CGCCCATGAG GTGGGGTTTT CGTCGGCGGC TAACTTCCAC AAAGCGTTTC AAAGGTGGAC TGGCAATACA CCCAGCTCAT ATCGACGCAA GCCTTACTCC GGCGTGTAA
|
Protein sequence | MPPESTPQLS TVSSQFIADW LQALRPLCSV DHFRSLLKRA ALQDDCDAPT GRVTLDQVVC LYQLAAVETE DEMMGLWSRP IRQRALQHLL TSIREATTLS SALYRFSTFW NLLLDDYQFS FAEADNVSVL KLVPQTDQAA QRFGHMLILK LAHGLISWLA RYEVPVKAVG FAFEAPAFEE DYAVIFPAPV RFAQSATSIA FGPGVLGPVQ VRSSADLTLF LENAPRDWIF TQSQVHTQSL RVRTYLSQTG WDSANLTEAA AAMHVTPRTL IRKLEADGTS FQAIKDALRR DIAIRHLQTG QHSVEAIAHE VGFSSAANFH KAFQRWTGNT PSSYRRKPYS GV
|
| |