Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_0951 |
Symbol | |
ID | 5693786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 1107407 |
End bp | 1109122 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641263548 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001528838 |
Protein GI | 158520968 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00021324 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGGTC GGATAAAATT ATTTTCAAAC ATTAAAAACA GTTATTTTCT TTCTGCTTCT GTTTTTCTCC TGCTTTTCGC TCTCGGAATC TTTATGTTTT TTCCGCTTTT TCGAATCGGA ACACCCGGCC GTTCTATCCC TTTGGCCGGC CAGTGGAAAA TATCTCTTAA CGATTCGACT GCGTTTTCCT TTGCTGAATA TGACGACACC GACTGGGGTA CCATTGACCT GCCCGGAAAA GTCGTTCCTT ATTCTATAGA GCAAAACAGG GGCATACGCG GCACATGCTG GCTGCGAAAA ACATTTTTTT TAGAGGATGC GCCCGGCGAC TACGGCCTTA TACTGGGGAG AATCGCCAAC GCGGACCAGA CCTATCTCAA CGGCGAAAAA ATCGGTGAAA CCGGCCGGTT TTTCCCCAAC ACGTTTTCCA TGTGGAACTA TCCCAGAAAC TACCTGCTGC CGGACCGGCA TCTGCGGCGG GGCGGCCAAA ACGTGATTGC CGTGCGGGTC TCCTACGATG TCATCGGCGA AGTGATGGGG AACCTGATGA TTGTCGATGC CGGCTATTTA AGACGGTACA CACCCTTTGC CCGGTTTATT CATGTCACCA TCGGATACGC GGCCATCAGC ATCGGGCTTG TGCTGGTGCT GGCCTTTCTC TTTTCCCGGT TCCGGTACCT GACCTTTGAT GAAAACTATC TTTATTTTTT GCAGTTTTTT GCCGGCCTGC CCATTGTGCT TGACCTCTGT CTCACCTGGG AGGTTTATCC GGACCACACC ACCCGGCTCA AGGTGCTGGG CCTTTCCTGG GTGGCCATCA ACGTGTTTCA TCCCGCGTTT CTTCACCGGT TTTACCGGCT GAAAAGAAAA TGGCCGGAAC GCATCCTCTG GACCTATCTG GCCGCCTGCC TTCTGGGTGC GGTGTTCTGC ACGCATGCAG GCAATATCCG TCCCATGGCC ACCCTGCTCA TCGGTGTCAC CTGGTGCATC GGGTTTTACA ACATGTCCTG CCATATCGAG GCCCTGGTCA GAAAGCGGGA CCACGCAAAA ATATTCAGTG TTTTCGGCAT TGTCACCATT CTGGCCGCCA TGAACGACGG GTGGTGCTAT TTCAACAAGT TCGTGGATTT CAACGTCACG GTATTCGGCT GGGCCCCCAC GGTCATGGTG ATTCAGACCG GCGCCATCTT TCTTTACATG GGCACCTTTC TTGTGCTGGA GACAAAATAC AGGGAGATGG TGGAGGAGGT GGACGACCTG AACCGGAACC TTGAGAATTT TGTTCTTGAA AACGCCTTTC TGACCATGGC CGTAAAACAG AGCCGAGCCC CGAAATCCGG CCCGTCTCGT ATCACGCCCC AGGCCGAGGA AAAGATTCAG GCCGCCATCG ATATCATCGG GGAGAACTAC CTGTCCGAAC TGTCAAGAAC CGACCTTGCG AAGACGCTGG ATGTCAGCCC GGACAGCCTG GGCAAACAGT TCAAACAGTA CACCGGCAAA AAGCTGGGGG ACTACATCAA CGAGCTGCGC ATTCACGAGG CGGCCCGGCG CCTGCGCGAG ACCGACGACA AGGTGATCCA TATCGCCTTT GATACCGGGT TTGAAAGCCT GCGCACCTTC AACCGGGTGT TTTCAAAGCT CATGAACACC ACCCCGGCCC AGTACCGGCA GGAAACCGTT CCTGATGAAG GTAAAAGCGC GGTGCCGGGG AATTAA
|
Protein sequence | MTGRIKLFSN IKNSYFLSAS VFLLLFALGI FMFFPLFRIG TPGRSIPLAG QWKISLNDST AFSFAEYDDT DWGTIDLPGK VVPYSIEQNR GIRGTCWLRK TFFLEDAPGD YGLILGRIAN ADQTYLNGEK IGETGRFFPN TFSMWNYPRN YLLPDRHLRR GGQNVIAVRV SYDVIGEVMG NLMIVDAGYL RRYTPFARFI HVTIGYAAIS IGLVLVLAFL FSRFRYLTFD ENYLYFLQFF AGLPIVLDLC LTWEVYPDHT TRLKVLGLSW VAINVFHPAF LHRFYRLKRK WPERILWTYL AACLLGAVFC THAGNIRPMA TLLIGVTWCI GFYNMSCHIE ALVRKRDHAK IFSVFGIVTI LAAMNDGWCY FNKFVDFNVT VFGWAPTVMV IQTGAIFLYM GTFLVLETKY REMVEEVDDL NRNLENFVLE NAFLTMAVKQ SRAPKSGPSR ITPQAEEKIQ AAIDIIGENY LSELSRTDLA KTLDVSPDSL GKQFKQYTGK KLGDYINELR IHEAARRLRE TDDKVIHIAF DTGFESLRTF NRVFSKLMNT TPAQYRQETV PDEGKSAVPG N
|
| |