Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1470 |
Symbol | |
ID | 5694307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 1756083 |
End bp | 1758863 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641264065 |
Product | NifA subfamily transcriptional regulator |
Protein accession | YP_001529351 |
Protein GI | 158521481 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCTG CACCCCCGCC TGCCTATACT GAAGAAGCGG CCCGCCTGGA GGTGGACCTG GCCCGGGCGG CCCTTGAAAC GGCTGACCGG GAGGCGGCCC GGATCCATTT TCACAACGCG GTGGCCCGGC TTGCCGCCGG GGACCGGCCG GAAACCAGCC ATGTGCTGGT GGCAGCCACC CTGGAGCTTT CCAACCTGAA CTTTATTCTG GGCAAAGGGT TTGGCCAAAC CATCCCCTTT CTCCAGGCGG CCCTGAAAGC GGCCGAACAC CTGGGCGACC TTCGCTCAAA GGCCATGATC AAGCTGCACC TGGGCCGGCA CTACTATTTT TCCAACCAGC GGTCGGCGGC CATTCCTTTT TTTCAACAGG GCAAAACCGA GGTGGAAGCC CTGGGGGACG AGGACATTCA GGCCGGCGCC GCCGAATTTG TCGGCCTTTA CCACCACCTT CAAGGCATGT TCACCAAAGC CATCGCCTAC TTTGAAGCCG CGGTGGAACG GTTTGAAACC ACTCAGGGGC CCCTGGTACC CAACCCGTCG GCCCCCATGT GGCTGGGCTA CTGTGCCGCC TACCTGGGCC AGTTTCACCG GGCCATCGGC ACCCTGGACT ACTACCGCCG CATGCTCCTG GAACGGGGAG ACCGGGCACT TGCGGCCACC ACCCGGGCCG TGCTGGGTAT TGTCCTGCTG GAACTCAAGA AGAACAAGGA GGCGGCCATT CACCTGTCCG GCGCCATTTC TGAAGGGGAA AAGACCGGCA ACGACCTGGC CCTCTATTTT GCCAGGGGCG GCATGGCCTA CTACCACTTT GCCGAGGGCC GGCTGGAGGA GGCCAACCGC TACATCACCC ACGCCATTGT CGAAGGGACC ACCGCCGGCC TGGTCCGGCA GTATGCCACG CCGATTTTTC TGGAGATGGG GTTTGAACTC TACAAGGCGG GCAAGCCCAT CTTTTCCGAA ACCGAGATGC AGCGGGAAGT GATGCGCATC ATGCGGGAAC CCAACATTCA TTTAAAGGGC GTGCTGCTGC GGCTGATGGC CGAGCAGCGC CTTTTTCTCG GTGCCGAGCC CGAAACCGTT GAAAAGGACC TGAACGCCAG CGCGGAATAC CTGGCGCAGT CGGGCACCCC GGTACAACTG GCCAAAACCC GGTTTACCCT GATGCGCATT CACCTGAAGC GGGACGACCC TGCCGAAGCC CGGCGCCAGG CCCGAAAAGC CTGGAAAGAA CTGGCCGGCT ACGGAGAGAC CTTTTTCCCG GACGACCTTC GCCACCTGCT TGCCGAAGAG CCGGCCCGGG AACCGACGCC GGAGGTAAAG GAGGAATTCC TGCTCCGGTT TATCGACATC ATTTCCGAGC TCATGCCCGG CCCGGCGCCG GACCGGCTGT TTACCCGGCT GGTGCAGGCC ACCAACCGTT ACTTCGGCGC GGAACGGGGC GCCCTGTTCT GGTTTTCCGA TACGCCGAAA CAGACACCGG CCCTGCGTGC CGCCTGCAAC CTGACCGAAA CCGAGATCTT TTCCAACGCC TTCCGCTCCA ATCTGGCCCT GGTGTTTGAC GCATGGCGGG AAGGCCGGTC TATCCTGGTA CGTCGGGGCG ACAGGTCCGA CGATCCCTAC CGGGAAAAGG CGATTCTCTG CCTTCCCTTT CAAATCGAGG GTAAACCCAG GGGCGTGCTG TACCATGACA ACGCCTATGT AACCGACTGT TTCAACTTCC TCGACCCGGC CCAGTTAAAC CGGCTGGTCC ACACACTGGG CAGCTATATC GAACACGCAT GGGATCTTTC CCGGGGATTT GAAAAACTCC GGCCGCCACT GGCCGTGCCC TCCACCGTCA CCGGAACGGT GGAGATCGTG GGGGAAAGCA GCCGGATCAA GGCCGTGCTG ACCCAGGTGG ACCAGGTGGC GCCAACGGAC AGCACCGTAC TGATCCTGGG TGAAACCGGC GTGGGTAAGG AGCTGGTGGC CCGGCGAATA CACCAGCAGA GCCGCCGGTG CGATATGCCC CTGATCGTGG TGGACCCCAC CGCCATACCC GAGGGCCTGG TGGAAAGCGA GCTGTTCGGC CATGAAAAAG GGGCTTTCAC CGGCGCGGAC CGCCAAAAAA AGGGGCTGCT GGAACTGGCC CACCAGGGCA CCCTGTTTAT CGACGAGGTG GGTGAGATTC CCAAGTCGAT CCAGGTCAAG CTGCTGCGGG CGCTCCAGGA AAAGACCATT CAGCGCCTGG GCGGCACCAA GCCCCTTTTC TCCGACTTCC GGCTGATCGC CGCCACCAAC CGGGACCTGG CCGGGCAAGT AGCTGCCGGC CGGTTCCGGG AGGACCTCTA TTACCGGCTT AACGTCATTC CCATCACGGT GCCGCCCCTG AGGGAACGGA AACAGGACAT CGTTCTTCTG GCCGCCTGTT TCCTGCGGCG CCACAGCCTG CGCCACAACC GGCCTGCAAT GGTGCTGAGC CCGGAAGTCC AACAACGGCT TTGTGCCTAT CCATGGCCGG GCAATGTGCG GGAGCTGGAA AACGTGATTG AGCGCGGGGT ACTGCTCTCC ACCGGAGACC ACCTGGAACT GGACCTGCCC GCGGGCCATA CGCGGCCCGG GGCCGATCCG TTTACCGACC TGCCCACCCT TGACGAACTG CAGCGGCGCT ATATTCACCA CGTGCTTGAA AAAACCGGGG GCAAAATCGC CGGGCCAGAC GGGGCCGCCA AAATCCTTGG CATGAAACGC ACCAGCCTGT ATAACAGAAT GAAGCGGCTG AATGTGCGGG GAAACCGGTA A
|
Protein sequence | MNPAPPPAYT EEAARLEVDL ARAALETADR EAARIHFHNA VARLAAGDRP ETSHVLVAAT LELSNLNFIL GKGFGQTIPF LQAALKAAEH LGDLRSKAMI KLHLGRHYYF SNQRSAAIPF FQQGKTEVEA LGDEDIQAGA AEFVGLYHHL QGMFTKAIAY FEAAVERFET TQGPLVPNPS APMWLGYCAA YLGQFHRAIG TLDYYRRMLL ERGDRALAAT TRAVLGIVLL ELKKNKEAAI HLSGAISEGE KTGNDLALYF ARGGMAYYHF AEGRLEEANR YITHAIVEGT TAGLVRQYAT PIFLEMGFEL YKAGKPIFSE TEMQREVMRI MREPNIHLKG VLLRLMAEQR LFLGAEPETV EKDLNASAEY LAQSGTPVQL AKTRFTLMRI HLKRDDPAEA RRQARKAWKE LAGYGETFFP DDLRHLLAEE PAREPTPEVK EEFLLRFIDI ISELMPGPAP DRLFTRLVQA TNRYFGAERG ALFWFSDTPK QTPALRAACN LTETEIFSNA FRSNLALVFD AWREGRSILV RRGDRSDDPY REKAILCLPF QIEGKPRGVL YHDNAYVTDC FNFLDPAQLN RLVHTLGSYI EHAWDLSRGF EKLRPPLAVP STVTGTVEIV GESSRIKAVL TQVDQVAPTD STVLILGETG VGKELVARRI HQQSRRCDMP LIVVDPTAIP EGLVESELFG HEKGAFTGAD RQKKGLLELA HQGTLFIDEV GEIPKSIQVK LLRALQEKTI QRLGGTKPLF SDFRLIAATN RDLAGQVAAG RFREDLYYRL NVIPITVPPL RERKQDIVLL AACFLRRHSL RHNRPAMVLS PEVQQRLCAY PWPGNVRELE NVIERGVLLS TGDHLELDLP AGHTRPGADP FTDLPTLDEL QRRYIHHVLE KTGGKIAGPD GAAKILGMKR TSLYNRMKRL NVRGNR
|
| |