Gene Dole_1470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1470 
Symbol 
ID5694307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1756083 
End bp1758863 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content62% 
IMG OID641264065 
ProductNifA subfamily transcriptional regulator 
Protein accessionYP_001529351 
Protein GI158521481 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCTG CACCCCCGCC TGCCTATACT GAAGAAGCGG CCCGCCTGGA GGTGGACCTG 
GCCCGGGCGG CCCTTGAAAC GGCTGACCGG GAGGCGGCCC GGATCCATTT TCACAACGCG
GTGGCCCGGC TTGCCGCCGG GGACCGGCCG GAAACCAGCC ATGTGCTGGT GGCAGCCACC
CTGGAGCTTT CCAACCTGAA CTTTATTCTG GGCAAAGGGT TTGGCCAAAC CATCCCCTTT
CTCCAGGCGG CCCTGAAAGC GGCCGAACAC CTGGGCGACC TTCGCTCAAA GGCCATGATC
AAGCTGCACC TGGGCCGGCA CTACTATTTT TCCAACCAGC GGTCGGCGGC CATTCCTTTT
TTTCAACAGG GCAAAACCGA GGTGGAAGCC CTGGGGGACG AGGACATTCA GGCCGGCGCC
GCCGAATTTG TCGGCCTTTA CCACCACCTT CAAGGCATGT TCACCAAAGC CATCGCCTAC
TTTGAAGCCG CGGTGGAACG GTTTGAAACC ACTCAGGGGC CCCTGGTACC CAACCCGTCG
GCCCCCATGT GGCTGGGCTA CTGTGCCGCC TACCTGGGCC AGTTTCACCG GGCCATCGGC
ACCCTGGACT ACTACCGCCG CATGCTCCTG GAACGGGGAG ACCGGGCACT TGCGGCCACC
ACCCGGGCCG TGCTGGGTAT TGTCCTGCTG GAACTCAAGA AGAACAAGGA GGCGGCCATT
CACCTGTCCG GCGCCATTTC TGAAGGGGAA AAGACCGGCA ACGACCTGGC CCTCTATTTT
GCCAGGGGCG GCATGGCCTA CTACCACTTT GCCGAGGGCC GGCTGGAGGA GGCCAACCGC
TACATCACCC ACGCCATTGT CGAAGGGACC ACCGCCGGCC TGGTCCGGCA GTATGCCACG
CCGATTTTTC TGGAGATGGG GTTTGAACTC TACAAGGCGG GCAAGCCCAT CTTTTCCGAA
ACCGAGATGC AGCGGGAAGT GATGCGCATC ATGCGGGAAC CCAACATTCA TTTAAAGGGC
GTGCTGCTGC GGCTGATGGC CGAGCAGCGC CTTTTTCTCG GTGCCGAGCC CGAAACCGTT
GAAAAGGACC TGAACGCCAG CGCGGAATAC CTGGCGCAGT CGGGCACCCC GGTACAACTG
GCCAAAACCC GGTTTACCCT GATGCGCATT CACCTGAAGC GGGACGACCC TGCCGAAGCC
CGGCGCCAGG CCCGAAAAGC CTGGAAAGAA CTGGCCGGCT ACGGAGAGAC CTTTTTCCCG
GACGACCTTC GCCACCTGCT TGCCGAAGAG CCGGCCCGGG AACCGACGCC GGAGGTAAAG
GAGGAATTCC TGCTCCGGTT TATCGACATC ATTTCCGAGC TCATGCCCGG CCCGGCGCCG
GACCGGCTGT TTACCCGGCT GGTGCAGGCC ACCAACCGTT ACTTCGGCGC GGAACGGGGC
GCCCTGTTCT GGTTTTCCGA TACGCCGAAA CAGACACCGG CCCTGCGTGC CGCCTGCAAC
CTGACCGAAA CCGAGATCTT TTCCAACGCC TTCCGCTCCA ATCTGGCCCT GGTGTTTGAC
GCATGGCGGG AAGGCCGGTC TATCCTGGTA CGTCGGGGCG ACAGGTCCGA CGATCCCTAC
CGGGAAAAGG CGATTCTCTG CCTTCCCTTT CAAATCGAGG GTAAACCCAG GGGCGTGCTG
TACCATGACA ACGCCTATGT AACCGACTGT TTCAACTTCC TCGACCCGGC CCAGTTAAAC
CGGCTGGTCC ACACACTGGG CAGCTATATC GAACACGCAT GGGATCTTTC CCGGGGATTT
GAAAAACTCC GGCCGCCACT GGCCGTGCCC TCCACCGTCA CCGGAACGGT GGAGATCGTG
GGGGAAAGCA GCCGGATCAA GGCCGTGCTG ACCCAGGTGG ACCAGGTGGC GCCAACGGAC
AGCACCGTAC TGATCCTGGG TGAAACCGGC GTGGGTAAGG AGCTGGTGGC CCGGCGAATA
CACCAGCAGA GCCGCCGGTG CGATATGCCC CTGATCGTGG TGGACCCCAC CGCCATACCC
GAGGGCCTGG TGGAAAGCGA GCTGTTCGGC CATGAAAAAG GGGCTTTCAC CGGCGCGGAC
CGCCAAAAAA AGGGGCTGCT GGAACTGGCC CACCAGGGCA CCCTGTTTAT CGACGAGGTG
GGTGAGATTC CCAAGTCGAT CCAGGTCAAG CTGCTGCGGG CGCTCCAGGA AAAGACCATT
CAGCGCCTGG GCGGCACCAA GCCCCTTTTC TCCGACTTCC GGCTGATCGC CGCCACCAAC
CGGGACCTGG CCGGGCAAGT AGCTGCCGGC CGGTTCCGGG AGGACCTCTA TTACCGGCTT
AACGTCATTC CCATCACGGT GCCGCCCCTG AGGGAACGGA AACAGGACAT CGTTCTTCTG
GCCGCCTGTT TCCTGCGGCG CCACAGCCTG CGCCACAACC GGCCTGCAAT GGTGCTGAGC
CCGGAAGTCC AACAACGGCT TTGTGCCTAT CCATGGCCGG GCAATGTGCG GGAGCTGGAA
AACGTGATTG AGCGCGGGGT ACTGCTCTCC ACCGGAGACC ACCTGGAACT GGACCTGCCC
GCGGGCCATA CGCGGCCCGG GGCCGATCCG TTTACCGACC TGCCCACCCT TGACGAACTG
CAGCGGCGCT ATATTCACCA CGTGCTTGAA AAAACCGGGG GCAAAATCGC CGGGCCAGAC
GGGGCCGCCA AAATCCTTGG CATGAAACGC ACCAGCCTGT ATAACAGAAT GAAGCGGCTG
AATGTGCGGG GAAACCGGTA A
 
Protein sequence
MNPAPPPAYT EEAARLEVDL ARAALETADR EAARIHFHNA VARLAAGDRP ETSHVLVAAT 
LELSNLNFIL GKGFGQTIPF LQAALKAAEH LGDLRSKAMI KLHLGRHYYF SNQRSAAIPF
FQQGKTEVEA LGDEDIQAGA AEFVGLYHHL QGMFTKAIAY FEAAVERFET TQGPLVPNPS
APMWLGYCAA YLGQFHRAIG TLDYYRRMLL ERGDRALAAT TRAVLGIVLL ELKKNKEAAI
HLSGAISEGE KTGNDLALYF ARGGMAYYHF AEGRLEEANR YITHAIVEGT TAGLVRQYAT
PIFLEMGFEL YKAGKPIFSE TEMQREVMRI MREPNIHLKG VLLRLMAEQR LFLGAEPETV
EKDLNASAEY LAQSGTPVQL AKTRFTLMRI HLKRDDPAEA RRQARKAWKE LAGYGETFFP
DDLRHLLAEE PAREPTPEVK EEFLLRFIDI ISELMPGPAP DRLFTRLVQA TNRYFGAERG
ALFWFSDTPK QTPALRAACN LTETEIFSNA FRSNLALVFD AWREGRSILV RRGDRSDDPY
REKAILCLPF QIEGKPRGVL YHDNAYVTDC FNFLDPAQLN RLVHTLGSYI EHAWDLSRGF
EKLRPPLAVP STVTGTVEIV GESSRIKAVL TQVDQVAPTD STVLILGETG VGKELVARRI
HQQSRRCDMP LIVVDPTAIP EGLVESELFG HEKGAFTGAD RQKKGLLELA HQGTLFIDEV
GEIPKSIQVK LLRALQEKTI QRLGGTKPLF SDFRLIAATN RDLAGQVAAG RFREDLYYRL
NVIPITVPPL RERKQDIVLL AACFLRRHSL RHNRPAMVLS PEVQQRLCAY PWPGNVRELE
NVIERGVLLS TGDHLELDLP AGHTRPGADP FTDLPTLDEL QRRYIHHVLE KTGGKIAGPD
GAAKILGMKR TSLYNRMKRL NVRGNR