Gene Dole_1096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1096 
Symbol 
ID5693930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1301718 
End bp1304903 
Gene Length3186 bp 
Protein Length1061 aa 
Translation table11 
GC content49% 
IMG OID641263690 
Productankyrin 
Protein accessionYP_001528980 
Protein GI158521110 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0912242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAGAC AAATAATCAT TTTTTTTCTT TTGCTTCTAA TATCAACTTT GTTTTTCGGA 
AGCGGTTTTG CAGTTGCTGA AGATATAAAC ACGGCGTTTG TTTCAGCCTG CAGGGAAGGA
GATTATGAAA CCGTGGTGCG CCTGCTGGAT AAGGGCGCGG ATGTTAATTT CGGGAATAGG
GACTACAATT CACCGTTAAT CGGGGCGGTA CAGTCCGGCA GGATGGATAT TGTCGATCTC
CTCCTTGAAA AAGGCGCTGA TATTAACCAG GCTAACAGAA ATGGCTATAC GCCTTTAATG
ACGGCGTCGT CAAAATGCCG GCTTGATATG ATAAAATATT TCATTGACCG GGGCGCGGAC
ATTAACGCCA GAACCCGGTC AAAAAACACG ACGATCATGA GCGCCGTTCA TGCGGGATGC
GCGGAAGCCG TCAAACTCTT GATTTTAAAT GGTGCGGATT TAAACGACAG GGATGATCAT
GGTGATACGC TGTTGCATAT TGCCGCCAGA AGCCCCCGTG ACGCGCCTGG AATCATACAC
CTGCTTTTGG ACCGGGGCGC TGATATCGAA GCCAGAAATA ACCAGAAGAA GACACCGTTG
ATTTATGCTG CCGGCAAACC GAAATCTTTG AAAGTGCTGC TCGAACAGGG CGCGGACATT
CACGCGGTGG ATATTCATGG AGACACTGTT ATCACCACAG GCTCAATGAA GGACAATCCT
GAAGCGATAC AAGTTCTTTT GCAAGCCGGT TGTGACGTAA ACATCAGGAA TAAAGAAACG
GGCAAAACCC CTTTAATGGA AGCATGTGTA AATGGGCACA TCAACACGGC TGAATGCCTG
ATTAAAAACA GGGCGGATGT TAACGCGGGC TACGTCCTTA GAACATCGGG TTTTCAAAAT
ATGCCCCGCG TGTATAGCAG CCCCGACGTC GCTTTTATCT CCGTTGCCGG TACCTCATAC
ACTGACGCTG AAAACCGTGA AACTATTATA TATAAGGAAA ACGGTATGAC ACCGTTAATG
GAAGTGTCTC AGCGGGGATT TTGTGATATT GCAGCCCTGC TGATAAAAAA CAGGGCCAGA
ATCAATACCG CGTCAGAAAG CGGGCAAACC GCGCTGATGA TGGCATGCGC CAACGGCCAT
GATGATGTTG TTGAACTGCT GATAGCCCAG AAGGCTGATA TTAATGCCAG GGCCAGAAAT
AATACCACGG CCTTGCAACT GGCAGCCCAA AGCAATTATC CCCGAATAGC CATGCGCCTC
CTGGAAAACG GGGCAAAGAT TGATTCCCAA CAAGCGGATG ACAGTGCCAC GCTGCTGGTC
ACATCCGCTG AAAACGGAAA CGCCACTATT GTGAAGATGC TTTTGGACAT GGGAGTAGAC
ATCGAGTCTC GGGAGAAAAA AGACGGAAGT ACGGCGTTAA TCAAAGCAGC CGCCAAAAAC
AATCTGGAAG TTGCGGAAAT TCTGCTGAAA AAGGGCGCAA ATGTTGATGG GCGGGACAGG
AGCGGGTGTA CGGCGTTTTA TAGGGCGACG GAAAACGGAT ACGTGGAAAT GGCGAAACTG
CTGCATTCGC ACGGGGCTGA CATTAACGGG TCGGTGGAAA ACGGTTACAC GCCGTTGATT
GCCGCCGCTT TGGCAAATAA CATCGAAATG GTAAAATTCC TGCTGGACCG AAAAGCCGGG
ATTGACATGC AGGCTAGGAA CAATTCAACC GCTCTATCAG TGGCGGCTTA TGAGGGCAAC
AGAGAAGCAA TAAAGCTCCT CGTTAAATAT GGCGCGGACT GCAATGTCAG GGGGGAATTC
GGTCGCCTCC CATTTCACTC AGCCGCCGAT AGGGGGGATC TGGATATCTT GAAGCTTCTT
TTAACATGCA CCAGGGATGT GAATGCCAGG GACGCTTCAG GAAATACGGT ACTTATGTCT
GCATGTGGCA GTGGCGATGC GAATGTTGTC GCTTACCTGC TGACCAGGAA ACTGGAGGTA
AATGTAACGG ACAATTACGG TACCACCCCG CTGATGCGCG CCAGCAGCAG CGGTTACACC
GATATCGCCG ATATTTTAAT AAAATCCGGG GCCGATATTA ATGCCAGAAA CTATAAAGGC
AATTCCGCGT TGTCAGAGGC AGCGGACCGA GGGCAGCTCG ATATGGTTAG ATTTTTAATC
AACAAGGGAG CTGATGTAAA TTTCGCGAAT AACGATGGTG ACTATCCGAT AGGACTAGCG
GCCCGGACCA ACCGCCTCAT GGTCGTAGAA GTTCTTCTTG ATACAGCAAG CCCGGATGCC
GTCAACAGAG CCTTAAGATC AACGATAAAA GGTGGTTATC TTGAAATCGC CAAACGTCTG
TTGAAAAAAA ACGCGGACCC GAACTTTCTT TATAATTCGG ACATGTCACC ACTTATTATG
GCAGTCAATT ATGTCCACAT GGGGATGGTG GAGCTTTTGC TGTCACACGG CGCGGATCTG
GACTATCGGG ACAAGAACGG CAGAACCGCT CTCATGTGGG CGTCACAACG AGGCTTGACC
AGCATCGCGC AATGCCTCCT GAAAAACGGC GCTGATGTCA ACGTCAAAGA CAAAAACCAG
GAAACTGCAT TAAAGTACAC GGCCCAAATG GGGAATATAC CGCTTATGGA TATGCTTCTG
GCAAACGGCG CTGCCCCGAG CAACTATGGC ACGCCCGAGA TCGTTTCCGC AGCCGTCAAT
GAAGATATCA ATATGGCGGA GCTTTTGCTG AAGCATGGCG CAGATATTAA CGCCCAAGAC
AGGTCGGGGG ATACGGCGCT GATGAAGGCG GCAGAGAAAG GGTCCCCGGA AATGACAAAT
TTTCTTTTGC GAAACCATGC GAAAACAGAC ACAGTCAACC GAAGCGGGGC GTCCGCTTTT
TTACTTGCAT GCCGGAACGG CAATCAGGCA ATTATTGAAA TGCTGCTGGA AAAAGGTGCT
GACATTGATG CTGTCGACAA AAGCGGCAAC ACAGCGCTGT TGAGCGCTGT CATGTCAAGA
AACTGGGAAC TTGTGAAATT CCTTATATCA AAGGGAGCGG ATGTTAATAC AACGAACAGC
CGGGGCTATT CAGTCCTGGC TGTTGCAGAG GAAGTAAAAG CGCCCGCAGA TGTTATAAAA
CTGCTGAAAA AGAAAAACGC CAGATCCACC AGGACCAGAA CCGGCTCTGG CACTGTGCTG
CAATGA
 
Protein sequence
MGRQIIIFFL LLLISTLFFG SGFAVAEDIN TAFVSACREG DYETVVRLLD KGADVNFGNR 
DYNSPLIGAV QSGRMDIVDL LLEKGADINQ ANRNGYTPLM TASSKCRLDM IKYFIDRGAD
INARTRSKNT TIMSAVHAGC AEAVKLLILN GADLNDRDDH GDTLLHIAAR SPRDAPGIIH
LLLDRGADIE ARNNQKKTPL IYAAGKPKSL KVLLEQGADI HAVDIHGDTV ITTGSMKDNP
EAIQVLLQAG CDVNIRNKET GKTPLMEACV NGHINTAECL IKNRADVNAG YVLRTSGFQN
MPRVYSSPDV AFISVAGTSY TDAENRETII YKENGMTPLM EVSQRGFCDI AALLIKNRAR
INTASESGQT ALMMACANGH DDVVELLIAQ KADINARARN NTTALQLAAQ SNYPRIAMRL
LENGAKIDSQ QADDSATLLV TSAENGNATI VKMLLDMGVD IESREKKDGS TALIKAAAKN
NLEVAEILLK KGANVDGRDR SGCTAFYRAT ENGYVEMAKL LHSHGADING SVENGYTPLI
AAALANNIEM VKFLLDRKAG IDMQARNNST ALSVAAYEGN REAIKLLVKY GADCNVRGEF
GRLPFHSAAD RGDLDILKLL LTCTRDVNAR DASGNTVLMS ACGSGDANVV AYLLTRKLEV
NVTDNYGTTP LMRASSSGYT DIADILIKSG ADINARNYKG NSALSEAADR GQLDMVRFLI
NKGADVNFAN NDGDYPIGLA ARTNRLMVVE VLLDTASPDA VNRALRSTIK GGYLEIAKRL
LKKNADPNFL YNSDMSPLIM AVNYVHMGMV ELLLSHGADL DYRDKNGRTA LMWASQRGLT
SIAQCLLKNG ADVNVKDKNQ ETALKYTAQM GNIPLMDMLL ANGAAPSNYG TPEIVSAAVN
EDINMAELLL KHGADINAQD RSGDTALMKA AEKGSPEMTN FLLRNHAKTD TVNRSGASAF
LLACRNGNQA IIEMLLEKGA DIDAVDKSGN TALLSAVMSR NWELVKFLIS KGADVNTTNS
RGYSVLAVAE EVKAPADVIK LLKKKNARST RTRTGSGTVL Q