Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1096 |
Symbol | |
ID | 5693930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 1301718 |
End bp | 1304903 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641263690 |
Product | ankyrin |
Protein accession | YP_001528980 |
Protein GI | 158521110 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0912242 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAGAC AAATAATCAT TTTTTTTCTT TTGCTTCTAA TATCAACTTT GTTTTTCGGA AGCGGTTTTG CAGTTGCTGA AGATATAAAC ACGGCGTTTG TTTCAGCCTG CAGGGAAGGA GATTATGAAA CCGTGGTGCG CCTGCTGGAT AAGGGCGCGG ATGTTAATTT CGGGAATAGG GACTACAATT CACCGTTAAT CGGGGCGGTA CAGTCCGGCA GGATGGATAT TGTCGATCTC CTCCTTGAAA AAGGCGCTGA TATTAACCAG GCTAACAGAA ATGGCTATAC GCCTTTAATG ACGGCGTCGT CAAAATGCCG GCTTGATATG ATAAAATATT TCATTGACCG GGGCGCGGAC ATTAACGCCA GAACCCGGTC AAAAAACACG ACGATCATGA GCGCCGTTCA TGCGGGATGC GCGGAAGCCG TCAAACTCTT GATTTTAAAT GGTGCGGATT TAAACGACAG GGATGATCAT GGTGATACGC TGTTGCATAT TGCCGCCAGA AGCCCCCGTG ACGCGCCTGG AATCATACAC CTGCTTTTGG ACCGGGGCGC TGATATCGAA GCCAGAAATA ACCAGAAGAA GACACCGTTG ATTTATGCTG CCGGCAAACC GAAATCTTTG AAAGTGCTGC TCGAACAGGG CGCGGACATT CACGCGGTGG ATATTCATGG AGACACTGTT ATCACCACAG GCTCAATGAA GGACAATCCT GAAGCGATAC AAGTTCTTTT GCAAGCCGGT TGTGACGTAA ACATCAGGAA TAAAGAAACG GGCAAAACCC CTTTAATGGA AGCATGTGTA AATGGGCACA TCAACACGGC TGAATGCCTG ATTAAAAACA GGGCGGATGT TAACGCGGGC TACGTCCTTA GAACATCGGG TTTTCAAAAT ATGCCCCGCG TGTATAGCAG CCCCGACGTC GCTTTTATCT CCGTTGCCGG TACCTCATAC ACTGACGCTG AAAACCGTGA AACTATTATA TATAAGGAAA ACGGTATGAC ACCGTTAATG GAAGTGTCTC AGCGGGGATT TTGTGATATT GCAGCCCTGC TGATAAAAAA CAGGGCCAGA ATCAATACCG CGTCAGAAAG CGGGCAAACC GCGCTGATGA TGGCATGCGC CAACGGCCAT GATGATGTTG TTGAACTGCT GATAGCCCAG AAGGCTGATA TTAATGCCAG GGCCAGAAAT AATACCACGG CCTTGCAACT GGCAGCCCAA AGCAATTATC CCCGAATAGC CATGCGCCTC CTGGAAAACG GGGCAAAGAT TGATTCCCAA CAAGCGGATG ACAGTGCCAC GCTGCTGGTC ACATCCGCTG AAAACGGAAA CGCCACTATT GTGAAGATGC TTTTGGACAT GGGAGTAGAC ATCGAGTCTC GGGAGAAAAA AGACGGAAGT ACGGCGTTAA TCAAAGCAGC CGCCAAAAAC AATCTGGAAG TTGCGGAAAT TCTGCTGAAA AAGGGCGCAA ATGTTGATGG GCGGGACAGG AGCGGGTGTA CGGCGTTTTA TAGGGCGACG GAAAACGGAT ACGTGGAAAT GGCGAAACTG CTGCATTCGC ACGGGGCTGA CATTAACGGG TCGGTGGAAA ACGGTTACAC GCCGTTGATT GCCGCCGCTT TGGCAAATAA CATCGAAATG GTAAAATTCC TGCTGGACCG AAAAGCCGGG ATTGACATGC AGGCTAGGAA CAATTCAACC GCTCTATCAG TGGCGGCTTA TGAGGGCAAC AGAGAAGCAA TAAAGCTCCT CGTTAAATAT GGCGCGGACT GCAATGTCAG GGGGGAATTC GGTCGCCTCC CATTTCACTC AGCCGCCGAT AGGGGGGATC TGGATATCTT GAAGCTTCTT TTAACATGCA CCAGGGATGT GAATGCCAGG GACGCTTCAG GAAATACGGT ACTTATGTCT GCATGTGGCA GTGGCGATGC GAATGTTGTC GCTTACCTGC TGACCAGGAA ACTGGAGGTA AATGTAACGG ACAATTACGG TACCACCCCG CTGATGCGCG CCAGCAGCAG CGGTTACACC GATATCGCCG ATATTTTAAT AAAATCCGGG GCCGATATTA ATGCCAGAAA CTATAAAGGC AATTCCGCGT TGTCAGAGGC AGCGGACCGA GGGCAGCTCG ATATGGTTAG ATTTTTAATC AACAAGGGAG CTGATGTAAA TTTCGCGAAT AACGATGGTG ACTATCCGAT AGGACTAGCG GCCCGGACCA ACCGCCTCAT GGTCGTAGAA GTTCTTCTTG ATACAGCAAG CCCGGATGCC GTCAACAGAG CCTTAAGATC AACGATAAAA GGTGGTTATC TTGAAATCGC CAAACGTCTG TTGAAAAAAA ACGCGGACCC GAACTTTCTT TATAATTCGG ACATGTCACC ACTTATTATG GCAGTCAATT ATGTCCACAT GGGGATGGTG GAGCTTTTGC TGTCACACGG CGCGGATCTG GACTATCGGG ACAAGAACGG CAGAACCGCT CTCATGTGGG CGTCACAACG AGGCTTGACC AGCATCGCGC AATGCCTCCT GAAAAACGGC GCTGATGTCA ACGTCAAAGA CAAAAACCAG GAAACTGCAT TAAAGTACAC GGCCCAAATG GGGAATATAC CGCTTATGGA TATGCTTCTG GCAAACGGCG CTGCCCCGAG CAACTATGGC ACGCCCGAGA TCGTTTCCGC AGCCGTCAAT GAAGATATCA ATATGGCGGA GCTTTTGCTG AAGCATGGCG CAGATATTAA CGCCCAAGAC AGGTCGGGGG ATACGGCGCT GATGAAGGCG GCAGAGAAAG GGTCCCCGGA AATGACAAAT TTTCTTTTGC GAAACCATGC GAAAACAGAC ACAGTCAACC GAAGCGGGGC GTCCGCTTTT TTACTTGCAT GCCGGAACGG CAATCAGGCA ATTATTGAAA TGCTGCTGGA AAAAGGTGCT GACATTGATG CTGTCGACAA AAGCGGCAAC ACAGCGCTGT TGAGCGCTGT CATGTCAAGA AACTGGGAAC TTGTGAAATT CCTTATATCA AAGGGAGCGG ATGTTAATAC AACGAACAGC CGGGGCTATT CAGTCCTGGC TGTTGCAGAG GAAGTAAAAG CGCCCGCAGA TGTTATAAAA CTGCTGAAAA AGAAAAACGC CAGATCCACC AGGACCAGAA CCGGCTCTGG CACTGTGCTG CAATGA
|
Protein sequence | MGRQIIIFFL LLLISTLFFG SGFAVAEDIN TAFVSACREG DYETVVRLLD KGADVNFGNR DYNSPLIGAV QSGRMDIVDL LLEKGADINQ ANRNGYTPLM TASSKCRLDM IKYFIDRGAD INARTRSKNT TIMSAVHAGC AEAVKLLILN GADLNDRDDH GDTLLHIAAR SPRDAPGIIH LLLDRGADIE ARNNQKKTPL IYAAGKPKSL KVLLEQGADI HAVDIHGDTV ITTGSMKDNP EAIQVLLQAG CDVNIRNKET GKTPLMEACV NGHINTAECL IKNRADVNAG YVLRTSGFQN MPRVYSSPDV AFISVAGTSY TDAENRETII YKENGMTPLM EVSQRGFCDI AALLIKNRAR INTASESGQT ALMMACANGH DDVVELLIAQ KADINARARN NTTALQLAAQ SNYPRIAMRL LENGAKIDSQ QADDSATLLV TSAENGNATI VKMLLDMGVD IESREKKDGS TALIKAAAKN NLEVAEILLK KGANVDGRDR SGCTAFYRAT ENGYVEMAKL LHSHGADING SVENGYTPLI AAALANNIEM VKFLLDRKAG IDMQARNNST ALSVAAYEGN REAIKLLVKY GADCNVRGEF GRLPFHSAAD RGDLDILKLL LTCTRDVNAR DASGNTVLMS ACGSGDANVV AYLLTRKLEV NVTDNYGTTP LMRASSSGYT DIADILIKSG ADINARNYKG NSALSEAADR GQLDMVRFLI NKGADVNFAN NDGDYPIGLA ARTNRLMVVE VLLDTASPDA VNRALRSTIK GGYLEIAKRL LKKNADPNFL YNSDMSPLIM AVNYVHMGMV ELLLSHGADL DYRDKNGRTA LMWASQRGLT SIAQCLLKNG ADVNVKDKNQ ETALKYTAQM GNIPLMDMLL ANGAAPSNYG TPEIVSAAVN EDINMAELLL KHGADINAQD RSGDTALMKA AEKGSPEMTN FLLRNHAKTD TVNRSGASAF LLACRNGNQA IIEMLLEKGA DIDAVDKSGN TALLSAVMSR NWELVKFLIS KGADVNTTNS RGYSVLAVAE EVKAPADVIK LLKKKNARST RTRTGSGTVL Q
|
| |