Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1733 |
Symbol | |
ID | 5694571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 2081143 |
End bp | 2083383 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641264329 |
Product | hypothetical protein |
Protein accession | YP_001529614 |
Protein GI | 158521744 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.658961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCA AAAACCAGTT TTACCTGATT TTGTCCCTGG CCATGGTGGT GATCTTCTGG CCGATAGCCG CTTTCCCGGC AGCCCCCTTT CCGGTGCCCG ATACCGGCCA GGCCACCTGC TATGACGCGG ATGGCAATGT GCTTGATCCC TGCCCCGCAG AGGGTGAAAA TTTTTACGGC CAGGATGCAC AATACACCAT CAACCCACCA TCCTATATCA AACTGAATGC AAGCGGGCAG GCACTGGACG ATGACGCCAC CGACTGGGTT ATGGTAAAAG ACAGTCTGAC CGGCCTGATC TGGGAAGTAA AAACCGATGA CGGAGATATT CATGACAAGG ACAATAGATA TAACCTTGCG GCCGCTCTTT CTTTTATCGC CCAGCTAAAT ACCGACCAGT TCGGCGGATA TACTGACTGG CGTCTGCCGA CAAGGTTGGA ACTGACTTAT ATTGTGGATT ACAGTTACTG CGATCCCGCC ATTGATATCA ATTATTTCCC GTTCACCACT TATGATTTCG ACTGGTCATC GACGCCAAGA CCCAATGACG AAACAACCCC TTCATGGCAG GTCGAATTTT TCTATGGCCA TGACTCTTTT GCTTCTCCTG CAAACACCGT AAGCGTACGT GCTGTTCGCG GAGAAGCGAT TGGGTCGGCC AGCCTGGTTG ACAACGAGGA CGGTACGATC ACGGATGAAA GTACCGGCCT GATGTGGAAC AAGTTTACTG CCGATTTGAA CGATGACACG GTTATTGATA CAAATGACAA AATGGACTGG CGGGAGGCGC TGGCCTGGTG CGAAAGCCTG GATCTGGCCG GCCATTCCGA CTGGCGTCTG CCGACCATCA AGGAATTGAA ATCGATAGTT GACTCCGGCA CTGACGCACC GGCCTTTGAT ACAACCTTCT TCCCGGATAC GGTGCCGGAC AATTTCTACT GGTCATCGAC AACCTATGCC TCTTCCAAGG ATGTCGCATG GGGCATTTTG TCTATGTCTG GCTGGAGCCG CTGGTTTGAC AAATCCAGTT CATATTATGT ACGCGCAGTC CGTGACGGGC AGGACGAAAA CATCCCGCCC ACGGCCGATG CCGGAGAAGA CCGTGCTGTC ATGGAAGGCA CCGTGGCGGC GCTGGACGGT TCCGGCTCCT TTGATCCGGA TGACGGTATT GAGACATACG CATGGGAGCA GACCGGTGGT TCGGCGGTGA CCCTGTCCGA TGAAACCGCC GCCCAGCCGA CCTTTACGGT TCCCGCCGGC ATTCTCGGGC AGACCCTGAC CTTTGCGCTG ACGGTCACCG ATTATGCCGG GGCAAGCGAT ACGGACAGCG TCACCCTGAC CGTGACGGAG TTTGCCTGCA TCACGCTGCC GGACAAGCCG TTACTGATCT CTCCGTCCGA CGGCGCCGTC AATGTCAGCC TGACCCCGAC GTTGCAGGCC GACGCATACA ACGATCCGGG CGCGTGCAGC ACCCATTTTA AAACCCGCTG GCAGATCAGC GACCAGGCCG ATTTCTCCGG CCTGACCTAT AACGCCAACA CCTTTGGTGC TGCCCTGACC TCCCACCAGG TAACCAAAAT GGTGCTGGAA CCCGCTACCA CCTACTACTG GCGGGTCAGG TACTGGGGCG ACCACGGCCT GAAATCGGAG TGGTCCGACG TGTGGACCTT TACCACCGCA TCGGACAACC GGGACGCCAA CGGCAATGGT GTGCCGGATG ACCAGGAGTC TTCCCCGAAT ACCGATATCA ACGACAACGG CACACCCGAT TCGGAAGAGA CCAGCCTCAG GGCGTTATTG ACGGCGGACG GGACCGCCGA TATCGGCCTG GAGTTTGGCA AGGATATTCT GGCCATTGCC GTTGAAGCCA TGAGTGAAAC CGAAATCAGC GGAGACCTGT CCGGCGACTT TCCTTACGGC CTGATCGGCT TCAGGGTGAA CACCGCCACA CTCGGCGCAA CCGTCAACTT TACAGTTTAT ATGGCGGACA GGGCACCGGA CAGCGCCTTC TGGTATAAGT ATGACAATGT CAATGGCTGG TATGACTTTA CCGGGCAGGT GACCTTCAGC GCCAACCGGA AATCCCTGTC ATTTACCCTG ACGGACGGCG GTACCGGAGA CGCGGACGGG GTGGTTAACG GTGTCATTGT TGATCCCGCC GGCCTGGTTG TGGAAACGGT CGTGCCCTCC GGCGGGGGCT CCGGATCGGG CTGTTTTGTT GAAACAGTAA CCGGGTTTTA A
|
Protein sequence | MKTKNQFYLI LSLAMVVIFW PIAAFPAAPF PVPDTGQATC YDADGNVLDP CPAEGENFYG QDAQYTINPP SYIKLNASGQ ALDDDATDWV MVKDSLTGLI WEVKTDDGDI HDKDNRYNLA AALSFIAQLN TDQFGGYTDW RLPTRLELTY IVDYSYCDPA IDINYFPFTT YDFDWSSTPR PNDETTPSWQ VEFFYGHDSF ASPANTVSVR AVRGEAIGSA SLVDNEDGTI TDESTGLMWN KFTADLNDDT VIDTNDKMDW REALAWCESL DLAGHSDWRL PTIKELKSIV DSGTDAPAFD TTFFPDTVPD NFYWSSTTYA SSKDVAWGIL SMSGWSRWFD KSSSYYVRAV RDGQDENIPP TADAGEDRAV MEGTVAALDG SGSFDPDDGI ETYAWEQTGG SAVTLSDETA AQPTFTVPAG ILGQTLTFAL TVTDYAGASD TDSVTLTVTE FACITLPDKP LLISPSDGAV NVSLTPTLQA DAYNDPGACS THFKTRWQIS DQADFSGLTY NANTFGAALT SHQVTKMVLE PATTYYWRVR YWGDHGLKSE WSDVWTFTTA SDNRDANGNG VPDDQESSPN TDINDNGTPD SEETSLRALL TADGTADIGL EFGKDILAIA VEAMSETEIS GDLSGDFPYG LIGFRVNTAT LGATVNFTVY MADRAPDSAF WYKYDNVNGW YDFTGQVTFS ANRKSLSFTL TDGGTGDADG VVNGVIVDPA GLVVETVVPS GGGSGSGCFV ETVTGF
|
| |