Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1791 |
Symbol | |
ID | 5694631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 2167666 |
End bp | 2169675 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641264389 |
Product | hypothetical protein |
Protein accession | YP_001529672 |
Protein GI | 158521802 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0011076 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAACAA AAAAAACAGC AAAATTGTTG TCGGTATTGG CCCTGCTGGG GCTGATGGTG TTTTGGCAAT GTGCCATTCC ACAGCAGGCC GTCGCCGAAG AGATTGGAGA AATTGATTTT GGTGACGCAC CGGATCCGAC ATACCCTACA CGGATGGCCA GTAATGGAGC CGGTCACATC ATTGTGCCGT CGTACTTTAT GGGTGCCGGC ATAGACGGTG AACCGGACGG GCAACCGGAT CCCAATGCCA TGGGAGATGA TAATGACGGG AATGACGATG AAGATGGTGT GATTTTCAAC GCCGCTCTCA CGCCCTGCGC CCAGATTCCG GTAACCATCA TCGCATCGGC GCCGGGTTTC ATCGACGCAT GGATCGACTT CAATATAGAC GGCGACTGGG CAGACGCCGG GGAACAGATA TTTACGAGCC ATCCGGTGGT GGCCGGTGTA AATCCCTTAA ACTTCTCCGT CCCCTGCAAT TCGACTTTAG GGCAAACCTT TGCCCGGTTC CGGTTCAGCT CCACCGGCGG ATTGTCATAC ACAGGCCTTG CCATGGATGG TGAGGTGGAG GACTACGCGG TATGGCTTGA AGAGTCACTG GAAGAGTATC TTGATTGCGG TGATGCCCCC GACCCTTCCT ATCCGACACT GCTTGCCAAT AACGGCGCGT GTCATATCAT TGTGCCGCAA TACTATCTTG GCAACTCCAT AGACAACGAA CCTGATGGAC AGCCCGATCC CAACGCCCTG GGAGATGACA ACAACAATCT TGACGATGAA GACGGAGTGG CCATTCCCAG CGTTCTGACA CTTGGAATGC AAGCCCCCCT GATTGTCACG GCATCGGCAC CCGGTTTTAT TGATGCATGG ATTGATTTCA ACGCCGACGG CGACTGGGCA GATGCCGGAG AGCAGATATT CGTGAGCCAG CCTGTAGTGG CCGGGGCCAA CATCCTGAAC GTTACTCCCC CCACGAATGC AGTCCCAGGC AAAACCTTTG CCCGGTTCCG GTTCAGCTCC ACCGGCGGAT TGTCATACAC AGGTCTTGCC ACGGATGGCG AGGTGGAGGA CTATGCGGTA CGCATTGAAA GAGGGATCGG TCTCAACCTG ATAGAATGGA ACGGCAACCT GGTGGCCGAT TTTGGGAAAA ATGGCCTGTG GTACCACAAC GGTACAAGTT GGAACTGGAT GACCAACAAG GGTTATGTGG GGCAGATGGC AGTCTGGGGC GGTAACCTGG TGGTGGATTT CGGCGCCGGT CACGGCTTGC AGTACTATAA TGGCACTTCC TGGACCTGGA TGAGCAACAA AGGCGGCGTG AATGCCATGA CCACCTGGCA CGACGGCTCA ACAGAAAGGC TGGTGGTGGA CTTTGGCGGA GGACGGCGGG TCTACACCTA CAATGGTGCA TGGAGCTGGC TGTCCAACAA GGACGACGTC AACGCCATGA ATGTCTGGAA CAACAAGCTG GTGGTCGATT TTGGGGCCGG CCGGGGTGTG TATAATTACG ACACCTCCTG GCACTGGATG TCCAACAAGG ACGACATTGC GCTGTGGACC CTGTGGAACA ACGGCTCCAC CGAGCCCCTG GTGGTGGACT TTGGCGGAGG CCGACGGGTG TACACCTATA ACGGCGCATG GAGCTGGCTC ACCAACAAGG ACGATGTCAA TGACATGGCC GTGTGGAACA ACAGGCTGGT GATTGATTTT GGCGCCGGCC GCGGCCTCTA TAACAACAAC GGTACCTGGA ACTGGATGTC GAACAAGGAT GACACGGCGC ACATGGTGCC CTGGAACAAC GGCACCGGTG ATCAGTTGGC CGTGGATTTT GGCAATGGCC GGAATATGTA CAACTATAAC GGTGCCTGGA ACTGGATCAA AAACGCCAAC AACGTGCCGG AAATGGTGGC CTGGAACAAT TGCCTGGCCG CGGACTTCGG CTCCGGCGTG GGCATCTACA ATTACAACGG TGCCTGGAAC TTCATGAAAT CCTGGAGTAC GGCGGACTGA
|
Protein sequence | MRTKKTAKLL SVLALLGLMV FWQCAIPQQA VAEEIGEIDF GDAPDPTYPT RMASNGAGHI IVPSYFMGAG IDGEPDGQPD PNAMGDDNDG NDDEDGVIFN AALTPCAQIP VTIIASAPGF IDAWIDFNID GDWADAGEQI FTSHPVVAGV NPLNFSVPCN STLGQTFARF RFSSTGGLSY TGLAMDGEVE DYAVWLEESL EEYLDCGDAP DPSYPTLLAN NGACHIIVPQ YYLGNSIDNE PDGQPDPNAL GDDNNNLDDE DGVAIPSVLT LGMQAPLIVT ASAPGFIDAW IDFNADGDWA DAGEQIFVSQ PVVAGANILN VTPPTNAVPG KTFARFRFSS TGGLSYTGLA TDGEVEDYAV RIERGIGLNL IEWNGNLVAD FGKNGLWYHN GTSWNWMTNK GYVGQMAVWG GNLVVDFGAG HGLQYYNGTS WTWMSNKGGV NAMTTWHDGS TERLVVDFGG GRRVYTYNGA WSWLSNKDDV NAMNVWNNKL VVDFGAGRGV YNYDTSWHWM SNKDDIALWT LWNNGSTEPL VVDFGGGRRV YTYNGAWSWL TNKDDVNDMA VWNNRLVIDF GAGRGLYNNN GTWNWMSNKD DTAHMVPWNN GTGDQLAVDF GNGRNMYNYN GAWNWIKNAN NVPEMVAWNN CLAADFGSGV GIYNYNGAWN FMKSWSTAD
|
| |