Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_2048 |
Symbol | |
ID | 5694891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 2490537 |
End bp | 2493455 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641264649 |
Product | YD repeat-containing protein |
Protein accession | YP_001529929 |
Protein GI | 158522059 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGATGG ATGCGGCATA TAACACCATT GCCGGATTTG TGCGGGGGAT GCCCTTTCTA TCTTCTGCGG ACGGCCGCCT GATTGCCACA TCCGCTCCGG GCCTGGCCGA TACGCTTTAT GAATATGACG AATTGGGCAA CATGGTCCGC TCCGGTCTGG ATGTCAATGC CAACGGCGTT CTGGATGACG GGTCGTCAGA CCGTATTCAG GAAAGCACCA CCCAGTACAC CAGTGACGGC GCCGACTGGT GGCAGGAAAC CACCGGCAAA ATTTTTGCCG TCACCGGCAG CAGTACCCCC ACCACCACGG GCATTCAAAA AACCCGGCTC ACCGGCCTGG GCACAAACGG CCTTGTCAAT GAGACCGTTT CTACTGACAT TAACGGCAAC CAGACGACAG CCCGGACCGT TATCGACCCT GCCACCAAAA CTGTAACGCA GACCGTGGAC TACCCGGATG CATCCTTTAA CGAAATTACT GTCACGACAA ACGGGTTGTT GCAATCCCGG CAGAGCAAAA CCGGGGTGAC CACCACCTTC TCGTATGACG GCCTGGGCCG ACGTACAGGG GCCACCGATT CCCGCACCGG CACCACTGTT ACCCATTACA ACAGCCTGGG CCAGGTGGAT TATGTGGAAG ATGCCGCAGG CAGTCGGACT ACTTTTACCT ATGATGCAGC CGGCCGCCGG ACAACAGAGA CCAATGCTCT TGGCAAAACC ATCCGCTACG CCTACAACAG CCTCGGGCAA CTGGCGTATA AATGGGGCAG CGCGGTCGAG CCGGTAAAGT TTGTGTATGA CGCTTACGGC CAGATGACCC AGATGCACAT GTACCGCAAC GGCTCTGGCT GGGACGGGGC AGAGTGGCCA TCAGGTTCCA CCGGAGATCC AGATATCACC ACCTGGACCT ATCAGCCGTC CACCGGCCTG CTCACCGCCA AGACCGATGA TGAGGCCAAA TCCACGGCCT ACACCTATAC TGCTGCCAAC CGGCTGGCTA CCCGCACCTG GGCACGGGAT AACGGCACCC TTGTCACCAC CTACACCTAT GATCTGAATA CCGGGGAACT CACGGGTATT GATTATTCTG ACACCACCCC GGATGTGGCC TATGCCTACA CCCGGGCCGG GCAGGTGTAT ACCGTGGACG ATGTTGTGGG CACGCGCACC TTTGCCTATA ATAGTGCCCT GCAACCTGTC ACAGAAACCA TCGACGGCGC TTCCGGCGGG CTGTACAGCA AAACCATCAC CCGCACATAC GAGACCTCCG GCGTGGTGGG CCGGCCCACC GGGTTGAGCC TTACAGGCTA CAGCGTGGCC TATGGGTATG AGCCTTCCAC CGGCCGGTTT GCAGGCGTCA CCTGGAACAC CGGGGCCGGA GAAAAGACGG CCACCTATGC GTATGTGGCC GACTCTGATT TTGTCGACAC CCTGACCATG GAGAATCTGG TCACGGATTT TGTCTATGAG CCGAACCGGG ACCTCAAAAC CCAGGTCAGG CACACGTTTG GTGGGGCCGA TATTGCCCAA TATGATTATA CCTACACCGC CCTTGGCCAG CGCAAGACCA TGGACCTGAC CCCGGACCTG CCGGATACCC TGGTCGAGGC TACCACCACC TATACGCCCG ATAATTTAAA CCAGTACGAT GCCATTGAAA CTGGCGGCGT AACAGATAAC CCGGTCTATG ATGATGACGG CAACCTTACC CACCAGCAGG GCATGGTTTA TGCCTGGAAC GGGGAAAATC GCCTCATCTC CGTGGAACCC GAGACCCCGA CCGAAGGAAA TACCAAACTG GCCTTTGTGT ACGACTACAT GGGCCGAAGG GTGAAGAAGT CTGTCTATAC ATTCAGCGCC GGCAGCTACC AGCTGTCAGC TGCCAGCTTA TTTGTCTATG ACGGGTGGAA CCTGATCCAG GAGCTGGATG GAACCGGTGC GGTGCAAAAG TCCTATGTGT GGGGCCTTGA CCTCTCCCAG AGCCTGCAAG GAGCCGGTGG CGTGGGCGGC CTGCTGGCCA TGACCGACGG GGTGAATACT TACCTCTACT GCCATGATGC CAACGGCAAC GTGGGCCGCA TGGTGAGCGC GGCAGATGGA ACGGTTGCGG CAGCTTATGA ATATGCCCCG TTTGGCGGGC TGATTCATAA GAGCGGGGCC ATGGCGGATG AAAATGTGTT TCGGTTCTCG ACGAAGTATT TTGATGGGGA GAGTGGGCTG TATTATTACG GATACCGGTA TTATGAGCCG GAGATGGGGA GGTGGATGTC GAGGGATCCG TTGGGGGAAG AGGGCGGATA TAATTTGTAT GGGTTTGTGG GGAATGATGC TGTTAATGAT TATGATCCCT ATGGGCTTCG TAGCATAAAT TCATACAAAG AAGAGTTTAT GAGTATGTTA CTACTAGACA TGAAACGTAA GTATTATAAT ACATATATCG GTCCTGGTTT TTCCGATTCC TTAAAAAGTA GGGCGATCCT CTTTTGTGCT GCTGGCATTG ATTTTGATGA TACGATATGG GGATTTGCAA AAATGGGAGT TAAAGCCGTA GTTAATGCTA TTAGCTCTAC TACTAAACCA TTTGTAGAAT ACACCCTTAA AAAAGCAAGA GGTATATTGC AAAAACAGAC CGCGAATAAA GCGAGGGGTT TTATCTATAA TACTGTAATA ATAAAGAAAT ATACGAAAAG TGAATCTGAT TGCAAATGCA ATATGACTGT AAAATACTAT TCAGATTGGG ATGTATTTAC TGTGAATATA AAAGGGAAGG TTGGAAAAAC TGTTTACGAG TATGGGGAAA CTGAATGTGA TTGTTCAGAA GAATTTGACT ATAATTTTGA TGGGATAACT ACATGGGAAG AGGGATTTTT TAATACAGAA TATATTAACA ATGTTAATAT ATCAATAGGA GCGAAATAA
|
Protein sequence | MRMDAAYNTI AGFVRGMPFL SSADGRLIAT SAPGLADTLY EYDELGNMVR SGLDVNANGV LDDGSSDRIQ ESTTQYTSDG ADWWQETTGK IFAVTGSSTP TTTGIQKTRL TGLGTNGLVN ETVSTDINGN QTTARTVIDP ATKTVTQTVD YPDASFNEIT VTTNGLLQSR QSKTGVTTTF SYDGLGRRTG ATDSRTGTTV THYNSLGQVD YVEDAAGSRT TFTYDAAGRR TTETNALGKT IRYAYNSLGQ LAYKWGSAVE PVKFVYDAYG QMTQMHMYRN GSGWDGAEWP SGSTGDPDIT TWTYQPSTGL LTAKTDDEAK STAYTYTAAN RLATRTWARD NGTLVTTYTY DLNTGELTGI DYSDTTPDVA YAYTRAGQVY TVDDVVGTRT FAYNSALQPV TETIDGASGG LYSKTITRTY ETSGVVGRPT GLSLTGYSVA YGYEPSTGRF AGVTWNTGAG EKTATYAYVA DSDFVDTLTM ENLVTDFVYE PNRDLKTQVR HTFGGADIAQ YDYTYTALGQ RKTMDLTPDL PDTLVEATTT YTPDNLNQYD AIETGGVTDN PVYDDDGNLT HQQGMVYAWN GENRLISVEP ETPTEGNTKL AFVYDYMGRR VKKSVYTFSA GSYQLSAASL FVYDGWNLIQ ELDGTGAVQK SYVWGLDLSQ SLQGAGGVGG LLAMTDGVNT YLYCHDANGN VGRMVSAADG TVAAAYEYAP FGGLIHKSGA MADENVFRFS TKYFDGESGL YYYGYRYYEP EMGRWMSRDP LGEEGGYNLY GFVGNDAVND YDPYGLRSIN SYKEEFMSML LLDMKRKYYN TYIGPGFSDS LKSRAILFCA AGIDFDDTIW GFAKMGVKAV VNAISSTTKP FVEYTLKKAR GILQKQTANK ARGFIYNTVI IKKYTKSESD CKCNMTVKYY SDWDVFTVNI KGKVGKTVYE YGETECDCSE EFDYNFDGIT TWEEGFFNTE YINNVNISIG AK
|
| |