Gene Dole_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2048 
Symbol 
ID5694891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2490537 
End bp2493455 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content52% 
IMG OID641264649 
ProductYD repeat-containing protein 
Protein accessionYP_001529929 
Protein GI158522059 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGATGG ATGCGGCATA TAACACCATT GCCGGATTTG TGCGGGGGAT GCCCTTTCTA 
TCTTCTGCGG ACGGCCGCCT GATTGCCACA TCCGCTCCGG GCCTGGCCGA TACGCTTTAT
GAATATGACG AATTGGGCAA CATGGTCCGC TCCGGTCTGG ATGTCAATGC CAACGGCGTT
CTGGATGACG GGTCGTCAGA CCGTATTCAG GAAAGCACCA CCCAGTACAC CAGTGACGGC
GCCGACTGGT GGCAGGAAAC CACCGGCAAA ATTTTTGCCG TCACCGGCAG CAGTACCCCC
ACCACCACGG GCATTCAAAA AACCCGGCTC ACCGGCCTGG GCACAAACGG CCTTGTCAAT
GAGACCGTTT CTACTGACAT TAACGGCAAC CAGACGACAG CCCGGACCGT TATCGACCCT
GCCACCAAAA CTGTAACGCA GACCGTGGAC TACCCGGATG CATCCTTTAA CGAAATTACT
GTCACGACAA ACGGGTTGTT GCAATCCCGG CAGAGCAAAA CCGGGGTGAC CACCACCTTC
TCGTATGACG GCCTGGGCCG ACGTACAGGG GCCACCGATT CCCGCACCGG CACCACTGTT
ACCCATTACA ACAGCCTGGG CCAGGTGGAT TATGTGGAAG ATGCCGCAGG CAGTCGGACT
ACTTTTACCT ATGATGCAGC CGGCCGCCGG ACAACAGAGA CCAATGCTCT TGGCAAAACC
ATCCGCTACG CCTACAACAG CCTCGGGCAA CTGGCGTATA AATGGGGCAG CGCGGTCGAG
CCGGTAAAGT TTGTGTATGA CGCTTACGGC CAGATGACCC AGATGCACAT GTACCGCAAC
GGCTCTGGCT GGGACGGGGC AGAGTGGCCA TCAGGTTCCA CCGGAGATCC AGATATCACC
ACCTGGACCT ATCAGCCGTC CACCGGCCTG CTCACCGCCA AGACCGATGA TGAGGCCAAA
TCCACGGCCT ACACCTATAC TGCTGCCAAC CGGCTGGCTA CCCGCACCTG GGCACGGGAT
AACGGCACCC TTGTCACCAC CTACACCTAT GATCTGAATA CCGGGGAACT CACGGGTATT
GATTATTCTG ACACCACCCC GGATGTGGCC TATGCCTACA CCCGGGCCGG GCAGGTGTAT
ACCGTGGACG ATGTTGTGGG CACGCGCACC TTTGCCTATA ATAGTGCCCT GCAACCTGTC
ACAGAAACCA TCGACGGCGC TTCCGGCGGG CTGTACAGCA AAACCATCAC CCGCACATAC
GAGACCTCCG GCGTGGTGGG CCGGCCCACC GGGTTGAGCC TTACAGGCTA CAGCGTGGCC
TATGGGTATG AGCCTTCCAC CGGCCGGTTT GCAGGCGTCA CCTGGAACAC CGGGGCCGGA
GAAAAGACGG CCACCTATGC GTATGTGGCC GACTCTGATT TTGTCGACAC CCTGACCATG
GAGAATCTGG TCACGGATTT TGTCTATGAG CCGAACCGGG ACCTCAAAAC CCAGGTCAGG
CACACGTTTG GTGGGGCCGA TATTGCCCAA TATGATTATA CCTACACCGC CCTTGGCCAG
CGCAAGACCA TGGACCTGAC CCCGGACCTG CCGGATACCC TGGTCGAGGC TACCACCACC
TATACGCCCG ATAATTTAAA CCAGTACGAT GCCATTGAAA CTGGCGGCGT AACAGATAAC
CCGGTCTATG ATGATGACGG CAACCTTACC CACCAGCAGG GCATGGTTTA TGCCTGGAAC
GGGGAAAATC GCCTCATCTC CGTGGAACCC GAGACCCCGA CCGAAGGAAA TACCAAACTG
GCCTTTGTGT ACGACTACAT GGGCCGAAGG GTGAAGAAGT CTGTCTATAC ATTCAGCGCC
GGCAGCTACC AGCTGTCAGC TGCCAGCTTA TTTGTCTATG ACGGGTGGAA CCTGATCCAG
GAGCTGGATG GAACCGGTGC GGTGCAAAAG TCCTATGTGT GGGGCCTTGA CCTCTCCCAG
AGCCTGCAAG GAGCCGGTGG CGTGGGCGGC CTGCTGGCCA TGACCGACGG GGTGAATACT
TACCTCTACT GCCATGATGC CAACGGCAAC GTGGGCCGCA TGGTGAGCGC GGCAGATGGA
ACGGTTGCGG CAGCTTATGA ATATGCCCCG TTTGGCGGGC TGATTCATAA GAGCGGGGCC
ATGGCGGATG AAAATGTGTT TCGGTTCTCG ACGAAGTATT TTGATGGGGA GAGTGGGCTG
TATTATTACG GATACCGGTA TTATGAGCCG GAGATGGGGA GGTGGATGTC GAGGGATCCG
TTGGGGGAAG AGGGCGGATA TAATTTGTAT GGGTTTGTGG GGAATGATGC TGTTAATGAT
TATGATCCCT ATGGGCTTCG TAGCATAAAT TCATACAAAG AAGAGTTTAT GAGTATGTTA
CTACTAGACA TGAAACGTAA GTATTATAAT ACATATATCG GTCCTGGTTT TTCCGATTCC
TTAAAAAGTA GGGCGATCCT CTTTTGTGCT GCTGGCATTG ATTTTGATGA TACGATATGG
GGATTTGCAA AAATGGGAGT TAAAGCCGTA GTTAATGCTA TTAGCTCTAC TACTAAACCA
TTTGTAGAAT ACACCCTTAA AAAAGCAAGA GGTATATTGC AAAAACAGAC CGCGAATAAA
GCGAGGGGTT TTATCTATAA TACTGTAATA ATAAAGAAAT ATACGAAAAG TGAATCTGAT
TGCAAATGCA ATATGACTGT AAAATACTAT TCAGATTGGG ATGTATTTAC TGTGAATATA
AAAGGGAAGG TTGGAAAAAC TGTTTACGAG TATGGGGAAA CTGAATGTGA TTGTTCAGAA
GAATTTGACT ATAATTTTGA TGGGATAACT ACATGGGAAG AGGGATTTTT TAATACAGAA
TATATTAACA ATGTTAATAT ATCAATAGGA GCGAAATAA
 
Protein sequence
MRMDAAYNTI AGFVRGMPFL SSADGRLIAT SAPGLADTLY EYDELGNMVR SGLDVNANGV 
LDDGSSDRIQ ESTTQYTSDG ADWWQETTGK IFAVTGSSTP TTTGIQKTRL TGLGTNGLVN
ETVSTDINGN QTTARTVIDP ATKTVTQTVD YPDASFNEIT VTTNGLLQSR QSKTGVTTTF
SYDGLGRRTG ATDSRTGTTV THYNSLGQVD YVEDAAGSRT TFTYDAAGRR TTETNALGKT
IRYAYNSLGQ LAYKWGSAVE PVKFVYDAYG QMTQMHMYRN GSGWDGAEWP SGSTGDPDIT
TWTYQPSTGL LTAKTDDEAK STAYTYTAAN RLATRTWARD NGTLVTTYTY DLNTGELTGI
DYSDTTPDVA YAYTRAGQVY TVDDVVGTRT FAYNSALQPV TETIDGASGG LYSKTITRTY
ETSGVVGRPT GLSLTGYSVA YGYEPSTGRF AGVTWNTGAG EKTATYAYVA DSDFVDTLTM
ENLVTDFVYE PNRDLKTQVR HTFGGADIAQ YDYTYTALGQ RKTMDLTPDL PDTLVEATTT
YTPDNLNQYD AIETGGVTDN PVYDDDGNLT HQQGMVYAWN GENRLISVEP ETPTEGNTKL
AFVYDYMGRR VKKSVYTFSA GSYQLSAASL FVYDGWNLIQ ELDGTGAVQK SYVWGLDLSQ
SLQGAGGVGG LLAMTDGVNT YLYCHDANGN VGRMVSAADG TVAAAYEYAP FGGLIHKSGA
MADENVFRFS TKYFDGESGL YYYGYRYYEP EMGRWMSRDP LGEEGGYNLY GFVGNDAVND
YDPYGLRSIN SYKEEFMSML LLDMKRKYYN TYIGPGFSDS LKSRAILFCA AGIDFDDTIW
GFAKMGVKAV VNAISSTTKP FVEYTLKKAR GILQKQTANK ARGFIYNTVI IKKYTKSESD
CKCNMTVKYY SDWDVFTVNI KGKVGKTVYE YGETECDCSE EFDYNFDGIT TWEEGFFNTE
YINNVNISIG AK