Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1508 |
Symbol | |
ID | 5694345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 1798304 |
End bp | 1801153 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641264103 |
Product | hypothetical protein |
Protein accession | YP_001529389 |
Protein GI | 158521519 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.144952 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTACG TCTTTCTGGG TGTAATCGGT GCGCTGGCAG GCGCCTTCTG GCTGCGGGGC GACGGGGCTT TTCTGGCAGG CGGGCTGACA GGGGTGCTGG CGGCCGCTGT CATATCCCTG AAAAACCAGG TGAGTACGCT CGTGCGGCGA ATGTCCGACC TTGAAGCCCG ACAGACGGCC GACCGGACGC AGGCGGCCTC GGCCCCTCAG CAGCCGGAAC CGGAGAAGGC GGCGGCGGAA ACGTTTTCCG TGTCGCCGGA AATAAAAACC CCTTCCTTCC CTGTCCGACG ACCGGCCCCG GAAGAAGATC TCTACCTTGA ATCCGTTTCG GACACCCCGA CATCCGAAAG CGACGTGCAG GCGGTTTCAG ATTCCCCGGC ACCAGACGAA AAACCGCCAC TTAAAACACA ACAACCCCCT TCACCACCCC CTGCCGTCCA TGACCAGACC AGTTTTGAAG TGTTTATCCG GTCCCTGTTC TCCGGCGGTA ACCTCATGGT GCGCGTCGGT GTGGTGATCC TGCTGTTCGG GTTTGCCTTT CTGATCAAGT ATGCCGCGGC CCGGAACATG GTCCCTCTGG AGGTCCGACT GGCCGCCGCC TTTGCCGCCG GCATCGGGCT GCTGGCCCTT GGCTGGCGAC TGCGAAGCAA ACGGTTCGGC TATGCCATGG CCCTTCAGGG CGGGGGCCTG GGCATCATGT ACCTGACCTT GTTTGCGTCG GCCCGCCTCT TTCACATGGT TCCTCTGCCC CTCACCTTTG GCGTCATGGT GGCCCTGGTC GCTTTCTCCG GCATGCTGGC CATATTGCAG AACGCCGCCT CCATGGCCGT GCTGGGCGCC GCCGGCGGGT TTCTGGCGCC GGTGCTGCTT TCCACGGGAT CGGGAAATCA TGTCTTGCTC TTTTCCTACT ACGCCCTGCT CAATGCCGGA ATCTTCGGAA TCGCGTGGTT TAAGGCGTGG CGCTGGCTCA ACCTGCTGGG GTTCTTTTTC ACTTTCGGCA TCGGATCCGC CTGGGGGGTT CAATATTACC GGTCCTCCCA CTTCGCCACC ACCGAACCCT TTCTGGTGCT CTCCTTTGCG TTCTACCTGA CCATCTCCGT GCTTTTTGCC TTCAAACTGC CGCCAAAGCT GAAAGGGTAT GTGGACGGCA CTCTTGTGTT CGGCCTGCCC GTTGTGGTGT TCGGCCTTCA GGTGCCGCTG GTGGAGCGGT TTGAATACGG TCTTGCCTTC AGCGCCCTTG TCATGGGCAT AGTCTATATC TCCCTGGCAA CTTCGCTGTG GCGTCGCCGC ACAAAAGAGA TGGGCCCCCT GGTAGAGACA TTTCTGGCCC TGGGCGTGGT GTTTTCCAGC CTGGCCATTC CCCTGGCCCT GTCCGGACTG TGGACGGCTG TGGCATGGAG CCTGGAAGGG GCCGGCCTGG TCTGGGTGGG GGTCCGGCAG CGCCGCCTGA CCGCGCGGCT GTTCGGCCTG CTGCTTCAGT TCGGCGCCGG GTTCCTTTTT CTTGCGGACG GACGGTACGG CGGCGGCATG ATGATTCTCA ACAGCCGTTT TCTGGGCGGG ATGATGATCG CTGTGGCGGC CCTGGTCTCG GCATTTTTCA TGGAACGATA CCGGTCGGTG TTGCGTGTGC TGGAACAGTT TCCCTCGGCC CTCATCATGG CCTGGGGGCT TGTCTGGTGG TTCGGCGCCG GCGTAGTCGA GATCGACCGC CACTGGCCGG ACCGGTATCA GTTGGAATGC CTGGCGGCCT TTGTTGTGGC AAGCTGCGGG GCCATGGGCT GGCTGTGCCA CCGGCTGGAC TGGAAAGGGG TACGGTGGCC GGCAGCGGGC CTGCTTCCGT TTATGGTGTT TCTTTATATC GTAACAGCCC ACTACACCAA TGATTACAGC AGAATGCATC CCTTCCAGGA TTGGTGGCTG GTGATCTGGC CGCTGGCTCT GGCGGTACAT TTTTTATTGC TCTGGAAACT GGAAAACAAA TGGCCCAAAA AACTGCTGGT GCCCTGGCAT GTGACAGGCG GGCTGCTGAT CATTTTTCTG CTGAGCCGTG AAGCCGCTTG GGGAATCGAC CGGCTGACGC TCGGTTCCCC GACACACACC GGAAGGTTGA CCGCCACTGC ATACGAAACC CTGATGCGAC GACGGCTGGG GTTTGCCGGG ATTCAACAGT TTATCGCCTG GGGCATGGTA CCGGCCGCCG GCGCCTGGCT ATTAAAAGGA CTTTTCCGGA AAACCGGCAT TCGACCGGAG GTTGCCTACA ATGGGTGGCT TCCCTTCCTG ATCATGCTGG GGCTGATGGG ATGGACCTTC ATGGCCTCCT CTTTTAACGG TGGCTTTGAA TCGTTGCCTT ACCTGCCGCT GCTAAACCCC CTGGACGTGG TCCAGGCATT TGTGCTGGTC ACCATTCTCT ACTGGTGCCG GTCGCAACGG CAACATCCCA CCCCGCCTGC CGGAAAGCTG GACGCGGCCA TGCTGTGGGG GGCACCGGCA GCCGGCGTTT TTGTGTGGCT CACCGCCATT GTGGCCCGAA CGGTTCACCA CTGGGGCCAT GTTCCCTACC ACATGGAGGC CCTGGGCGAT TCCGCTGTTT TCCAGGCATC CCTGTCCGTT CTGTGGGGCG CCCTGGCCCT GGGCACCATG GTGACGGCCC ACCGTCTCAA GCAACGGGCC ATCTGGTTTA CCGGCGCGGG TCTCCTGACC GTGGTGCTGG TCAAGCTGTT TGTCGTTGAT CTGTCCGGCA CCGGCACGGT TTCCCGAATC GTCTCGTTTC TGGCAGTGGG AGCCCTGATG CTGATTATCG GATTCTTTAC CCCGCTGCCG CCGGCTGCTG ACAAAGGAGA GACTTCATGA
|
Protein sequence | MLYVFLGVIG ALAGAFWLRG DGAFLAGGLT GVLAAAVISL KNQVSTLVRR MSDLEARQTA DRTQAASAPQ QPEPEKAAAE TFSVSPEIKT PSFPVRRPAP EEDLYLESVS DTPTSESDVQ AVSDSPAPDE KPPLKTQQPP SPPPAVHDQT SFEVFIRSLF SGGNLMVRVG VVILLFGFAF LIKYAAARNM VPLEVRLAAA FAAGIGLLAL GWRLRSKRFG YAMALQGGGL GIMYLTLFAS ARLFHMVPLP LTFGVMVALV AFSGMLAILQ NAASMAVLGA AGGFLAPVLL STGSGNHVLL FSYYALLNAG IFGIAWFKAW RWLNLLGFFF TFGIGSAWGV QYYRSSHFAT TEPFLVLSFA FYLTISVLFA FKLPPKLKGY VDGTLVFGLP VVVFGLQVPL VERFEYGLAF SALVMGIVYI SLATSLWRRR TKEMGPLVET FLALGVVFSS LAIPLALSGL WTAVAWSLEG AGLVWVGVRQ RRLTARLFGL LLQFGAGFLF LADGRYGGGM MILNSRFLGG MMIAVAALVS AFFMERYRSV LRVLEQFPSA LIMAWGLVWW FGAGVVEIDR HWPDRYQLEC LAAFVVASCG AMGWLCHRLD WKGVRWPAAG LLPFMVFLYI VTAHYTNDYS RMHPFQDWWL VIWPLALAVH FLLLWKLENK WPKKLLVPWH VTGGLLIIFL LSREAAWGID RLTLGSPTHT GRLTATAYET LMRRRLGFAG IQQFIAWGMV PAAGAWLLKG LFRKTGIRPE VAYNGWLPFL IMLGLMGWTF MASSFNGGFE SLPYLPLLNP LDVVQAFVLV TILYWCRSQR QHPTPPAGKL DAAMLWGAPA AGVFVWLTAI VARTVHHWGH VPYHMEALGD SAVFQASLSV LWGALALGTM VTAHRLKQRA IWFTGAGLLT VVLVKLFVVD LSGTGTVSRI VSFLAVGALM LIIGFFTPLP PAADKGETS
|
| |