Gene Information |
Locus tag | Dde_0576 |
Symbol | |
ID | 3756237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | - |
Start bp | 601317 |
End bp | 603275 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637781437 |
Product | putative transmembrane signal peptide protein |
Protein accession | YP_387072 |
Protein GI | 78355623 |
COG category | [S] Function unknown |
COG ID | [COG4907] Predicted membrane protein |
TIGRFAM ID | |
|
|
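The coordinate and length fields in the table above are mutually consistent, which a little arithmetic can confirm. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the 1959 bp gene length includes the stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is
    included in the gene length and is not counted as a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(601317, 603275)
print(length)                  # 1959
print(protein_length(length))  # 652
```

Both results agree with the Gene Length and Protein Length rows of the record.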
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000285101 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCACA CCCTGCAAAA GCGGACAGCG CTGCTTGCCG CCTGTCTGGC CGCCGTGCCG CTGCTCTGCG CTCTGCTGCT GTCTTCTCCC GTTCCCGCCT GCGCAGCCCG GCAGGCCCCT GCGGCAGAAC GCATTCTGCT GTTTGATTCC ACCGTGCACA TTCAGCCCGA CGGGCTGTTG TCCGAGCGTG AGACCATCAG GGTCAACGTT CTGGGCCAAA ACATCCGGCG CGGCATTTTC CGCGATATTC CGGTGCGCTA CAAGCTCGGC ATGGGGTTAC GCCGCCGTCA TGCGCTCATT GTGCAGAACG TGCTGCGCAA CGGAAAACCG GAACCCTATG CCGTGGAACA GCGCAATCAT GTACTGCGCA TACTCATCGG CTCGGGGCAT GCCATGCTGG ACCACGGGGT GCACACCTAT GAGATAACCT ACACGCTGGA CAAACAGATA GGTGTACCCG AAGACGGCAA AGAAGCGCTG CTGGCGTGGA ACGTGACAGG CAACTACTGG GACTTCACCA TCGACAAAGC CACGTGCACC GTTGTGGCTC CTCCCGGGGT ATCTGTTGTG TCCGCCACGG GGGCCACAGG CAGACCCGGT GAACACGGCA CGACGTTCAC CGCATCGTAC CACGGGGCAG AAGCCGCATT TGCCACCACC AAGCCGCTGC CGCCCCGGGA AGGCTTCACC ATATTCACGC GGCTGACACC GCCGGACGGC CTTGCGGGAC CGCAGGGCTT TGCTGCCGTA TGGGCTGACA ACAAACTTTT CTTTGCCGGA ATCGCGCTGC AGTTGCTGCC CCTTCTGGTC TTTGCCCTCC TTTGGCACAA ATTCGGACGC GACCCGCACG GCCCGGTGGT AATCCCCCGC TATGAGCCGC CCGCAGGTAT GGACGCGGCA TCTGCCGGCA GCCTGCTGCA AAACGCCCCC ATGCCGCGCA AACGCGCGCT GATGCTCAGC ATCGTGCATC TGGCAGCAGA AGGCTGTCTG ACCATTGAAA CGGAAAAAAT GACACATGTT CTGCATGCCA CCGGCAAGCC CTCGTCGGTA CCCGCCAACC GGCAGGTGCT GGAAGAGCTG TTCAGGACAG GCAACACAGT GCAGCTGAGC AGCTATAACC GGCCGGCGCT TTCTTCGGCG GCGGATGCGC TGCAGCGCTC CATGAAAGAA CGCTATGACA CCCCTGAAAT ACGCCGCAAC AACACCTCCA AGCTGCTGCT GTGTGTGCTG AGCAGCATTG CCGCACTGGC AGGCAGTGTA CCGCTTGTGT TCGCCGGTGA TGCCGGAGGA TTTATGGCGG GGGCCGCTTC CGGCATGTTG CTGCTCATGG CAGGACTTGT GCCCTACAAG ATTCTCAGCA GGCGGTTCAG CATTCCGGGC ATACTGTTCT CTTTGATATT TTTCAGTTTC GCGCTCATCT TTTTTGCCTC TTTCAACAGA CATGCAGAAG CCAACACCGG CATGGTCATC GTGCTGTGGA ACGTGTTTCT GCTCTGGCTT TTCCGCAGAC TGATGCCCGC CTACACCGAA AAAGGCATGG CTCTCGTCAA TGAGCTCAAA GGATTCAGAC TGTTTCTTTC GCTCACCGAA AAGCAGCGGC TGCAGATGCA GGACCAGCCT GAAATCACCC GCGAACGGTT TGAGGCCCTG TTGCCCTTTG CCATGGCTTT CGGCGTTGAA CGCCGCTGGG AGGACAGCTT TAAAGCTGCC ATGCAGCTGC GCGGCGAAGA CAGCACCAAC TACGGCATAC GCTGGTATGA CAGGGGTTTT TCCGGCACAG AGATGCGCCA TGCCGGACTG 
GGTGCCGCGC TGGGCAGCAG CATCGGGGCA TCAGTCTCCG CCAGCATGCC GGAATCCACC GTCTCCGGTT CTTCCGGCGG CTTCAGCAGC GGCGGAAGCG GCAGCAGCGG CGGCGGCAGC GGTTCCGGAG GAGGCGGCGG CGGGGGCGGC GGCTGGTAA
|
Protein sequence | MSHTLQKRTA LLAACLAAVP LLCALLLSSP VPACAARQAP AAERILLFDS TVHIQPDGLL SERETIRVNV LGQNIRRGIF RDIPVRYKLG MGLRRRHALI VQNVLRNGKP EPYAVEQRNH VLRILIGSGH AMLDHGVHTY EITYTLDKQI GVPEDGKEAL LAWNVTGNYW DFTIDKATCT VVAPPGVSVV SATGATGRPG EHGTTFTASY HGAEAAFATT KPLPPREGFT IFTRLTPPDG LAGPQGFAAV WADNKLFFAG IALQLLPLLV FALLWHKFGR DPHGPVVIPR YEPPAGMDAA SAGSLLQNAP MPRKRALMLS IVHLAAEGCL TIETEKMTHV LHATGKPSSV PANRQVLEEL FRTGNTVQLS SYNRPALSSA ADALQRSMKE RYDTPEIRRN NTSKLLLCVL SSIAALAGSV PLVFAGDAGG FMAGAASGML LLMAGLVPYK ILSRRFSIPG ILFSLIFFSF ALIFFASFNR HAEANTGMVI VLWNVFLLWL FRRLMPAYTE KGMALVNELK GFRLFLSLTE KQRLQMQDQP EITRERFEAL LPFAMAFGVE RRWEDSFKAA MQLRGEDSTN YGIRWYDRGF SGTEMRHAGL GAALGSSIGA SVSASMPEST VSGSSGGFSS GGSGSSGGGS GSGGGGGGGG GW
|
| |
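The gene sequence appears to be listed in coding orientation: it begins with ATG and ends with TAA even though the gene lies on the minus strand, so it can be translated directly. The following is a minimal sketch of that translation, assuming the amino-acid assignments of translation table 11 (which match the standard code). The helper names are hypothetical, and only the first 30 and last 9 bases of the record are used as input.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace
    (the record groups bases in blocks of 10) and stopping at the
    first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGTCCCACA CCCTGCAAAA GCGGACAGCG"))  # MSHTLQKRTA
print(translate("GGCTGGTAA"))                         # GW
```

The two outputs match the first ten and last two residues of the protein sequence in the record.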