Gene Information |
Locus tag | Dde_2117 |
Symbol | |
ID | 3757125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 2166660 |
End bp | 2168360 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637783005 |
Product | TPR repeat-containing protein |
Protein accession | YP_388609 |
Protein GI | 78357160 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
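The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `check_cds` is hypothetical and is not part of any database tooling.

```python
def check_cds(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length, and protein length agree.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is excluded from the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                     # an unspliced CDS is a whole number of codons
        return False
    codons = span // 3
    return codons - 1 == protein_length_aa  # drop the stop codon


# Values copied from the record for Dde_2117.
consistent = check_cds(2166660, 2168360, 1701, 566)
print(consistent)
```

For this record the span is 2168360 - 2166660 + 1 = 1701 bp, which is 567 codons, or 566 amino acids plus one stop codon.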
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000306715 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTCCT TCAGGCACAT CACGCTGGCA ACAGCGTTGT GCGCGTCACT CACTGCCGGA TGCGCCACAG GACGGTACAA CGGCGACACA GCACCGGAAC AGACTCCCGC CGGCTGGCAG ATGTCTGAAA GTGCACAGCG CACCTATAAT TTTCTGCTGC TTAACGAAGC CGTGCGGGAA AACGACGATG CCACGGCTTC GGCTGCCATC TCGCGCCTAC TTGAGCTGAA CAATTCGCCC GAAATATATC TGGCAGCGGC CAATTATTAC CGCCTGCGGG GCGAAGCATC GTTCGCACGC CCCATCCTGC AGAAAGGCAT TGAGCTGCAC CCCGACAATC TGGATCTGAC ACTGGCGCTT GTGGAAGCCT ATATGCAGGA AAACAGACCG GATGCCGCTC TGGCCGCTTT GACAGACTTT GTGCACAGGC ACAAGGATGA CTACGAAGCG CAACAGATAC TTGCCATATA TCTTCTGGAT GCAAAGGAAT ACGCACAGGC TGCCGACATG CTGGAAGCGG TCCCCGAAGA CAGACGCAGC GCGCATTCCT ATTACTATCT TGCCAAGGCG CATATAGGCC TGAGCAAATT TGAAGAGGCC GAAGAGGCCC TGCACACGGC GCTTAAAATG GAACCGGATT CAGCCGTCGA AGCATGGGCG GAGCTGGCAT ATCTTTATGA ACTGCAGAAA GATTACCCCG CAGCGGAACA GACGTATGCC CGCATTCTGG AGCTTGGTGA CGCCGGCCCC GAAGTATGGC TGCGTCTTGT GGATCTGAAC CTCAAGCTTA ACCACCCGCA GAAGGCGCTT GAATACACCC GTCAGGGCCC TGACAGCCTG GCCTTCACCC TGCAGGCCGG AACACTCTTT CTGGAGGAAG GCTTTTACGA AGAGGCAGAA GAAATTCTGC TGCCCGTGAG CAACCGCCCT GACAGCCCCG AAGAAATCTG GTTTTATCTG GGAGTGCTTT CGCTGCGCGG TCACGATGAT GCCGACGCGG CCATCGCCAA CTTTGCCAGA GTGCCTGTTT CAAACAGATA CTACAGTCAG GCCCTGCGGT TCCGCGCCGA CCTGCTGATA GAACAGGGCG ATTACCCTGC GGCACAGCAG CTTATCGACG AAGGGAAACG ACGTTTTCCC GACTCGCCGG AATTCTGGCT GATAGAGGCC GGAGCGGCAT TTGAACGTGA AGACTTTACC AGTGCGGAAC GCATCCTGCG TGAAGCGCTG GGGAACTGGC CCGGAGACCC GCAGTTTCTT TTTGCGCTGG GCAGCCTGTA CGACACCCGC GGAGAAAAGC AGAAAGCGCT TGAATTTATG GAACAGGTCA TTCTGGCCGA CACCGACCAT TACCGCGCGC TTAATTATGT GGGCTACACG CTGGCAGAAC AGGGACGCGA TCTGGACAGG GCGCTCGTGC TCATCGAAAG CGCGCTCAAA CTGAGCCCCG GCAGCGCCTA CATACTCGAC TCACTGGCAT GGGTACATTA CAAACTCGGC AACAACGCGC TGGCGCTCAA AAATATCCGT CTTGCCGTTG CCGCCGAAGG CGGCGACGAC CCGACCATGT GGGAACATTA CGGAGACATC GCCGCTCGTG CCGGTCAGCG CAAACTTGCA CTGGAAGCCT ATCGCAAGGC CATCAGGCTG GGCTCTCCCC AAGCTGATGC CATCAGAAAC AAAATCAACG GGCTGCAATG A
|
Protein sequence | MFSFRHITLA TALCASLTAG CATGRYNGDT APEQTPAGWQ MSESAQRTYN FLLLNEAVRE NDDATASAAI SRLLELNNSP EIYLAAANYY RLRGEASFAR PILQKGIELH PDNLDLTLAL VEAYMQENRP DAALAALTDF VHRHKDDYEA QQILAIYLLD AKEYAQAADM LEAVPEDRRS AHSYYYLAKA HIGLSKFEEA EEALHTALKM EPDSAVEAWA ELAYLYELQK DYPAAEQTYA RILELGDAGP EVWLRLVDLN LKLNHPQKAL EYTRQGPDSL AFTLQAGTLF LEEGFYEEAE EILLPVSNRP DSPEEIWFYL GVLSLRGHDD ADAAIANFAR VPVSNRYYSQ ALRFRADLLI EQGDYPAAQQ LIDEGKRRFP DSPEFWLIEA GAAFEREDFT SAERILREAL GNWPGDPQFL FALGSLYDTR GEKQKALEFM EQVILADTDH YRALNYVGYT LAEQGRDLDR ALVLIESALK LSPGSAYILD SLAWVHYKLG NNALALKNIR LAVAAEGGDD PTMWEHYGDI AARAGQRKLA LEAYRKAIRL GSPQADAIRN KINGLQ
|
| |
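The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in which codons may act as start codons). Spaces in the sequence, as used for the 10-base groups in the record, are ignored.

```python
from itertools import product

# Amino acids for all 64 codons in NCBI order (bases T, C, A, G at each position).
# These assignments are shared by translation tables 1 and 11.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna, stop_symbol="*"):
    """Translate a coding sequence codon by codon, stopping at the first stop codon.

    Whitespace is stripped first, so the 10-base groups from the record
    can be pasted in unchanged. Any trailing partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == stop_symbol:
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
print(translate("ATGTTTTCCT TCAGGCACAT CACGCTGGCA"))
```

Applied to the first 30 bases of the gene sequence, this yields `MFSFRHITLA`, which matches the first group of the protein sequence. The final codons `GGG CTG CAA TGA` translate to `GLQ` followed by a stop, which matches the end of the protein sequence.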