Gene Dde_2117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDde_2117 
Symbol 
ID3757125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. G20 
KingdomBacteria 
Replicon accessionNC_007519 
Strand
Start bp2166660 
End bp2168360 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content57% 
IMG OID637783005 
ProductTPR repeat-containing protein 
Protein accessionYP_388609 
Protein GI78357160 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000306715 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTCCT TCAGGCACAT CACGCTGGCA ACAGCGTTGT GCGCGTCACT CACTGCCGGA 
TGCGCCACAG GACGGTACAA CGGCGACACA GCACCGGAAC AGACTCCCGC CGGCTGGCAG
ATGTCTGAAA GTGCACAGCG CACCTATAAT TTTCTGCTGC TTAACGAAGC CGTGCGGGAA
AACGACGATG CCACGGCTTC GGCTGCCATC TCGCGCCTAC TTGAGCTGAA CAATTCGCCC
GAAATATATC TGGCAGCGGC CAATTATTAC CGCCTGCGGG GCGAAGCATC GTTCGCACGC
CCCATCCTGC AGAAAGGCAT TGAGCTGCAC CCCGACAATC TGGATCTGAC ACTGGCGCTT
GTGGAAGCCT ATATGCAGGA AAACAGACCG GATGCCGCTC TGGCCGCTTT GACAGACTTT
GTGCACAGGC ACAAGGATGA CTACGAAGCG CAACAGATAC TTGCCATATA TCTTCTGGAT
GCAAAGGAAT ACGCACAGGC TGCCGACATG CTGGAAGCGG TCCCCGAAGA CAGACGCAGC
GCGCATTCCT ATTACTATCT TGCCAAGGCG CATATAGGCC TGAGCAAATT TGAAGAGGCC
GAAGAGGCCC TGCACACGGC GCTTAAAATG GAACCGGATT CAGCCGTCGA AGCATGGGCG
GAGCTGGCAT ATCTTTATGA ACTGCAGAAA GATTACCCCG CAGCGGAACA GACGTATGCC
CGCATTCTGG AGCTTGGTGA CGCCGGCCCC GAAGTATGGC TGCGTCTTGT GGATCTGAAC
CTCAAGCTTA ACCACCCGCA GAAGGCGCTT GAATACACCC GTCAGGGCCC TGACAGCCTG
GCCTTCACCC TGCAGGCCGG AACACTCTTT CTGGAGGAAG GCTTTTACGA AGAGGCAGAA
GAAATTCTGC TGCCCGTGAG CAACCGCCCT GACAGCCCCG AAGAAATCTG GTTTTATCTG
GGAGTGCTTT CGCTGCGCGG TCACGATGAT GCCGACGCGG CCATCGCCAA CTTTGCCAGA
GTGCCTGTTT CAAACAGATA CTACAGTCAG GCCCTGCGGT TCCGCGCCGA CCTGCTGATA
GAACAGGGCG ATTACCCTGC GGCACAGCAG CTTATCGACG AAGGGAAACG ACGTTTTCCC
GACTCGCCGG AATTCTGGCT GATAGAGGCC GGAGCGGCAT TTGAACGTGA AGACTTTACC
AGTGCGGAAC GCATCCTGCG TGAAGCGCTG GGGAACTGGC CCGGAGACCC GCAGTTTCTT
TTTGCGCTGG GCAGCCTGTA CGACACCCGC GGAGAAAAGC AGAAAGCGCT TGAATTTATG
GAACAGGTCA TTCTGGCCGA CACCGACCAT TACCGCGCGC TTAATTATGT GGGCTACACG
CTGGCAGAAC AGGGACGCGA TCTGGACAGG GCGCTCGTGC TCATCGAAAG CGCGCTCAAA
CTGAGCCCCG GCAGCGCCTA CATACTCGAC TCACTGGCAT GGGTACATTA CAAACTCGGC
AACAACGCGC TGGCGCTCAA AAATATCCGT CTTGCCGTTG CCGCCGAAGG CGGCGACGAC
CCGACCATGT GGGAACATTA CGGAGACATC GCCGCTCGTG CCGGTCAGCG CAAACTTGCA
CTGGAAGCCT ATCGCAAGGC CATCAGGCTG GGCTCTCCCC AAGCTGATGC CATCAGAAAC
AAAATCAACG GGCTGCAATG A
 
Protein sequence
MFSFRHITLA TALCASLTAG CATGRYNGDT APEQTPAGWQ MSESAQRTYN FLLLNEAVRE 
NDDATASAAI SRLLELNNSP EIYLAAANYY RLRGEASFAR PILQKGIELH PDNLDLTLAL
VEAYMQENRP DAALAALTDF VHRHKDDYEA QQILAIYLLD AKEYAQAADM LEAVPEDRRS
AHSYYYLAKA HIGLSKFEEA EEALHTALKM EPDSAVEAWA ELAYLYELQK DYPAAEQTYA
RILELGDAGP EVWLRLVDLN LKLNHPQKAL EYTRQGPDSL AFTLQAGTLF LEEGFYEEAE
EILLPVSNRP DSPEEIWFYL GVLSLRGHDD ADAAIANFAR VPVSNRYYSQ ALRFRADLLI
EQGDYPAAQQ LIDEGKRRFP DSPEFWLIEA GAAFEREDFT SAERILREAL GNWPGDPQFL
FALGSLYDTR GEKQKALEFM EQVILADTDH YRALNYVGYT LAEQGRDLDR ALVLIESALK
LSPGSAYILD SLAWVHYKLG NNALALKNIR LAVAAEGGDD PTMWEHYGDI AARAGQRKLA
LEAYRKAIRL GSPQADAIRN KINGLQ