Gene Dde_3036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDde_3036 
Symbol 
ID3758028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. G20 
KingdomBacteria 
Replicon accessionNC_007519 
Strand
Start bp3023767 
End bp3025095 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content58% 
IMG OID637783944 
ProductCBS 
Protein accessionYP_389525 
Protein GI78358076 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATATTG CCATTCTTGT TGCCCTGATA CTTCTTAACG GCGTTTTTGC CATGTCCGAG 
ATAGCCCTTG TCACCGCCCG CAGAAGCCGT CTGCAAAAAA TGGCGGAAGA AGGTGACCGT
TCCGCAGCCG TGGCCATCCG GCTGGGTGAA GAGCCTACCC AGTTTCTTTC CACTGTGCAG
ATAGGCATAA CGGCCATAGG CATACTGAAC GGCATAGTGG GAGAAGCCGC ACTGGCCGGG
CCTCTTGCCC TGATGCTGCA GAATGCCGGT CTGGAAAGCG GGACAAGCTC GGCCGTGGCA
ACCACAGTTG TGGTGGCGGG CATCACCTAT TTTTCCATTG TGGCGGGCGA ACTGGTGCCC
AAACGCATAG CGCAGTTCAA TGCCGAAGGC ATAGCGCGCA GCATGGCCAG ACCCATAGCC
CTGCTGGCCT GTCTGTCGCG TCCGTTTGTG TATCTGCTTT CTGTTTCCAC GGATGCCCTG
CTGCGGCTGG TGGGCAAAAC TGAACTGAGC AGCGCCAACC TGACCGAGGA GGACATCCAC
GCCATACTGA CGGAAGGTTC ACAGGCGGGT GTCATCGAAA AACACGAGCA TGATATGGTG
CGTAATGTCT TTCGTCTTGA CGACAGGCAG ATTCCTTCGC TGATGACTCC GCGCAGCGAT
ATTGTGTTTC TGGATATCAC GCAGCCGCTT GACGGGTTTC TGGACACAGT GGTGGCCTCT
GATCATTCCC GCTTTCCGGT ATGCCGCGGC GGTCTGCACG AGGTGCTGGG CGTCATCAGT
GCCAAGCGTC TGCTCAAGCA GCGGCTGAAA AACGAACCGG CAGAAAAACT GACCGGATAT
CTGCTGCCGG CTGTGTATGT GCCGGAGTCG CTGACGGGCA TGAAGCTGCT TGAACAGTTC
CGGGAATCCG GTGTGCAGAT GGTTTTTGTG GTTGATGAAT ACGGTGATAT TTCCGGTCTG
ATCACGTTGC AGGACCTGCT GGAAGCGCTC ACAGGAAAGT TCCGCCCGCG CGATCCGGAC
GAAATGTGGG CTGTGCAGCG TGACGACGGT TCGTGGCTGC TGGACGGGCT TATCCCTGTG
CCGGAACTGA AGGACAGGCT TGACCTCAAA ACCGTGCCCG ACGAGGCAAA AGTCCGCTAT
CATACCTTGA GCGGCATGAT GATGTGGCTT TTCGGCCGGT TGCCGCGTAC AGGTGATGTG
GCGGAGTGGC AGGGCTGGCA GCTGGAAGTT GTGGATCTTG ACGGCAAGCG CATCGACAAG
GTGCTGGCCA GCAGGATTTC CGGTTACGAG GCTTCGCAGC CGTCTGCGGC TGGTCCGGAC
GGGCGCTGA
 
Protein sequence
MDIAILVALI LLNGVFAMSE IALVTARRSR LQKMAEEGDR SAAVAIRLGE EPTQFLSTVQ 
IGITAIGILN GIVGEAALAG PLALMLQNAG LESGTSSAVA TTVVVAGITY FSIVAGELVP
KRIAQFNAEG IARSMARPIA LLACLSRPFV YLLSVSTDAL LRLVGKTELS SANLTEEDIH
AILTEGSQAG VIEKHEHDMV RNVFRLDDRQ IPSLMTPRSD IVFLDITQPL DGFLDTVVAS
DHSRFPVCRG GLHEVLGVIS AKRLLKQRLK NEPAEKLTGY LLPAVYVPES LTGMKLLEQF
RESGVQMVFV VDEYGDISGL ITLQDLLEAL TGKFRPRDPD EMWAVQRDDG SWLLDGLIPV
PELKDRLDLK TVPDEAKVRY HTLSGMMMWL FGRLPRTGDV AEWQGWQLEV VDLDGKRIDK
VLASRISGYE ASQPSAAGPD GR