Gene Dgeo_0306 details

Gene Information

Locus tag: Dgeo_0306
Symbol:
ID: 4058030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 297150
End bp: 298364
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 62%
IMG OID: 641229309
Product: von Willebrand factor, type A
Protein accession: YP_603778
Protein GI: 94984414
COG category: [R] General function prediction only
COG ID: [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
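
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 297150
end_bp = 298364

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the listed gene length of 1215 bp and protein length of 404 aa.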


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.813873
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGTG TCACGCGGTA CAGCAAGTTC GAGGGCGAAC TCGACCAGCT CGATTCAAGT 
GAGCTGATGC AGATGATTCA GGAAGCCCTG CTGGGCAAGG GCATGAACGA CCCCTACGAT
CCTGACCCGA ATGCGCGGCC CAGCATGGAC GACCTGTTCG ACGCCATTTT GGAGGCTTTA
GCCGAGCGGG GCATGATTCC CGAGGAGCAG CTCTTAGAGG CCATGCAGGC CGACGACGTG
CGGCAGACGG CCCTGGGCCA GCAGATCGAG CGCCTGATGG ACAAGCTCCA GCAGGACGGC
TTTATCCGCA AGGAGTTTGA CGACCAGGAA GAGGGGCAGG GAGGACAAGG CCAGAGCGGC
GAGGCGCGAT TTCAGCTCAC CGACAAGAGC ATCGACTTTC TGGGCTACAA GAGCCTGCGC
GACCTGATGG GCGGCCTGGG CAAAAGCAGT GCGGGCGCCC ACGACACCCG CGAGTACGCC
AGCGGCGTCG AGATGACCGG CGAACTCAAG AACTACGAAT TTGGAGACAC GCTCAACCTC
GACACGACCG CGACCCTCAG CAACGTCATC AGCAAGGGCT TTGAGCAGCT CGAAGAAGCC
GATCTGGTTA TCCGGCAGGC GGAATACAGC TCCTCAGCGG CAACGGTGGT GCTGCTTGAC
TGCTCGCACT CCATGATTCT GTACGGTGAG GACCGCTTTA CGCCTGCCAA GCAGGTGGCC
CTCGCGCTTG CGCACCTGAT CCGCACGCAG TACCCCGGCG ACACCGTCAA GTTTGTGCTG
TTCCACGACT CCGCTGAAGA GGTGCCGGTC TCCAAGCTGG CGCAGGCGCA GATCGGGCCG
TACCACACGA ACACGGCGGG CGGATTGCGG CTCGCGCAGC AACTCCTCAA GCGCGAGAAC
AAGGACATGA AGCAGATCGT CATGATCACG GACGGCAAGC CCTCGGCCCT CACGCTGCCC
GATGGCCGCA TCTACAAGAA CGCCTACGGC CTCGATCCCT ATGTGCTGGG GGCCACCCTG
CGCGAGGTCG CCAACTGCCG CCGCGCCGGC ATCCAGGTGA ATACCTTTAT GCTCGCCCGC
GACCCCGAAC TCGTCGGCTT CGTGCGCCGC GTCACCGAGA TGACCAGGGG CAAGGCTTAT
TTCACCACCC CGTACAACAT CGGCCAGTAC GTCTTGATGG ACTTCATGAC GAACAAGACG
AAGATGGTGA ATTAG
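
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below is one possible way to strip the layout and compute a GC fraction that can be compared with the GC content field of the record. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGGCGCGTG TCACGCGGTA CAGCAAGTTC GAGGGCGAAC TCGACCAGCT CGATTCAAGT"

print(len(clean_sequence(first_row)))
print(f"{gc_content(first_row):.1%}")
```

Passing the full 1215 bp sequence to `gc_content` gives the gene-wide value.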
 
Protein sequence
MARVTRYSKF EGELDQLDSS ELMQMIQEAL LGKGMNDPYD PDPNARPSMD DLFDAILEAL 
AERGMIPEEQ LLEAMQADDV RQTALGQQIE RLMDKLQQDG FIRKEFDDQE EGQGGQGQSG
EARFQLTDKS IDFLGYKSLR DLMGGLGKSS AGAHDTREYA SGVEMTGELK NYEFGDTLNL
DTTATLSNVI SKGFEQLEEA DLVIRQAEYS SSAATVVLLD CSHSMILYGE DRFTPAKQVA
LALAHLIRTQ YPGDTVKFVL FHDSAEEVPV SKLAQAQIGP YHTNTAGGLR LAQQLLKREN
KDMKQIVMIT DGKPSALTLP DGRIYKNAYG LDPYVLGATL REVANCRRAG IQVNTFMLAR
DPELVGFVRR VTEMTRGKAY FTTPYNIGQY VLMDFMTNKT KMVN
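
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in the set of permitted start codons, which does not matter here because this gene begins with ATG). The function name `translate` is hypothetical, and only the first and last rows of the gene sequence are used as examples.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGGCGCGTG TCACGCGGTA CAGCAAGTTC GAGGGCGAAC TCGACCAGCT CGATTCAAGT"
last_row = "AAGATGGTGA ATTAG"

print(translate(first_row))  # MARVTRYSKFEGELDQLDSS
print(translate(last_row))   # KMVN
```

These two results match the first twenty and the last four residues of the protein sequence listed above.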