Gene Dole_1988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1988 
Symbol 
ID5694828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2404484 
End bp2405449 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content60% 
IMG OID641264586 
Productlow molecular weight phosphotyrosine protein phosphatase 
Protein accessionYP_001529869 
Protein GI158521999 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0394] Protein-tyrosine-phosphatase
[COG0655] Multimeric flavodoxin WrbA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0292793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGCAC TGGGATTGAT GGGCAGTCCG CGAAAAAAAG GCAACTCCGC CTACCTGCTT 
TCCGCCTTTC TCAAGGCGCT TGAAGCAAAG GGGGCTGTCA CCCATACCGT GGTGGTGGCG
GAAAAAGAGG TGCTGCCCTG CATCGGCTGC ACCTACTGCG AAAAGCACGG CCGCTGTTTT
CAGCAGACCG ATGACATGGC AAAGGAAATG TACGGCCTGT TCCGCCGGGC CGATATCGTG
GTGGCGGCTA CGCCCATGTA TTTCTACAGT GCCCCGGCCC AGCTGAAGAT GGTGATCGAC
CGGACCCAGA CCCTCTGGTC CCGCAACTAC CGGCTGAACC TGCGCGATCC CCGGGCCGGG
AGCCGGGCCG GGTTCATGCT CTCCCTGGGC GCCACAAAAG GGAAAAACCT TTTTGAGGGC
ATCAACCTCA CGGCCCGCTA TTTTTTTGAC GCGGTCAGCG CTGAATTTAC CGGCTGGCTG
GGCTACCGCC GGATCGAGAA CCCCGGAGAC ATGGAAAAGC AGGAGGGGCT TGCCGCCGAT
ATCGCGGCGG AAGTCAGCAA GCTGGACGCC CTGTTTGCCC GCAAAAAAAT GGTGTTTGTG
GGCACGGACA ACACGTGCAC CAGCCGCATG GCCGAAGCGT TTGCCATGGC CATGGCCGGT
GACCGGGTGG AAGCCATGAG CGCCGGTATG AGCCCGGCGG AAAAGATCGA TCCTGAAATG
GAGGCCGCCA TGGCGGAAAA GGGTGTGGAC ATGGCTTTTG GCCGCCCCCG TTTAATGGAT
GACGTGCTGT CGGAGATAAA ACCCGGTATC GTGGTGACCG TTGGCATCGT TCCCGATTTT
ACCCCGGTGC CCGGCGCTCA GGTCGTGGCA TGGGAGATTC CGAATATCGA AGACCGATCC
CCGGAAGGGG TGCGCCGCCT GCGGGATGAT ATTGAAGCGA GGGTGGCGGC ACTCATTCAG
GGATAA
 
Protein sequence
MFALGLMGSP RKKGNSAYLL SAFLKALEAK GAVTHTVVVA EKEVLPCIGC TYCEKHGRCF 
QQTDDMAKEM YGLFRRADIV VAATPMYFYS APAQLKMVID RTQTLWSRNY RLNLRDPRAG
SRAGFMLSLG ATKGKNLFEG INLTARYFFD AVSAEFTGWL GYRRIENPGD MEKQEGLAAD
IAAEVSKLDA LFARKKMVFV GTDNTCTSRM AEAFAMAMAG DRVEAMSAGM SPAEKIDPEM
EAAMAEKGVD MAFGRPRLMD DVLSEIKPGI VVTVGIVPDF TPVPGAQVVA WEIPNIEDRS
PEGVRRLRDD IEARVAALIQ G