Gene Dole_2444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2444 
Symbol 
ID5695293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2955310 
End bp2956479 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content63% 
IMG OID641265051 
ProductDNA protecting protein DprA 
Protein accessionYP_001530325 
Protein GI158522455 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000433746 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATC TCACCCCCTG GTTTACTCTG AAAAGCGTTC CCGGCGTGGG CAACCTCCTG 
TTCAAGCGGC TGATCGACCG GTTCGGCTCA CCGGAGAAAG TGCTGAGTGC TGCCCGGTCC
GACCTGCTGG GGGTCCAGGG CATATCAGAC ACCCTGGCCT CGGCCATCCG AGCCCACAAG
ACACCGGACA ATATTGAAAA GGTGCTTGAG ACCTGCGCCC GGAAAGCCAT TTCCATCGTC
CCCCTGACCG ATCCCGGCTA CCCCGCCCTG CTGCGGGAAA TCCATGACCC GCCCCCCTAT
CTCTACGTGT GGGGAAAGCT GGTGCCGGAC GCGGGGTGCA TATCGATTGT CGGATCCCGA
AGCCCCACCC GGTACGGCCT CTCCATGGCC ACCCAGTTGA GCGGCGAACT GGCCGCCATG
GGCCTGTGTG TGGCCAGCGG TATGGCCAGG GGCATCGACA CCGCGGCCCA CACCGGGGCA
CTGGACAACA ACGGCCTGAC ATACGCCGTT CTGGGCAGCG GCCTGTGCCG CATCTATCCG
CCGGAGAACA TGGAACTGGC CCGGCGCATC GCCGGGCAGG GGGCCGTGAT ATCAGAGTTC
CCTCTTTTTG CCGAGCCCGA CGCCCACCAC TTTCCCCTGC GCAACCGGTT GATCAGCGGC
CTCTCCCTGG GCACCATCGT GGTGGAGGCG GCGGCCCGAA GCGGGTCCCT GATCACGGCC
CGGCTGGCCA TGGAGCAGGG CCGGGAGGTA TTTGCCGTGC CCGGCAGCAT CACCTCCTTT
AAAAGCACCG GTGCCCACGG CCTGCTCAAA CAGGGGGCGA TCCTGGTGGA AAAGGCATCG
GACGTGATCG CCGAGATATC TCCCCGGCTT GCCGCCGGCC CCGCAACCGC CCCGGCGGCG
TCGGACCGGG CCGATGAAAA CAAACACGCC GGAAAACCGA CCCCCGGCCT TGACACGGAT
GAGGTACGGG TGTTACAAAC CCTTGAACCT TACCCGGTGC ATATTGACGA AATCGCCCAG
AAGGCGGCCA TGGCGCCGGG AAAAACAGCA GGCATCCTGC TGCAACTGGA ACTCAAAGGG
TTTGTAACCC AGGAACCGGG AAAACGGTTC CTTATTAACC CGGATGTTGC ACGAGCCGAT
TTGGTCGCAG ATGCAAGGCG CGAGACATGA
 
Protein sequence
MENLTPWFTL KSVPGVGNLL FKRLIDRFGS PEKVLSAARS DLLGVQGISD TLASAIRAHK 
TPDNIEKVLE TCARKAISIV PLTDPGYPAL LREIHDPPPY LYVWGKLVPD AGCISIVGSR
SPTRYGLSMA TQLSGELAAM GLCVASGMAR GIDTAAHTGA LDNNGLTYAV LGSGLCRIYP
PENMELARRI AGQGAVISEF PLFAEPDAHH FPLRNRLISG LSLGTIVVEA AARSGSLITA
RLAMEQGREV FAVPGSITSF KSTGAHGLLK QGAILVEKAS DVIAEISPRL AAGPATAPAA
SDRADENKHA GKPTPGLDTD EVRVLQTLEP YPVHIDEIAQ KAAMAPGKTA GILLQLELKG
FVTQEPGKRF LINPDVARAD LVADARRET