Gene Dole_0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0037 
Symbol 
ID5692851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp43905 
End bp45068 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content59% 
IMG OID641262613 
ProductTPR repeat-containing protein 
Protein accessionYP_001527924 
Protein GI158520054 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000760847 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGTCAA AAACCTTTCT TTACTTTATG GCGCTTTTCG GCCTGGTATT TTTGTTTGCC 
TGCGGCCCCA AGGCAGTTGC GCCGGAAGCC CAGATGGACA CACCGGAACA CCATGTGACC
AACGGCAACA AGTTTCTGAA GGCCGACAAG CTGGATGAAG CCTTTACCGC CTTTACCCGG
GCGACCCAGC TGGACCCCAA ATACGCGCCG GCCTATGTGG GCCTGGGGCT GGTCCACGGC
AAACAGGGAA TCTTTGACAA GGCATTTGAC GCCATGAAAC AGGCGGCCCG CTTTGCCAAA
ACCGACGCCC AGCACGCCGA AACCAGTGTG GGTTACATCC GCCTGTACAC CATGGGCGGC
CCGGCAGTGG AAGAGAACTG GCTCAACAAG GCCGGCAACC ATTTTGAGCG GGCCCACAAG
CTGGCGCCCC GGGACCCGGC CCCCTATTTT TACATGGGCA TGGCCTATCG CAACGCCTAT
CGTTTTTCAG ATGCCGCCGG CATGTTCAAG GCGGTGCTGG ACCTGGACAA AGACTTTGTG
GAAGCCGCGG ACCGGGAGTA TGCGGTGATG CAGCGCATCG AACGGGCCAT GCCCGGCACC
TCGGTGGGCA AGAAGATCGC CCTGCTGGAG GCCATCACCC GGGCCGACGT GGCCGCGCTT
TTTATCGAAG AGCTGAAGGT GGACGAGCTG TTTGAAAAAA ATACACCCAA AACCTTTGAC
ACCGCCTTCA AGGCCCCGGG CGCCGCTTTT AAAACCGGCG AGTATGTCAA GGCGCCGGCT
GTCACCGACA TTGACAACCA TGTGCTGCGC CAGGATATTG AAGCGGTGGT ACGCCTTCAG
ATCAAGGGCC TGCAGCCGGG TCCGGATCAC ACGTTTGAAC CGGACAAGTA TATCACCCGG
GCCGAGTTCG CCATGATGAT CGAAGATATC CTGATCAAGA TCACCGGAGA CAACTCGCTG
GCCACCCGGT TTATCGGCAC CGAGTCCCCT TTTCCCGACC TGCGCAGCGA TCTTGCCTTT
TTCAACGCGG CCATGGTGTG CGTCACCCGG AATATCATGG AGACCGTTGA CACCGCCACC
GGCGAATTCC GTCCCCAGGG GATGGTCTCG GGCGCGGACG CCCTGCTGAG CATTCGGCAG
ATGAAGGTGC AGCTTAACAA ATAG
 
Protein sequence
MRSKTFLYFM ALFGLVFLFA CGPKAVAPEA QMDTPEHHVT NGNKFLKADK LDEAFTAFTR 
ATQLDPKYAP AYVGLGLVHG KQGIFDKAFD AMKQAARFAK TDAQHAETSV GYIRLYTMGG
PAVEENWLNK AGNHFERAHK LAPRDPAPYF YMGMAYRNAY RFSDAAGMFK AVLDLDKDFV
EAADREYAVM QRIERAMPGT SVGKKIALLE AITRADVAAL FIEELKVDEL FEKNTPKTFD
TAFKAPGAAF KTGEYVKAPA VTDIDNHVLR QDIEAVVRLQ IKGLQPGPDH TFEPDKYITR
AEFAMMIEDI LIKITGDNSL ATRFIGTESP FPDLRSDLAF FNAAMVCVTR NIMETVDTAT
GEFRPQGMVS GADALLSIRQ MKVQLNK