Gene Dole_1826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1826 
Symbol 
ID5694666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2207989 
End bp2209341 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content51% 
IMG OID641264424 
Producttransposase IS4 family protein 
Protein accessionYP_001529707 
Protein GI158521837 
COG category[L] Replication, recombination and repair 
COG ID[COG3039] Transposase and inactivated derivatives, IS5 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAAAA AATGGCAAAA ACAAATGACC TTCATGCCTC AAGAAATTGA TCATCCGCAA 
GCCAGAGAAC TCGAAGCCAT CAGTCAGTTG CTTGACAGCA AATCTACCAT TTACGAAATC
GTCTTGCAAG ACCTTCCCAG TCAAGCGACA TCCGGCTCGC CAGGTGGCGC CCAGGGCATG
ACTGCCGAGC AGGTGATACG AGCCGCCATT GTAAAGGTCC TGTTCGGCTT TACATACGAA
GAACTGGCCT TTCATTTGGT CGATTCCATG AGCATCCGGC GCTTCTGTCG AATCGGCATT
ACCGACGAAG GGTTTAAAAA ATCCACGCTT CATAAAAACA TCAAAGCCCT GTCCGCCGAG
ACCTGGCAGT TGATCAACAA GGAGGTGCTG GCGCATGCCG AAGAAGCCGG GATTGAAAAA
GGCCGTCAGG TGCGTATTGA TTGCACGGTT GTTGAAAGCA ATATCCATAA GCCGAGTGAT
TCTGTCCTTC TGTGGGACGC CGTCCGGGCT ATTACCCGGT TATTGGAACG CGCCCAACAA
GAGACCGGGA AGCAAAGGCT TTTGTTCCAT GACCACCGGC GGATCGCAAA AAAACGAATG
CTGGCGATTC AATACACCCG TGACGCAAAA GCCCGAAAGC CCCTTTACAA AGACCTTGTC
AAAAAAACCC GCCAGTGCGT CTCCTATGCC AGGTCAGCCG TAAAAGCGCT GGAGCAATCC
GTGGCCCATC CTTCCAGAAC GGCATTGGCC ATTGAACTGC AGAGCTTTGT GCGCCTGACC
GATCAGGTGA TCCGTCAGAC CGAGCTGCGC GTCTTCCAGG ACCAGCAGGT TCCGTCGTCA
GAAAAAATCG TCTCTTTGTT TGAACCGCAT ACGGACATCA TTGTCAAAGA CCGCCGGGAC
ACCTATTACG GACATAAAGT CTGTCTGACC GGCGGCAAAT CAAATCTGAT CCTTGATTGC
CTTATTGTTG AAGGCAACCC GGCCGATACC ACACTGACAG AGACCATGCT GGATCGCCAG
CATCAGATTT ATAATCGTTA CCCGCTGAAA GCCGCCCTTG ATGGCGGATT TGCCTCCAAA
GACAACCTGG CCAAAGCCAA AGAGAAAAAG ATCAAAGACG TATGCTTTGC CAAAAAACGC
GGGTTGTCTG AATTGGATAT GTGTCGCAGC CATTATGTTT ATAAACAGTT ACGCCGCTTC
CGTGCGGGCA TCGAGGCCGG GATATCCTGG CTCAAACGGA CCTTCGGCTT CAACCGCTGC
ATGTGGAAAG GCCTGCCGTC CTTTAAAAGC TATGTCTGGG CAACGATCGT GTCGGCAAAT
TTGCTGACCG TTGCCAGAAA ACAACTGGCA TAA
 
Protein sequence
MRKKWQKQMT FMPQEIDHPQ ARELEAISQL LDSKSTIYEI VLQDLPSQAT SGSPGGAQGM 
TAEQVIRAAI VKVLFGFTYE ELAFHLVDSM SIRRFCRIGI TDEGFKKSTL HKNIKALSAE
TWQLINKEVL AHAEEAGIEK GRQVRIDCTV VESNIHKPSD SVLLWDAVRA ITRLLERAQQ
ETGKQRLLFH DHRRIAKKRM LAIQYTRDAK ARKPLYKDLV KKTRQCVSYA RSAVKALEQS
VAHPSRTALA IELQSFVRLT DQVIRQTELR VFQDQQVPSS EKIVSLFEPH TDIIVKDRRD
TYYGHKVCLT GGKSNLILDC LIVEGNPADT TLTETMLDRQ HQIYNRYPLK AALDGGFASK
DNLAKAKEKK IKDVCFAKKR GLSELDMCRS HYVYKQLRRF RAGIEAGISW LKRTFGFNRC
MWKGLPSFKS YVWATIVSAN LLTVARKQLA