Gene Dgeo_2272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2272 
Symbol 
ID4057240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2395236 
End bp2396411 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content61% 
IMG OID641231322 
Productputative transposase, IS891/IS1136/IS1341 
Protein accessionYP_605735 
Protein GI94986371 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.641277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGC ACCGCAAGGT CTACAGGTAT CGGATTGAGC CGACCCCGGT TCAAGAGTCG 
AAGCTGTACA TGCTGGCGGG AAGTCGGCGC TTCGTCTTCA ACTGGGCTCT TGCGCGTCGA
AGGGAACACT ACGCCGAAAC GGGCAAGACC CTGGGGTACA ACGCTCAGGC GGGAGAGTTG
ACGGCCCTGA AGAACCAGGA GGAAACCTCC TGGCTGAAGG AATCGGACAG CCAGCTTCTC
CAGCAGGCCC TCAAGGACGT GGAGCGGGCC TTCGTCAACT TCTTTGAGAA GCGGGCGAGG
TTCCCCCGGT TCAAGAGCAA AAAGACGGAT ACTCCGCGCT TCCGTATTCC CCAGCGGGTG
CGGATAGAGG GGAGCCGTGT GTATGTCCCG AAGGTGGGAT GGGTTAAGCT CCGCAAGTCT
CAGGAGATAG AGGGCAAGAC CAAGAGCGCG ACGTTCAAGC GGGAGGCAGA CGGTCACTGG
TACGTCTTGC TCGTCTCCGA GTTTGAGATG CCCGATGTAC CGCTGCCCCC CGTCCCTGAG
TCCGAGGTGG TCGGGATTGA CCTCGGCCTG AAGGATTTCT ACGTGTTGTC CGACGGCGGG
CGGAAAGAGG CCCCGAGGTT TGCCCGCAAG GGGCAGCGGA AACTCCGCCG CGCTGCCCGT
CGTCACTCCA AATGCACCAG GGGGAGCAAC CGCAAGGCGA AGGCCAAGCG GAAGCTCGCC
CGTGTTCACC GCCAGATTGC GAATCAGAGA AAGGACTTCG TTCACAAGGC CACCTCCGGT
CTTGTTCAGC AGTACCAGGG TTTCTGCATC GAGAACTTGA GCATCAAGGG GATGGCGAAA
ACCAAGCTGT CCAAGAGCGT TCTCGACGCC GCCCTGGGAG AGTTTCGCCG CCAGCTCGCC
TACAAGGCCC AGTGGCACCG GAAGTGGCTG GCGGTCATAG ACCGCTGGTT TCCGTCCAGC
AAGCTGTGTG GGGAATGCGG CAGCATCAAC GCAGACCTGA CCCTGAGTGA CCGGGAATGG
ACGTGCGAAT GTGGAGCGGT TCACGACCGC GACCTCAACG CCGCCCGGAA CATCAAGCGG
GAAGGGCTTT CGCAAATCGT CGTCGCGGGG CACGCGGAGA CGTTAAACGC TCGGGGAGAG
GGTGTCAGAC CTGCGATAGC GGGCAGCCCT CGATGA
 
Protein sequence
MTTHRKVYRY RIEPTPVQES KLYMLAGSRR FVFNWALARR REHYAETGKT LGYNAQAGEL 
TALKNQEETS WLKESDSQLL QQALKDVERA FVNFFEKRAR FPRFKSKKTD TPRFRIPQRV
RIEGSRVYVP KVGWVKLRKS QEIEGKTKSA TFKREADGHW YVLLVSEFEM PDVPLPPVPE
SEVVGIDLGL KDFYVLSDGG RKEAPRFARK GQRKLRRAAR RHSKCTRGSN RKAKAKRKLA
RVHRQIANQR KDFVHKATSG LVQQYQGFCI ENLSIKGMAK TKLSKSVLDA ALGEFRRQLA
YKAQWHRKWL AVIDRWFPSS KLCGECGSIN ADLTLSDREW TCECGAVHDR DLNAARNIKR
EGLSQIVVAG HAETLNARGE GVRPAIAGSP R