Gene Dret_1496 details


Gene Information

Locus tag: Dret_1496
Symbol:
ID: 8419325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1733291
End bp: 1734628
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 48%
IMG OID: 645038070
Product: transposase IS4 family protein
Protein accession: YP_003198360
Protein GI: 258405618
COG category: [L] Replication, recombination and repair
COG ID: [COG3666] Transposase and inactivated derivatives
TIGRFAM ID:
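
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1733291, 1734628)
print(length)                  # 1338
print(protein_length(length))  # 445
```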


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 5.17789e-10
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 8.23353e-08
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTAGAG GGTTTATTAA AGGGCATCGA GATCAACTCT ACCTGTTGCC ACCTTCAATT 
GATGACTGGA TATCAAAAAA CCACAGTGTC AGGTTGATCG ATTCTTGTGT AGAAAATATT
GATTTATCAA TTTTTTATGA AAGCTACTCC CACGAGGGGA AGCCGCCTTA TGACCCTGCT
ATGATGATTC GTATTCTTAT TTATGCATAT AGCAAAGGAA TACGCTCTTC TCGGAAAATA
TCTGCTTTAT GCGAAGAAGA TATTGCTTTC CGGTGGCTTA CCGGAAATAT AATTCCTGAT
CATTCTGCTA TTTGCCGCTT CCGCGCTAAG CATAAAGAAA ATTTTAAGCA GCTTTTTCGA
GAAACAATCC GTTTGGCCGC TGAATCCGGT GCGTTAAAAA TAGGCAGTCT TTTTCTTGAT
GGAACTAAGG TGAAAGGCTC GGCTTCCTTG GAAGCCAATC GTAATCTCGA GCATATTAAG
CAAGACATTG AACGCATCGT GGACGAAGCT GAGGCAGTCG ATGCCAGTGA GGATAAGCAG
CTCGGCGAAG ATAACCGGGA CGATGTTTTG CCCCCTGAGC TTGCTGATCC CAAATCCCGG
TTGGAGCGAC TCAAGGCTGC CAAGGCCAGG CTGGAAGCTG AAAAAGAGGC TGCGGCGAAA
GAGTCTCGAG ATGACGACGA CTCAAATGGT CCTGGAGCTG GCACCGGTGA TGAAAAAACG
GCCACTGGCA ACAAAGAAAA AGCAAATATC ACCGATCCTG ACAGCAGAAT AATGAAAACA
CGGAACGGCT GGGTGCAAGG GTATAATTGC CAAGGAGTTT CAGACGAAAA TCAGTTTATA
GTCGCCAACG CAGTTACTCA AGACTGCAAT GACGCCCACC AACTCGAACC AATGCTTCAA
GCGGCTCAAG ACAACTTGTC CAAGATAGAG ACAGGCCAGA ACACCGAAAC CTTTTCGGCA
GATGCCGGTT ACTGGGCTGA GGGACTTGAT ATTTCAAAGA TCGAGAGCAA TGGCCCAGAG
GTGATTGTGG CTACCCGCAA AGGCTGGAAG CAGCGAAAAC AAAACCGTGA AAAGTCCCCA
CCTCGAGGGC GGATCCCCAA AGGGTTATCC CAGCGGGAGT TGATGGAACG AAAGCTACTG
ACCCAAAGAG GCCAGCGGAT CTATGCCAAG CGCGGACAAA CGATAGAAGC TATTTTCGGT
CAACTCAAGG AATGCCTTGG ATACAGGAAT TTTCTATTGC GTAGCCTCAA AAAAGTTCAG
GGTGAATGGG ACCTCCAATG TGCAGTGAGC AATATGCTCA AGCTGTTTCG GTTGTCAGGG
GCCACCACCA GTCAGTAG
 
Protein sequence
MARGFIKGHR DQLYLLPPSI DDWISKNHSV RLIDSCVENI DLSIFYESYS HEGKPPYDPA 
MMIRILIYAY SKGIRSSRKI SALCEEDIAF RWLTGNIIPD HSAICRFRAK HKENFKQLFR
ETIRLAAESG ALKIGSLFLD GTKVKGSASL EANRNLEHIK QDIERIVDEA EAVDASEDKQ
LGEDNRDDVL PPELADPKSR LERLKAAKAR LEAEKEAAAK ESRDDDDSNG PGAGTGDEKT
ATGNKEKANI TDPDSRIMKT RNGWVQGYNC QGVSDENQFI VANAVTQDCN DAHQLEPMLQ
AAQDNLSKIE TGQNTETFSA DAGYWAEGLD ISKIESNGPE VIVATRKGWK QRKQNREKSP
PRGRIPKGLS QRELMERKLL TQRGQRIYAK RGQTIEAIFG QLKECLGYRN FLLRSLKKVQ
GEWDLQCAVS NMLKLFRLSG ATTSQ
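
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own tooling: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
import itertools

# Codon order T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCTAGAG GGTTTATTAA AGGGCATCGA"))  # MARGFIKGHR
```

Passing the full gene sequence, with its block spacing and line breaks left in place, should yield the protein sequence listed above once the spaces in that listing are removed.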