Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0625 |
Symbol | |
ID | 8418437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 750009 |
End bp | 751346 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 645037191 |
Product | transposase IS4 family protein |
Protein accession | YP_003197498 |
Protein GI | 258404756 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000035382 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000000654827 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGCTAGAG GGTTCATTAA AGGGCATCGA GATCAACTCT ACCTGTTGCC ACCTTCGATT GATGACTGGA TATCAAAAAA CCACAGTGTC AGGTTGATCG ATTCTTGTGT AGAAAATATT GATTTATCAA TTTTTTATGA AAGCTACTCT CACGAGGGGA AGCCGCCTTA TGACCCTGCT ATGATGATTC GTATTCTTAT TTATGCATAC AGCAAAGGAA TACGCTCTTC TCGGAAGATA TCTGCTTTAT GCGAAGAAGA TATTGCTTTC CGGTGGCTTA CTGGAAATAT AATCCCTGAT CATTCTGCTA TTTGCCGCTT CCGCGCTAAG CATAAAGAAA ATTTTAAGCA GCTTTTCCGA GAAACAATCT GTTTGGCCGC TGAATCCGGT GCGTTAAAAA TAGGCAGTCT TTTTCTTGAT GGAACTAAGG TGAAAGGCTC GGCTTCCTTG GAAGCCAATC GTAATCTCGA GCATATTAAG CAAGACATTG AACGCATCGT GGACGAAGCT GAGGCAGTCG ATGCCAGTGA GGATAAGCAG CTTGGCGAAG ATAACCGGGA TGATGTTTTG CCCCCTGAGC TTGCTGATCC CAAATCCCGG TTGGAGCGAC TCAAGGCTGC CAAGGCCAGG CTGGAAGCTG AAAAAGAGGC TGCGGCGAAA GAGTCTCGAG ATGACGACGA CTCAAATGGT CCTGGAGCTG GTACCGGTGA TGAAAAAACG GCCACTGGCA ACAAAGAAAA AGCAAATATC ACCGATCCTG ACAGCAGAAT AATGAAAACA CGGAACGGCT GGGTGCAAGG GTATAATTGC CAAGGAGTTT CAGACGAAAA TCAGTTTATT GTCGCCAACG CGGTTACTCA AGACTGCAAT GACGCCCACC AACTCGAACC AATGCTCCAA GCGGCTCAAG GCAACTTGTC CAAGATAGAG ACAGGCCAGA ACACCGAAAC CTTCTCGGCA GATGCCGGTT ACTGGGCTGA GAAACTTGAT ATTTCAAAGA TCGAGAGCAA TGGCCCAGAG GTGATTATGG CTACCCGCAA AGGCTGGAAG CAGCGAAAAC AAAACCGTGA AAAATCCCCA CCTCGAGGGC GGATCCCCAA AGGGTTATCC CAGCGGGAGT TGATGGAACG AAAGCTACTG ACCCAAAGAG GCCAGCGGAT CTATGCCAAG CGCGGACAAA CGATAGAAGC TATTTTCGGT CAACTCAAGG AATGCCTTGG ATACAGGAAT TTTCTATTGC GTAGCCTCAA AAAAGTTCAG GGTGAATGGG ACCTCCAATG TGCAGTGAGC AATATGCTCA AGCTGCTTCG GTTGTCAGGG GCCACCACCA GTCAGTAG
|
Protein sequence | MARGFIKGHR DQLYLLPPSI DDWISKNHSV RLIDSCVENI DLSIFYESYS HEGKPPYDPA MMIRILIYAY SKGIRSSRKI SALCEEDIAF RWLTGNIIPD HSAICRFRAK HKENFKQLFR ETICLAAESG ALKIGSLFLD GTKVKGSASL EANRNLEHIK QDIERIVDEA EAVDASEDKQ LGEDNRDDVL PPELADPKSR LERLKAAKAR LEAEKEAAAK ESRDDDDSNG PGAGTGDEKT ATGNKEKANI TDPDSRIMKT RNGWVQGYNC QGVSDENQFI VANAVTQDCN DAHQLEPMLQ AAQGNLSKIE TGQNTETFSA DAGYWAEKLD ISKIESNGPE VIMATRKGWK QRKQNREKSP PRGRIPKGLS QRELMERKLL TQRGQRIYAK RGQTIEAIFG QLKECLGYRN FLLRSLKKVQ GEWDLQCAVS NMLKLLRLSG ATTSQ
|
| |