Gene Dret_1610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1610 
Symbol 
ID8419441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1857181 
End bp1858365 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content54% 
IMG OID645038184 
Productintegrase family protein 
Protein accessionYP_003198472 
Protein GI258405730 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0939795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.722179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTCA CCGTCAAAGC CATCCAGGCC GCTAAGCCTC GAGAAAAACT CTATCGTTTG 
AATGACGAAC GAGGGCTCTA TCTCGAGATC CCACCGAAAG GCGCTTACCG CTGGCGCTTT
CGCTACCGAT TTCATGGCAA ACCCAAAATG GTCAGCTTGG GCCGCTACCC AGATATCAGC
CTCAAACAAG CAAGGGAAAA ACGCGACGAG ATGCGCGCCC TTGTTGCTTC CTCCATCGAC
CCCTCGAATT ATCTCCGCCA AGCACGACAC GCTGCTAAAG ATGACAGCTT TGAGGCGGTC
GCCAGGGAAT GGCACAAAAA ATTCAAAAGC CGGTGGACAG AGGGCCATGC AGCCACTGTC
CTCACCCGGC TCGAACAAAA TGCTTTTCCC TGGATAGGCT CTCAGCCGAT AGATTCCGTT
ACCCCCCTCG ATATACTTCC TCTTTTGCGT CGCATAGAGG ATCGCGGCGC TATCGAACTT
GCCCACCGGG TGCGCGGGAT TATTAGTCAA GTCTTCCGAT TTGCAGTGGC CAACGAAAGA
GCCAGCCGCG ACCCGGCAAG CGACCTGAGA GACGCCCTGA CACCTCGCCA AGAAAATCAC
TTCGCGGCTA TCACCAACCC CTCTGAGATC CCAGCCCTAC TGGGCGCTAT CAGCGAATAC
CAAGGGCATT TTGTCACACG TTGCGCCCTC AGGCTGGCCT CCCTTGTTTT TGTCCGCCCC
GGCGAATTAC GCAAAGCAGA ATGGGACGAA ATTGATCTCG CCCACCAAGA GTGGCGCCTG
CCACCCGCCA AAACCAAACT TCGCAAGACG CACATCATCC CTCTTGCGCA TCAAGCTTGC
GCAATCTTTA ACGAAATCCA CCCGCTCACA GGCACAGGTC GTTACGTCTT CCCCTCTGCG
AGAGATAAAA ACAAGCCCAT GTCGGAAAAC ACCATCAATG CCGCACTACG ACAACTTGGA
TTCTCCAAAG AGAAAATGAC CGCTCACGGT TTCAGATCGA TGGCCTCGAC ACGCCTAAAC
GAACTAGGTT GGCATCCGGA TGCAATCGAG CGCCAACTGG GACATACCGA AAAAAACGGA
GTGCGTGCAG CATATAACCA CGCAGAATAC CTGGAAGAAC GCCGCAAAAT GATGCAGGCC
TGGGCTGACT ATCTCGATAA ACTCACCAAT AAAGATCTTC TTTAA
 
Protein sequence
MALTVKAIQA AKPREKLYRL NDERGLYLEI PPKGAYRWRF RYRFHGKPKM VSLGRYPDIS 
LKQAREKRDE MRALVASSID PSNYLRQARH AAKDDSFEAV AREWHKKFKS RWTEGHAATV
LTRLEQNAFP WIGSQPIDSV TPLDILPLLR RIEDRGAIEL AHRVRGIISQ VFRFAVANER
ASRDPASDLR DALTPRQENH FAAITNPSEI PALLGAISEY QGHFVTRCAL RLASLVFVRP
GELRKAEWDE IDLAHQEWRL PPAKTKLRKT HIIPLAHQAC AIFNEIHPLT GTGRYVFPSA
RDKNKPMSEN TINAALRQLG FSKEKMTAHG FRSMASTRLN ELGWHPDAIE RQLGHTEKNG
VRAAYNHAEY LEERRKMMQA WADYLDKLTN KDLL