Gene Dret_1777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1777 
Symbol 
ID8419618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2046261 
End bp2048171 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content61% 
IMG OID645038361 
Productexcinuclease ABC, C subunit 
Protein accessionYP_003198639 
Protein GI258405897 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0413401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCTT TTCGAGCCAA GGAATACCCG GATTGCCCGG GGACCTATCT GATGAAGGAC 
GAACGGGGCC GTGTGATCTA TGTCGGCAAG GCCAAGGTGC TGCGCAAGCG GTTGGCTTCC
TATTTTCAGG AACCAGACCG TCTGCCGCGC AAAACGCGGG TCATGATGGA CAAGGTCTGG
TCTATTGAAA CATTGTGCAC GGAAACGGAA AAAGAAGCGT TTCTGCTCGA AAACAGCCTG
ATCAAGAAGC ACCGGCCCCG TTACAATATC ATTTTGCGCG ACGACAAGTC GTACGTCCTT
TTCAAACTGG ACAAGCGCCA CCCCTTTCCC CGTTTGAGCA TGACCCGGCG GGTGGTCCAG
GACGGGTCGA CCTATTTCGG GCCTTTCACT TCTGCGGTGG CGGCCAGAGA GACCTGGAAA
CTGCTCAATC GTTTGTTTCC GCTGCGCAAA TGCAAACAAA CGACCTTCAA CAACCGGGTC
CGGCCCTGTC TGCAATATAA TATCGGTCGG TGTCTGGGCC CGTGTGTCTA CGATATCCCT
CGGCAGGAGT ATCAGGAGGT GGTGCGGCAG GTGGAACTGT TTCTGACCGG CCGCTCCAAG
GAACTGCTGC GCCGGCTCCG GGCGGATATG CAAGCGGCGT CAGAGGATCT GCGGTTTGAA
GATGCGGCCC GTCTGCGGGA TCAGATCCAG GCTGTCGAAC AGACCGTGGA GCAGCAGGTC
GCGGTTCTTC CCGGCGGCAA GGACCGCGAT GTGCTCGGAT TGGGGCGCAC GGTTGGCGGT
GTGGCGCTAG GGCTGCTTTT CGTGCGCCAG GGGCAGCTGC TGGATCAGAA GAGCTTTTTT
TGGGCTGAAG AAGACGGGAC AGACGAGCTG GGTCTCCAGG AGGAGACGGC CTCGGCACAG
TCCACCGCTG TGGAAGAGGC GCAAGAGGTG CTGCGCTCCT TTGTGCTCCA GTTCTACAGC
CCTGGCCGGT ATATCCCGGA GCGCATCGTG CTCCCCTTTG CCATGGAAGA TGCGGTCCTG
GAGGACATAT TGAGTGAGCG CCGCGGCGGC CCTGTCCGTT TGGCCACCGC CCATGGGCCG
CAGGAACGCA AACTGGTATC CTTGGCCGAA ACCAATGCCG TCCAGGCGGG AGATCGGGCG
CGGAAAACGG TCGAGCCCCC TCTCGAACGC CTTCAGGACC GGCTTGGACT CTCGCAATTG
CCGGAGCGCA TCGAAGCCGT CGACGCCTCC CATTTCGGTG GCCAGGGGAT GGTCGTGGGC
CAGGTCGTCT TTGAGAACGG GCGACCGCAC AAGGAGGCCT ACCGGATTTA TGCCTTCCCG
GAACTCGAGG GGGCAGCCGA TGACTATGCC GCTCTCCAGG GATGGGCGCG GCGCCGGCTT
CGTTCCGGAC CGCCCTGGCC CGATCTGATC GTGGTTGACG GTGGCAAGGG TCAGCTCCAG
GCAGTACAAA AAGGGCTTAA CGAAGGACAG GAGGCCGGTC TTGCGGCGGA TTCCTTCGCC
CTGGCCGCTT TGGCCAAAGG GGAGCGCCGC GGCGGCGAAC TCGAGGAACG CGTCTTTCGA
CCGGGACGAA AAAATCCGGT GGCCCTGCGT CCGGGCAGTG CGGAATTGCT TCTCCTGCAA
CACATCCGTG ACAGCGTGCA CCGCTTTGTC TTGAGCCGGC AACGCCGCAC GCGGCGGGCG
AAGGGCTTGG ACAGCCGGCT CGAGGAATTG CCGGGAGTCG GTCCACGAAC GGCCCACTTG
TTGTGGAACC ATTTCGGCAC CCTGGAGCGG ATGTGTCAGG CAACCGAAGC TGAACTGGAA
GCATTGCCAG GCATTGGTGC GGCCAAAGCG GCCCAATTGC GCCGGGGATT GGCTTCACTG
TCCCCCGGGA CCCCCGTGGA CCATGATGAG CAGGGGGGCA ACACGGCTTG A
 
Protein sequence
MKPFRAKEYP DCPGTYLMKD ERGRVIYVGK AKVLRKRLAS YFQEPDRLPR KTRVMMDKVW 
SIETLCTETE KEAFLLENSL IKKHRPRYNI ILRDDKSYVL FKLDKRHPFP RLSMTRRVVQ
DGSTYFGPFT SAVAARETWK LLNRLFPLRK CKQTTFNNRV RPCLQYNIGR CLGPCVYDIP
RQEYQEVVRQ VELFLTGRSK ELLRRLRADM QAASEDLRFE DAARLRDQIQ AVEQTVEQQV
AVLPGGKDRD VLGLGRTVGG VALGLLFVRQ GQLLDQKSFF WAEEDGTDEL GLQEETASAQ
STAVEEAQEV LRSFVLQFYS PGRYIPERIV LPFAMEDAVL EDILSERRGG PVRLATAHGP
QERKLVSLAE TNAVQAGDRA RKTVEPPLER LQDRLGLSQL PERIEAVDAS HFGGQGMVVG
QVVFENGRPH KEAYRIYAFP ELEGAADDYA ALQGWARRRL RSGPPWPDLI VVDGGKGQLQ
AVQKGLNEGQ EAGLAADSFA LAALAKGERR GGELEERVFR PGRKNPVALR PGSAELLLLQ
HIRDSVHRFV LSRQRRTRRA KGLDSRLEEL PGVGPRTAHL LWNHFGTLER MCQATEAELE
ALPGIGAAKA AQLRRGLASL SPGTPVDHDE QGGNTA