Gene Dret_1761 details

Gene Information

Locus tag: Dret_1761
Symbol:
ID: 8419602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2026544
End bp: 2027911
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 62%
IMG OID: 645038345
Product: exodeoxyribonuclease VII, large subunit
Protein accession: YP_003198623
Protein GI: 258405881
COG category: [L] Replication, recombination and repair
COG ID: [COG1570] Exonuclease VII, large subunit
TIGRFAM ID: [TIGR00237] exodeoxyribonuclease VII, large subunit
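
The coordinate and length fields above are mutually consistent, which a short calculation makes explicit. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes the stop codon; the function names are illustrative only and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2026544, 2027911)
print(length)                     # 1368
print(protein_length_aa(length))  # 455
```

Both results agree with the Gene Length (1368 bp) and Protein Length (455 aa) fields of this record.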


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCATG TCTTCAGCGT TCAGGAATTG ACCCAGGCCG TGAAAGGGGT CGTCGAGGGC 
CAGTTCCCTC TGGTCTGGGT TCGAGGGCAG GTTTCCAACC TGTCCCGGCC CGGTTCGGGG
CATATCTATT TCACGCTCAA GGACGAACTC GCTCAACTCA AAGTGGTCTG GTTCAAAGGC
AACCAATGGC AGACACCGGC CCTGGAATCC CTGCATAACG GGCAGGAAAT CGTCTGTGCC
GGGCGGTTGA GCGTCTATCC GCCGCAAGGG ACCTACCAAC TCATTGCTGA ATGGGTGCAG
GACCAGGGGA TTGGCCGTTT GCAGGCCGCC TTTGAAGCCC TGAAGCAGAA GATGGAGGCC
AAGGGGTATT TTGCACAGGA CCGGAAACGG CCCCTGCCGC CGCATCCCCA ACGCGTCGCG
GTCGTGACCG CGCCCCAGGG CGCGGCGGTA CGCGATTTTC TCCGTCTGGC CGGAGAGCGG
GGACGTCCGG CTGCCATCCG CATCTACCCC AGCCTCATGC AGGGAGAAGG GGCGGAAGAC
AACGTCATTG GAGCCCTGGA GCAGGTGCAA CTCGACGAAT GGGCCGAAGT GGTGGTCATC
ACCCGCGGCG GTGGATCACT GGAGGACCTG TGGACCTTTA ATACCGAGGC GGTCGCCGAG
GCCGTGGCCC ACTTTCCGCT TCCGACCGTT GCCGCGATCG GCCACGAGCG GGATGTGACC
ATCGTAGACC TTATCGCTGA CAGTCGGGCG GCCACGCCGA GTCACGCTGC GCAACTTGTG
TGGCCTTTGC GTCAGGAACT TGTGCAGGAG GTCGACGAGT GGGAAATGCG TCTGGATAAG
GCGTGGATGG TACAGTGGCG CCATGCCCAG CGCCGCTTTG CCGAACTGGA GAAAGGGCTG
GGATGGCTTT CGCCCAGGCG CCGTTTGCAA CGAATGGACA AGGAATGCCA CCGTTTGGGA
CACGCCCTGG TCCGGACCGG ACGCCGCTTT GTCACGCAGC AGGAAAGCCG GGCACACGAC
AGCAGCGAGG CGCTTTTGCA TCGGGTGCGA CGGCATTTTG GCCGCACGGC CACGGAACGT
CTGGAACACG CCGGGCGACG TTTGCTCCAC TGTCGCAGCA ACCATTTTAA TGCCGTTGCC
CATCGGCTCG AAATCCAGAC TTCGGCCTTG GCCCATCTCG ATCCCAAGGC CCCGCTACGG
CACGGATTCA GCCTAGTCTC TCGGGTGGAC ACCGGAGAGT TGGTGACATC GGCGGCTCAG
GTCGCCCCTG AAGATTTTTT GCAGGTCCAG ACCGGTGATG GAGCGTACCG GGTCCGGGCT
ACAGCAGGCG ATACGGCTGC CTCGTCCGGC CAAGAGACGG AAACCTGA
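
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or base composition can be checked. The following is a minimal sketch of that cleanup and of a GC percentage calculation; the helper names are hypothetical, and the example runs only on the first line of the sequence rather than the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the listing.
first_line = "ATGCAGCATG TCTTCAGCGT TCAGGAATTG ACCCAGGCCG TGAAAGGGGT CGTCGAGGGC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this 60-base fragment
```

Applying `clean_sequence` to all lines of the listing and passing the result to `gc_percent` would give a value to compare against the GC content field of the record.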
 
Protein sequence
MQHVFSVQEL TQAVKGVVEG QFPLVWVRGQ VSNLSRPGSG HIYFTLKDEL AQLKVVWFKG 
NQWQTPALES LHNGQEIVCA GRLSVYPPQG TYQLIAEWVQ DQGIGRLQAA FEALKQKMEA
KGYFAQDRKR PLPPHPQRVA VVTAPQGAAV RDFLRLAGER GRPAAIRIYP SLMQGEGAED
NVIGALEQVQ LDEWAEVVVI TRGGGSLEDL WTFNTEAVAE AVAHFPLPTV AAIGHERDVT
IVDLIADSRA ATPSHAAQLV WPLRQELVQE VDEWEMRLDK AWMVQWRHAQ RRFAELEKGL
GWLSPRRRLQ RMDKECHRLG HALVRTGRRF VTQQESRAHD SSEALLHRVR RHFGRTATER
LEHAGRRLLH CRSNHFNAVA HRLEIQTSAL AHLDPKAPLR HGFSLVSRVD TGELVTSAAQ
VAPEDFLQVQ TGDGAYRVRA TAGDTAASSG QETET
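
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted, and that is not modeled here). The `translate` helper is illustrative, stops at the first stop codon, and assumes the input contains only A, C, G, and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCAGCATG TCTTCAGCGT TCAGGAATTG"))  # MQHVFSVQEL
```

The same function applied to the last 18 bases of the gene (`CAAGAGACGG AAACCTGA`) yields `QETET`, matching the end of the protein sequence.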