Gene Dret_0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0801 
Symbol 
ID8418619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp949100 
End bp951136 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content62% 
IMG OID645037369 
ProductDNA ligase, NAD-dependent 
Protein accessionYP_003197670 
Protein GI258404928 
COG category[L] Replication, recombination and repair 
COG ID[COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) 
TIGRFAM ID[TIGR00575] DNA ligase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGC AGTCTGTTCA AAGCCGGGTG GCCTCCTTGC GCGAAACCAT TCGGCACCAC 
GATTATCTCT ATTATGTCCT GGATGCTCCC CAGATAAGCG ATAACGAGTA TGACGCCCTG
TTTAAGGAAC TCCAGGCGCT GGAACACCGG TACCCCGAGT TGGATGACCC CGCCTCACCC
ACCAAGCGGG TGGGGCAGAA ACCTTTGGCT GCCTTTGCCC ACCGCGAACA CAGTCTCCCC
ATGGCCAGTC TGGACAATGT TTTTTCCCTG GAGGAATTCC AGACCTATGT CCAGCGTCTG
GAGCGACTCC TTCCCGGTGA AGAGATCACC TTCTGGGTTG ATCCCAAGAT GGACGGACTG
GCGGTCGAGG TGATTTTTGA GGCCGGCCGG TATGTCGCGG CTTGCACCCG GGGCGATGGC
CGGGTGGGCG AGGACGTCAC CGCCCAGATG CGTACGGTGC GCACTGTCCC TTTGCAATTG
CGCGAAGAAG GGGTCGAGGT GCCCGACTAC ATCGAGGTCC GCGGGGAGGT GGTCATGCAT
GAAGACGATT TTGCCCTGCT CAACAAACGG CAAACCGAGA ACGGAGCCCG CCCATTTGCC
AACCCGCGCA ATGCCGCGGC TGGTTCGGTG CGCCAATTGG ACCCGGAAGT GACGGCCAAA
CGCCCGTTAC GTTTTTTCGC GTATGGCGTC GGAGTGGTGC GCTGGTCCGG AGCAAAAACG
GTCTGGTCGC GGCAGTCCGA GGTCATGCAG GGGTTGCGCG AACTGGGACT GCCGGTTGCC
CCCAAAGCCC GCTGGGCCGA GAGTGCTGCA GCGGTGCAGA CCTATTACCA GGACCTTGGC
GCCAACCGCC ACTCGCTGGG CTTTGAAATC GATGGGGTGG TGGCCAAGGT GGACCGGCTC
GAGCAACAGC ACCGTCTGGG CAGCACTTCC CGGGCGCCCC GCTGGGCTTT GGCCTTGAAA
TTTCCCGCCC ACCAGGCGGA GACGGTATTG GAAGATATCC AGGTCCAAGT GGGGCGCACC
GGGGTCTTGA CTCCCGTGGC CGTGCTGCGG CCGGTGGAGG TTGGGGGCGT CACGGTCTCC
CGGGCCACGC TGCACAATGA AGATGAGATC CGGGCCAAGG GCCTTCGAGT GGGCGATCCG
GTGCTTGTCC AGCGCGCCGG AGACGTCATT CCGGAAGTGG TGCGTCCTTT GACTGACAAA
CGTACCGGAG AGGAGAAAAC ATTTGTTTTC CCCCAGACCT GCCCGGCGTG CGGCAGCCCG
GTTTCTCGGC TCGGCGATGA AGTGGCCAGG CGGTGCGTGA ACGCCGCCTG TCCGGCCCAG
GTCGTTCAGC AACTCATCCA TTTTGTCTCC AAGGCCGGGC TGGATATTGA AGGTCTGGGG
GAGCGCTGGG TTCAGATTTT CGTGGACCAG GGGCTGGTCC GCTCCCCGGC TGACCTCTTT
CGGCTGAAGG AAAGAGACCT GGTGCCGCTG GAGCGCATGG GCGATAAATC GGCCCGGAAC
ATGATCCAGG CCATTGCAAG CGCCAAATCC TCGGCCCGGC TGGACCAACT TATCAGTGCC
CTGGGCATCC GGCTGGTAGG GCGGCGGACC GCCGAAATCC TGGCTACCTC GTTTGAGGAC
CTTGAGGCCC TGGCGGAGGC CGATTCTGAA ACTCTGCAGC ACATCCCGGA TATCGGACCG
GAAGTGGCCG CCTCACTGCG CCGTTTTTTT ACCACCCCGG CCAACCAGGC GCTCATTCGG
GATCTGCACC AACTTGGCGT GTGGCCACGG CGCGAGACTG GCAACGGATC TGATGCGGAC
CAGTCTGCCC CTCTGGAAGG ACTGCGTTTT GTTCTGACCG GGGCGTTGTC GGACATGAGC
CGGAGTGCGG CCAAGGATGC TATCGAGTCC CGTGGGGGCA AGGTGGTCAG CGCTGTTTCC
AGTAAGGTGG ATTACGTGGT CGCTGGGGAG AATCCGGGCA GCAAGCTGGA TAAGGCCCGG
GAACTGGATA TCGCCGTTCT CGACGAGACG GGGCTTCACG ACCTTTTGGC TGGGTAG
 
Protein sequence
MAEQSVQSRV ASLRETIRHH DYLYYVLDAP QISDNEYDAL FKELQALEHR YPELDDPASP 
TKRVGQKPLA AFAHREHSLP MASLDNVFSL EEFQTYVQRL ERLLPGEEIT FWVDPKMDGL
AVEVIFEAGR YVAACTRGDG RVGEDVTAQM RTVRTVPLQL REEGVEVPDY IEVRGEVVMH
EDDFALLNKR QTENGARPFA NPRNAAAGSV RQLDPEVTAK RPLRFFAYGV GVVRWSGAKT
VWSRQSEVMQ GLRELGLPVA PKARWAESAA AVQTYYQDLG ANRHSLGFEI DGVVAKVDRL
EQQHRLGSTS RAPRWALALK FPAHQAETVL EDIQVQVGRT GVLTPVAVLR PVEVGGVTVS
RATLHNEDEI RAKGLRVGDP VLVQRAGDVI PEVVRPLTDK RTGEEKTFVF PQTCPACGSP
VSRLGDEVAR RCVNAACPAQ VVQQLIHFVS KAGLDIEGLG ERWVQIFVDQ GLVRSPADLF
RLKERDLVPL ERMGDKSARN MIQAIASAKS SARLDQLISA LGIRLVGRRT AEILATSFED
LEALAEADSE TLQHIPDIGP EVAASLRRFF TTPANQALIR DLHQLGVWPR RETGNGSDAD
QSAPLEGLRF VLTGALSDMS RSAAKDAIES RGGKVVSAVS SKVDYVVAGE NPGSKLDKAR
ELDIAVLDET GLHDLLAG