Gene Dret_0830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0830 
Symbol 
ID8418648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp980806 
End bp983100 
Gene Length2295 bp 
Protein Length764 aa 
Translation table11 
GC content59% 
IMG OID645037398 
Productcysteine synthase 
Protein accessionYP_003197699 
Protein GI258404957 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01136] cysteine synthases
[TIGR01138] cysteine synthase B 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.246504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGCCAA CACCACCCAA TGTCCTTTCC TTGATCGGCA AGACCCCTCT GGTTCCTGTC 
CATCGCCTGA ATCCCTATCC CAGTGTCTGT ATTGAGGCCA AGCTGGAAAA GCTCAATCCT
GGCGGTTCGG TCAAAGACCG CGTGGCGCTG GCCATGGTCG AGGCAGCGGA ACGCAGTGGG
CAATTGACGC CAGGCAAGAC CCTTATTGAA GCGACCTCCG GCAATACCGG AATAGGGTTG
GCCATGGTCT GTGCGGTCAA AGGGTACACC CTGCGGCTGT TGATGCCCGA ATCGGCTTCA
GAAGAGCGTC GTCGTATCCT GCGGGCCTAT GGAGCGCAAA TCCAATTGAC GCCCGGTCAC
CTCGGTACCG ATGGCGCGAT TGAAGAGGCC TATCGCATGG CCCGGGAAGA GCCCGAGTTC
TATGTCCTGA TGGACCAGTT CAACAATCCA GCGTCGATTG CCGCCCATTA TGAGACCACG
GCCCCGGAAA TCTGGGAGCA GACCGCAGGG GAGGTAACCC ATGTGGTCGT GGCTTTGGGA
ACCAGCGGGA CAGCCATGGG GCTCAGCCAG CGGCTCAAGG AATACAACCC GGCCGTGGAG
GTTGTTGGTG TGGAACCCTA TGCGGGGCAC AAGATCCAGG GCCTGAAGAA TATGCAGGAG
TCCTATCCGC CCGGAATTTA TGAGCGGAAC CAACTGGATC GTATTGTGCG GGTCGATGAC
GAACAGGCTT TTGGTTTGTG CCGGGCATTG GCTCAAAAAG AAGGCATTTT TGCCGGCCTG
AGTTCCGGTG CGGCCTTGGC AGGCGCACTG ACCCTGGCCG CTGAGATCCC CTCAGGGCGA
ATTGTTATCA TCTTTCCGGA TGGTGGAGAG CGGTACCTGA GCACTCCCGT CTTTGATCCT
CCGGCCAAGC AGGGCATGCG CCTTTTGGAC CTGCAAAGCG GTGAGAAGCG GCATATATTT
GTCCAGAAAA ACGCCCTGGG AGTCTTCACG CCCGGGCCCG GCCCCGAGGA ATTGGAGCAG
CCGGAGCTCT GGCGCCGTCT GGTCTGGGCG GACATCCTGG TTCGCTATCT CCGGCACAAG
GATTTCGAGG TGCACGGGGT GGTCGGCCTT GCAGACTGGG ATGAACACCT TTCCCATCTG
GCTGAACAGC AAGGCTGCAA TTTGCAGGAA ATGCGAGGGG CCATGCTCGA TCGGGCCGAG
GCTTTGCTGC GCCGGCTCGG GTTGGAAACT GGCTGGCATT GTCAGGCGGC AAGCCAGTGC
CGCGAGACCC AATTGGCGCT GTGCCGTGAA CTGGTCCGTA AGGGGCTGGG GTATGAAAAA
TTGCGCTCGG TCTATTATGA CGTCGGCCGT GACACCGATT ACGGGGTCTT ACGACGTACC
GATCTGGCGA AGCTCTCCCT GGGAAAAACT GTGGATCTCG ACCGCTATGC CAAAGATAAT
CCCCGGGATT TTACATTGTT GAAACGGACC AACCTCGCCG ATCTCAAGCG GGGGGATTTC
TGGAAGACAG AATGGGGCAA TGTCCGGCCG AGCTGGTATC TGCAAATGGC CGCCGCGGCC
TTGGCAGAGG GTATCGCCAT GGATGTGGTC TTGGCCGGGC GAGCCCACCA TTTTCCGCAT
ATGGAAAACC TCCGGGCTCT CTGGGCGGTT CGCAATGCCC TGCCTCAGGT CTGGCTCATG
ACCCAGGCGG TGGAAGGAGA GGCCATCGTC CCTGATATCG AAACGGCCAG CGAACGGCTC
GGCGGCATGC ATGCTTTGCG TTTGTGGTTG CTCTCTGGCG GGTATCGCAA ACCCCTGCAC
TCCTCTGAGG ACAACGCGAC CATGTGGCGC CGCAATTGGG AGAGGCTACA GGAAAGCGTG
GCTACGCTGC ATGTCGCCCG CGGCGAGGGT GGACAGGTGG ATCCGGGCTT TGAGCAGACG
CTCTACGATG TCAAGACAAT GTTCTGGGAC CAATTGGAAG ACGACTTGGA TCTCCAGCAT
TTTTGGCCTG TACTCTGGAG TTTATGCCGA ACGATCCTCA AAAAGGCCTC CCAGGGCCGA
CTGGCTCCGG TCGAGGCCGC TCGAGGCTGG AAACTTGTGA CGGATCTGGA CACGATTCTC
GGCGTGGTCG ATTGGCATAC ACTGCCCTTA CGTCAGGATC AGTGGCCTGA GGGGGTTCGG
GATCGGATCG CATTGCGCGA ACAGGCCCGG CGCGATCGGG ATTTTGCCCG GGCCGATCTC
CTGCGTCAGG AAATCGTCGC CAAAGGGTAC CGGCTTGAAG ACACCCCCCA AGGGCCGCGA
GTATTTCCGG CTTGA
 
Protein sequence
MSPTPPNVLS LIGKTPLVPV HRLNPYPSVC IEAKLEKLNP GGSVKDRVAL AMVEAAERSG 
QLTPGKTLIE ATSGNTGIGL AMVCAVKGYT LRLLMPESAS EERRRILRAY GAQIQLTPGH
LGTDGAIEEA YRMAREEPEF YVLMDQFNNP ASIAAHYETT APEIWEQTAG EVTHVVVALG
TSGTAMGLSQ RLKEYNPAVE VVGVEPYAGH KIQGLKNMQE SYPPGIYERN QLDRIVRVDD
EQAFGLCRAL AQKEGIFAGL SSGAALAGAL TLAAEIPSGR IVIIFPDGGE RYLSTPVFDP
PAKQGMRLLD LQSGEKRHIF VQKNALGVFT PGPGPEELEQ PELWRRLVWA DILVRYLRHK
DFEVHGVVGL ADWDEHLSHL AEQQGCNLQE MRGAMLDRAE ALLRRLGLET GWHCQAASQC
RETQLALCRE LVRKGLGYEK LRSVYYDVGR DTDYGVLRRT DLAKLSLGKT VDLDRYAKDN
PRDFTLLKRT NLADLKRGDF WKTEWGNVRP SWYLQMAAAA LAEGIAMDVV LAGRAHHFPH
MENLRALWAV RNALPQVWLM TQAVEGEAIV PDIETASERL GGMHALRLWL LSGGYRKPLH
SSEDNATMWR RNWERLQESV ATLHVARGEG GQVDPGFEQT LYDVKTMFWD QLEDDLDLQH
FWPVLWSLCR TILKKASQGR LAPVEAARGW KLVTDLDTIL GVVDWHTLPL RQDQWPEGVR
DRIALREQAR RDRDFARADL LRQEIVAKGY RLEDTPQGPR VFPA