Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0830 |
Symbol | |
ID | 8418648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 980806 |
End bp | 983100 |
Gene Length | 2295 bp |
Protein Length | 764 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 645037398 |
Product | cysteine synthase |
Protein accession | YP_003197699 |
Protein GI | 258404957 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01138] cysteine synthase B |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.246504 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCGCCAA CACCACCCAA TGTCCTTTCC TTGATCGGCA AGACCCCTCT GGTTCCTGTC CATCGCCTGA ATCCCTATCC CAGTGTCTGT ATTGAGGCCA AGCTGGAAAA GCTCAATCCT GGCGGTTCGG TCAAAGACCG CGTGGCGCTG GCCATGGTCG AGGCAGCGGA ACGCAGTGGG CAATTGACGC CAGGCAAGAC CCTTATTGAA GCGACCTCCG GCAATACCGG AATAGGGTTG GCCATGGTCT GTGCGGTCAA AGGGTACACC CTGCGGCTGT TGATGCCCGA ATCGGCTTCA GAAGAGCGTC GTCGTATCCT GCGGGCCTAT GGAGCGCAAA TCCAATTGAC GCCCGGTCAC CTCGGTACCG ATGGCGCGAT TGAAGAGGCC TATCGCATGG CCCGGGAAGA GCCCGAGTTC TATGTCCTGA TGGACCAGTT CAACAATCCA GCGTCGATTG CCGCCCATTA TGAGACCACG GCCCCGGAAA TCTGGGAGCA GACCGCAGGG GAGGTAACCC ATGTGGTCGT GGCTTTGGGA ACCAGCGGGA CAGCCATGGG GCTCAGCCAG CGGCTCAAGG AATACAACCC GGCCGTGGAG GTTGTTGGTG TGGAACCCTA TGCGGGGCAC AAGATCCAGG GCCTGAAGAA TATGCAGGAG TCCTATCCGC CCGGAATTTA TGAGCGGAAC CAACTGGATC GTATTGTGCG GGTCGATGAC GAACAGGCTT TTGGTTTGTG CCGGGCATTG GCTCAAAAAG AAGGCATTTT TGCCGGCCTG AGTTCCGGTG CGGCCTTGGC AGGCGCACTG ACCCTGGCCG CTGAGATCCC CTCAGGGCGA ATTGTTATCA TCTTTCCGGA TGGTGGAGAG CGGTACCTGA GCACTCCCGT CTTTGATCCT CCGGCCAAGC AGGGCATGCG CCTTTTGGAC CTGCAAAGCG GTGAGAAGCG GCATATATTT GTCCAGAAAA ACGCCCTGGG AGTCTTCACG CCCGGGCCCG GCCCCGAGGA ATTGGAGCAG CCGGAGCTCT GGCGCCGTCT GGTCTGGGCG GACATCCTGG TTCGCTATCT CCGGCACAAG GATTTCGAGG TGCACGGGGT GGTCGGCCTT GCAGACTGGG ATGAACACCT TTCCCATCTG GCTGAACAGC AAGGCTGCAA TTTGCAGGAA ATGCGAGGGG CCATGCTCGA TCGGGCCGAG GCTTTGCTGC GCCGGCTCGG GTTGGAAACT GGCTGGCATT GTCAGGCGGC AAGCCAGTGC CGCGAGACCC AATTGGCGCT GTGCCGTGAA CTGGTCCGTA AGGGGCTGGG GTATGAAAAA TTGCGCTCGG TCTATTATGA CGTCGGCCGT GACACCGATT ACGGGGTCTT ACGACGTACC GATCTGGCGA AGCTCTCCCT GGGAAAAACT GTGGATCTCG ACCGCTATGC CAAAGATAAT CCCCGGGATT TTACATTGTT GAAACGGACC AACCTCGCCG ATCTCAAGCG GGGGGATTTC TGGAAGACAG AATGGGGCAA TGTCCGGCCG AGCTGGTATC TGCAAATGGC CGCCGCGGCC TTGGCAGAGG GTATCGCCAT GGATGTGGTC TTGGCCGGGC GAGCCCACCA TTTTCCGCAT ATGGAAAACC TCCGGGCTCT CTGGGCGGTT CGCAATGCCC TGCCTCAGGT CTGGCTCATG ACCCAGGCGG TGGAAGGAGA GGCCATCGTC CCTGATATCG AAACGGCCAG CGAACGGCTC GGCGGCATGC ATGCTTTGCG TTTGTGGTTG CTCTCTGGCG GGTATCGCAA ACCCCTGCAC TCCTCTGAGG ACAACGCGAC CATGTGGCGC CGCAATTGGG AGAGGCTACA GGAAAGCGTG GCTACGCTGC ATGTCGCCCG CGGCGAGGGT GGACAGGTGG ATCCGGGCTT TGAGCAGACG CTCTACGATG TCAAGACAAT GTTCTGGGAC CAATTGGAAG ACGACTTGGA TCTCCAGCAT TTTTGGCCTG TACTCTGGAG TTTATGCCGA ACGATCCTCA AAAAGGCCTC CCAGGGCCGA CTGGCTCCGG TCGAGGCCGC TCGAGGCTGG AAACTTGTGA CGGATCTGGA CACGATTCTC GGCGTGGTCG ATTGGCATAC ACTGCCCTTA CGTCAGGATC AGTGGCCTGA GGGGGTTCGG GATCGGATCG CATTGCGCGA ACAGGCCCGG CGCGATCGGG ATTTTGCCCG GGCCGATCTC CTGCGTCAGG AAATCGTCGC CAAAGGGTAC CGGCTTGAAG ACACCCCCCA AGGGCCGCGA GTATTTCCGG CTTGA
|
Protein sequence | MSPTPPNVLS LIGKTPLVPV HRLNPYPSVC IEAKLEKLNP GGSVKDRVAL AMVEAAERSG QLTPGKTLIE ATSGNTGIGL AMVCAVKGYT LRLLMPESAS EERRRILRAY GAQIQLTPGH LGTDGAIEEA YRMAREEPEF YVLMDQFNNP ASIAAHYETT APEIWEQTAG EVTHVVVALG TSGTAMGLSQ RLKEYNPAVE VVGVEPYAGH KIQGLKNMQE SYPPGIYERN QLDRIVRVDD EQAFGLCRAL AQKEGIFAGL SSGAALAGAL TLAAEIPSGR IVIIFPDGGE RYLSTPVFDP PAKQGMRLLD LQSGEKRHIF VQKNALGVFT PGPGPEELEQ PELWRRLVWA DILVRYLRHK DFEVHGVVGL ADWDEHLSHL AEQQGCNLQE MRGAMLDRAE ALLRRLGLET GWHCQAASQC RETQLALCRE LVRKGLGYEK LRSVYYDVGR DTDYGVLRRT DLAKLSLGKT VDLDRYAKDN PRDFTLLKRT NLADLKRGDF WKTEWGNVRP SWYLQMAAAA LAEGIAMDVV LAGRAHHFPH MENLRALWAV RNALPQVWLM TQAVEGEAIV PDIETASERL GGMHALRLWL LSGGYRKPLH SSEDNATMWR RNWERLQESV ATLHVARGEG GQVDPGFEQT LYDVKTMFWD QLEDDLDLQH FWPVLWSLCR TILKKASQGR LAPVEAARGW KLVTDLDTIL GVVDWHTLPL RQDQWPEGVR DRIALREQAR RDRDFARADL LRQEIVAKGY RLEDTPQGPR VFPA
|
| |