Gene Dret_1978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1978 
Symbol 
ID8419823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2270304 
End bp2271524 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content63% 
IMG OID645038566 
Productglycogen synthase 
Protein accessionYP_003198840 
Protein GI258406098 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR02149] glycogen synthase, Corynebacterium family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.215698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCG GTGTGTTGAC CAATGAATAC CCCCCGCATG TGTATGGCGG GGCGGGAGTC 
CATGTTGACT ATTTAACCCG GGAATTGGCC CGCGTGGAAA ACGGCCGGCA CTCCGTCGAG
GTCCTTGCCT TCGGGGACCA GCATGTCGCT CGCAGCACGC TTCAGGTCAA TGGGGTCAAC
GGCGATCTCG GAGCCCGGCC GCAGTATCCG CAATGGGGCA AGGTGGTCGA TCCCCTGTTC
AAGAACCTGC TCATGGCCGC TCAGGCCGAG GCCTGGGACA TCGTGCACTG CCACACTTGG
TACACCCATT TTGCGGGCTG TTTGCTGCAA CAATTGCTGG GAATCCCGTT GGTGCTGACC
ACCCATTCCC TGGAGCCGCA TCGTCCCTGG AAAGCTGAAC AACTGGGCCC AGGTGGCTAC
CGGGCCTCCA CCTGGCTGGA AAAGACCGCC TACCAAAATG CCGACGGCGT GGTGGCGGTT
TCCGGTTCGA TGGCCAGCGA CGTGCAGACC CTGTACGGCG TGGCGCCTGA GCGCGTGCGG
GTCATCCACA ACGGCATCGA TCCCGAGGAA TACCATCCGG GGCAGCAGAC CGCCCCGCTG
GAAGATCTCG GCGTCGACCC CACGGTCCCC TATGTCCTGT TCGTGGGCCG GATTACCCGA
CAAAAAGGGA TCACCCATTT GCTGCGTGCC CTGGAGCAGG TCCGCTCCGG CACCCAGGTG
GTGCTCTGCG CCGCCTCACC CGATACACCG GAGATTGCGC GGGAGACCGA GGCCTTGGTC
CAGCAGCTTC GCGACCAGGG GCATTGCCGG GTGCACTGGT TCGACACCCC GATGCCCAAG
GAGCAGCTCA TACCGTTATA CGCCCATGCG GCAGTCTTTG TCTGTCCCTC CATCTACGAG
CCGTTCGGGA TCATCAATCT CGAGGCCATG TCCTGCGCCA CACCGGTGGT CGCCTCCAGT
GTCGGCGGTA TCCCGGAGAT CGTGGTCCAC GACGAGACCG GGTATCTGGT GGGATTTGAA
CCGGCGGGGA GCGAGGACAG CGATCCCAAA GATCCCGACC GGTTTGCCGC GGATTTGGCC
AAGGCCGTTA ATGCGGTCCT CGACGATCCG GAAAAGGGGG AGGGATTCGG ACGGCAGGCC
CGGCAGCGGG TGCTGAGTCA TTTCAGTTGG CGCTCTGTGG CCGCCCAGAC AATTCAATGG
TATCAGGCCC TGACCGGATA A
 
Protein sequence
MRIGVLTNEY PPHVYGGAGV HVDYLTRELA RVENGRHSVE VLAFGDQHVA RSTLQVNGVN 
GDLGARPQYP QWGKVVDPLF KNLLMAAQAE AWDIVHCHTW YTHFAGCLLQ QLLGIPLVLT
THSLEPHRPW KAEQLGPGGY RASTWLEKTA YQNADGVVAV SGSMASDVQT LYGVAPERVR
VIHNGIDPEE YHPGQQTAPL EDLGVDPTVP YVLFVGRITR QKGITHLLRA LEQVRSGTQV
VLCAASPDTP EIARETEALV QQLRDQGHCR VHWFDTPMPK EQLIPLYAHA AVFVCPSIYE
PFGIINLEAM SCATPVVASS VGGIPEIVVH DETGYLVGFE PAGSEDSDPK DPDRFAADLA
KAVNAVLDDP EKGEGFGRQA RQRVLSHFSW RSVAAQTIQW YQALTG