Gene Dret_1607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1607 
SymbolpyrG 
ID8419437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1853628 
End bp1855268 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content59% 
IMG OID645038180 
ProductCTP synthetase 
Protein accessionYP_003198469 
Protein GI258405727 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.353292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGACCA AATATATTTT CGTGACCGGC GGTGTGCTCT CGTCCCTGGG CAAGGGACTG 
GCGGCGGCTT CTGTGGGAGC CCTTCTCAAG GCCAGGGGCC TGAAAGTAAC CATCCAGAAG
CTCGATCCAT ACATCAATGT TGACCCGGGG ACGATGAATC CCTTCCAGCA CGGTGAGGTC
TATGTCACCG AAGACGGCGC GGAAACCGAC CTCGATCTGG GCCACTATGA ACGGTACCTC
GACGAGTACA TGGCCCAGAG CAACAATTAC ACCTCCGGCA GCATCTACCA CACCGTGATC
ACTAAGGAGC GCCGCGGGGA TTATCTCGGT GGCACCGTGC AGGTCATCCC CCATGTCACG
GACCAGATCA AACAAGCTGT AACCGGCGCG GCCCGCAACG ACGAGGACGT GGTCCTCATC
GAGATCGGCG GGACGGTCGG GGATATCGAA GGGCTGCCCT TTCTGGAAGC TATCCGCCAA
TTGCGCTCGG ATCTGGGCAA GGAGAATGTG CTCTACATCC ATCTGACCCT GGTGCCGTAT
ATGCGGGCCG CCGGTGAAGT CAAAACCAAG CCGACTCAGC ACAGCGTCAA GGAATTGCGC
AGCATCGGCA TCCAGCCGGA TATCATCCTC TGCCGCACCG AGGTTCCCCT GGAGCAGGAC
ATCAAGGACA AGATCGCCCT GTTTTGCAAT GTCGAGCCCG ATGCGGTCTT CACCGCCATC
GACGTGCGCC ACATCTACGA ACTTCCGCTG TCGTTTTATA CTGAAGGATT GGATCAGAAG
ATCGCCATCA TGCTGAGGCT GCCGGCCAAA AACCCCGACC TGGCGCCGTG GCGGCAATTG
GTGGACAAGC TGCACAATCC CAAGGGCGAA GTCCGCATCG GCATCGTCGG CAAATATGTC
GATCTCAAAG AGGCCTACAA AAGCCTGCAC GAGGCGCTGA CCCACGGTGG ATTGGCCAAC
GATCTGGCCG TGGAACTGGT CTACGTCAAT GCCGAGGATC TGGAAAAAAC CAGTGATCCG
GCTTCCCTGT TGCAGGATCT CGACGGCATT CTCGTGCCCG GCGGCTTCGG CAGCCGCGGA
GTGGAAGGCA AAATCAAAGC CTGCGGCTAT GCCCGGCAGA ACAAGATTCC TTTTTTCGGG
ATCTGTCTCG GTATGCAGTG CGCGGTCATC GAGTACGCCC GCAACGTTTT GGAATTGGGC
AAAGCCCATT CCTCGGAATT CGACGCCTGC ACCCCGGATC CGGTCATCTA TCTGATCACC
GAATGGTTCG ATCACCGCCA AAACTGCATG CAAAAGCGGG ACAAGGACTC GGAAAAAGGC
GGCACCATGC GCCTTGGTGC CTACCCCTGC CAGTTGCGCC CGGACAGCAA GGCCATGCAG
GCCTACGGTC AGGAGCTGAT CCAGGAACGC CACCGGCACC GGTATGAATT CAACAACGAC
TACGGCCCCC AATTCGAGGA AAACGGCCTG GTCCTGAGCG GGACCTCCCC GGATCATGAG
TTGGTGGAAA TCGTGGAAAT GCGCGACCAT CCCTGGTTCG TGGCCTGCCA ATTCCATCCG
GAGTTCAAAT CCAACCCCAT GCGCCCGCAT CCGCTTTTCC GGGAATTCAT CCGCGCGGCC
AAGGAGCGCA AAGGAGTCTA G
 
Protein sequence
MRTKYIFVTG GVLSSLGKGL AAASVGALLK ARGLKVTIQK LDPYINVDPG TMNPFQHGEV 
YVTEDGAETD LDLGHYERYL DEYMAQSNNY TSGSIYHTVI TKERRGDYLG GTVQVIPHVT
DQIKQAVTGA ARNDEDVVLI EIGGTVGDIE GLPFLEAIRQ LRSDLGKENV LYIHLTLVPY
MRAAGEVKTK PTQHSVKELR SIGIQPDIIL CRTEVPLEQD IKDKIALFCN VEPDAVFTAI
DVRHIYELPL SFYTEGLDQK IAIMLRLPAK NPDLAPWRQL VDKLHNPKGE VRIGIVGKYV
DLKEAYKSLH EALTHGGLAN DLAVELVYVN AEDLEKTSDP ASLLQDLDGI LVPGGFGSRG
VEGKIKACGY ARQNKIPFFG ICLGMQCAVI EYARNVLELG KAHSSEFDAC TPDPVIYLIT
EWFDHRQNCM QKRDKDSEKG GTMRLGAYPC QLRPDSKAMQ AYGQELIQER HRHRYEFNND
YGPQFEENGL VLSGTSPDHE LVEIVEMRDH PWFVACQFHP EFKSNPMRPH PLFREFIRAA
KERKGV