Gene Dret_0116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0116 
Symbol 
ID8417920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp154942 
End bp156075 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content58% 
IMG OID645036681 
Productchorismate mutase 
Protein accessionYP_003196996 
Protein GI258404254 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTTGA ACAACCAGGA CGAAATGACC ATGACCGATA CTTCCACTAT CTCTCTTGAA 
ACCTTGCGCG ACGCCATTGA CGGTGTCGAC CAGGAACTGC TGCACCTGCT CAACCGGCGC
GCGCAATTGA GTCTGCAGGT CGGTGAGGCC AAATCCACGA CCAAGGGCGC CATTTTCAAG
CCGTTTCGGG AGAAGGCCGT TTTAGAGCGG CTGTCCGCGC ACAATCCCGG CCCATTGCCC
CAGGACCATC TGGAATCCAT TTATCGCGAG ATTCTCTCGT CTTCGCGCGC CCTGCAACGA
CCGCAACGGG TCGTCTATCT CGGACCGGAA GGCACTTTTT CCTATTTTGC TGGGGTGCAT
GCCCTGGGCG GCAGTGCTGA ATTTCATGAC CAGCCGACAC TCGAGGATGT CTTCCACGCC
GTGTCCTGCG GACGCGCGGA ACTGGGGATC ATTCCCTTGG AAAATTCCCT GCAGGGCACG
GTGGGGCAAA GCCTGGACCT GTTTTTGATG TACGAAGTCT TTATCCAGGC CGAAGTCTTT
TGTCGTATCA GCCACAGCCT GCTCAGCTGC AACGGTGACA AGGGCGGCAT CACCACCGTC
TATTCGCATC CACAGGCATT GCAGCAATGC GGCGGTTGGT TGCGCCAACA TCTCCCTCAG
GCCAAGGTCG TGCCAGTGGA AAGCACGGCC GCGGCTGCGG CCCGAGTGAA AAACGCCTCA
GAAGCCGCTA TCGGCCACAG CGCTCTGGCC GGTTTGTACC AGCTCCAGGT CGTGGCCTCC
CATATTGAAG ATCTGCCGGA GAATTGGACC CGCTTTTTGG TCATCGGCCG TGGCGCTCCG
CCGGCAGGGA ACAGGGATAA GACCTCTCTG CTGTTTTCGG TGCCGGATAA ACCCGGGGCC
TTGGCCGGGG TCCTCAATCT TTTGGCCCGT GAAGGTGTGA ATATGCGCAA GCTGGAATCT
CGGCCCATGC GCGGTGAACG GTGGAAATAC GTCTTTTTCG CTGATCTGGA GTGCGACCTG
GGACGGGAGG AGTATACCCA GCTTTTGCAA GCCCTGGAGG CCAACTGTCA CAGTTTTCGC
GTTCTGGGGA GTTACCCCAA CGGCCAAGCC CTGGATATGG GGCGGGAGGA ATAG
 
Protein sequence
MGLNNQDEMT MTDTSTISLE TLRDAIDGVD QELLHLLNRR AQLSLQVGEA KSTTKGAIFK 
PFREKAVLER LSAHNPGPLP QDHLESIYRE ILSSSRALQR PQRVVYLGPE GTFSYFAGVH
ALGGSAEFHD QPTLEDVFHA VSCGRAELGI IPLENSLQGT VGQSLDLFLM YEVFIQAEVF
CRISHSLLSC NGDKGGITTV YSHPQALQQC GGWLRQHLPQ AKVVPVESTA AAAARVKNAS
EAAIGHSALA GLYQLQVVAS HIEDLPENWT RFLVIGRGAP PAGNRDKTSL LFSVPDKPGA
LAGVLNLLAR EGVNMRKLES RPMRGERWKY VFFADLECDL GREEYTQLLQ ALEANCHSFR
VLGSYPNGQA LDMGREE