Gene Dret_0117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0117 
Symbol 
ID8417921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp156078 
End bp157412 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content62% 
IMG OID645036682 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_003196997 
Protein GI258404255 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCATG TGGAGGTCAC GGTCCCGGCA AGCAAGTCCC TCTCCCATCG GGCTTTGATT 
TGCGCTGGTC TGGCACTTGG AGTCAGTCGG GTGGAAAACG TCCTGGACAG TCAGGACCTG
GACCGGACCC GTGCCTGTCT CGAAGCGCTG GGCACCCAAT TCGAGGTCGA GGCCGACGGA
CTTGTGGTCC GAGGGCGCGG CGGTATCGGC CAGGTCAATC AGGCGAGTCT CGATGTCGGC
GAATCCGGCA CCACCTGCCG CCTGCTCACG GCTGTGGCCG CAGCCGGTTC AGGGGTGTTT
TCCCTTGCCG GGCAAGGTCG GATGCACCAG CGGCCCATCG CGCCGCTGGC TTCGGCTCTT
CACCAACTCG GATGCCGCTT TGAATGGCTC GAGGCGGACG GCTTTCTGCC CTGCCGGGTG
CATAGCTCAG GGCTCAAAGG GGGGCAGACG ACAGTGGCCC TGGATGAAAG CAGCCAATTT
CTCTCCGGGT TGCTGCTGGC CTCGCCGCTG GCCTGTGATC CTCTGACTAT TGGGATCGGC
GGACAGCGGG CCGTCTCCTG GCCCTATGTG GCCTTGACTC TTGAAGTGAT GCGTTTTTTT
GGACAGGAGC CGATCCTGGA ACAAGCGCAC GGCGAGAGAT GGCACTCCGT GCCCTTTGAG
AGCAATCCCT CCATCGAGCC AAGTAAAACG CGGTTTCGTT GCCATCCCGG GGTCTACTCG
CCGCAACGCT ATCGGGTCGA GGGCGACTGG AGCAACGCGT CCTATTTCGT GGCCGCTGGT
GCCATCGGGC CCCGCCCTGT GCGGTTGCGT GGTTTGTATA AGGATTCTCG CCAGGGCGAT
CGGGTCATTG TGGACATCGT CAAACAATTC GGTGCGTACG TTGAGTGGGG GCGGGAGTCG
CTGGTCGTCG CTCCTGGACC TCTTCAGGGG CAGGAATTGG ACATGGGCCC TTGCCCGGAT
CTCGTCCCGA CGGTGGCGGT GATGGCCAGT CTGGCGGAAG GCCCCACGGT GATCAAGAAT
ATCGCGCATC TGCAGCTCAA GGAGAGCGAC CGTCTCAATG GCGTGGCCAA TGAGTTGCGC
AAGGCCGGGG CCGAGGTCAC CGTTGAAGCG GATACCCTGA CGATCATCCC CTGTCCGCTG
GGGACCAAAC CGCTGCGATT GTCGACTTAT GATGATCACC GTATGGCCAT GGCCCTTTCC
CTGTTCCAGT TGGCCGGGTT GCATCTCCAA TTAGACAATC CCGGTTGCGT GGCCAAATCC
TTTCCCCGCT TCTGGGAACA ATGGGACAAG GTCCGTCAGG CATCGGAAGG AACGTCCGAA
AGGCCTGGAA ATTGA
 
Protein sequence
MYHVEVTVPA SKSLSHRALI CAGLALGVSR VENVLDSQDL DRTRACLEAL GTQFEVEADG 
LVVRGRGGIG QVNQASLDVG ESGTTCRLLT AVAAAGSGVF SLAGQGRMHQ RPIAPLASAL
HQLGCRFEWL EADGFLPCRV HSSGLKGGQT TVALDESSQF LSGLLLASPL ACDPLTIGIG
GQRAVSWPYV ALTLEVMRFF GQEPILEQAH GERWHSVPFE SNPSIEPSKT RFRCHPGVYS
PQRYRVEGDW SNASYFVAAG AIGPRPVRLR GLYKDSRQGD RVIVDIVKQF GAYVEWGRES
LVVAPGPLQG QELDMGPCPD LVPTVAVMAS LAEGPTVIKN IAHLQLKESD RLNGVANELR
KAGAEVTVEA DTLTIIPCPL GTKPLRLSTY DDHRMAMALS LFQLAGLHLQ LDNPGCVAKS
FPRFWEQWDK VRQASEGTSE RPGN