Gene Sde_2147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2147 
Symbol 
ID3967531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2738462 
End bp2739583 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content47% 
IMG OID637921237 
Productprephenate dehydratase 
Protein accessionYP_527619 
Protein GI90021792 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.707843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGATA AATCCGAGCC AATTAGTGCA GAAGAAGCGG CGCTTCTTGG TGAATTGCGC 
GTAAAAATAG ACGATATAGA TCAGCAGATT GGCGACTTAA TATGTGCTCG CGCAAATTGT
GCGGTAGAAG TTGCCCATGT AAAAAAACGC TTTTCTAATA TCACCGAGCC AAAGTTTTAT
CGCCCAGAGC GCGAAGCACA GGTATTGCGC AATGCTATGG CGCGTAACAA GGGTCCGTTG
TCTAACGAAG AGTTTGCCCG TTTATTTCGC GAAATAATGT CTGCTTGCTT GGCTTTAGAA
GCACCAGTAA AAGTGGCTTA CTTGGGGCCA GAAGGCACCT ATACCCAGCA AGCCGCACTT
AAGCATTTCG GTCATTCCGC TCAAGCTGTT TCTTTGCCCG CTATTGATGA AGTTTTCCGC
GAAGTAGCAT CTGGTGCTGC GCACTATGGT GTAGTGCCGG TAGAAAACTC CACCGAAGGC
GTGGTTACGC ACACCTTAGA TAACTTTTTA GGCAGTAGCG TAAAAATTTG TGGTGAAGTT
GTACTGCGTA TTCATCACCA CCTGTTAGTT TCTGATGTAA CACACGTACA AAATATTTCG
CGCATTTATT CTCATGCGCA GTCTTTGGCG CAATGCAGAA AATGGTTAGA TGCACATTAC
CCTCGTGCAG AGCGTATAGC GGTAAGCAGT AATGCAGAAG CAGCACGCCG AATAAAAGGC
GAGTGGAATT CAGCTGCCAT TGCAGGCGCT ATGGCGGCAG ATTTATACGG CCTTACTAGC
CACGCACAAA ATATTGAAGA CCAGCCAGAT AACTCCACGC GCTTTTTAAT TATTGGTGCA
GAAAGCGTAG GTGCAAGCGG CGAAGATAAA ACTTCTATTG TTGTGTCTAT GAAAAACGAG
CCGGGTGCGT TGCACAATTT GCTAGAGCCA TTCCATCAGC ACGGCATAGA TTTAACCCGC
GTAGAAACTC GTCCATCGCC AACCGGTGCG TGGAACTACG TGTTTTTTAT AGATTTTGCC
GGCCATGCCA GCGAGCCAGT TGCTAAAAAA GTGCTAGAGG AAGTGGGGCG CAGAGCCTCA
GATCTGAAAA TATTAGGCTC ATACCCTAAA GGCGTACTTT GA
 
Protein sequence
MSDKSEPISA EEAALLGELR VKIDDIDQQI GDLICARANC AVEVAHVKKR FSNITEPKFY 
RPEREAQVLR NAMARNKGPL SNEEFARLFR EIMSACLALE APVKVAYLGP EGTYTQQAAL
KHFGHSAQAV SLPAIDEVFR EVASGAAHYG VVPVENSTEG VVTHTLDNFL GSSVKICGEV
VLRIHHHLLV SDVTHVQNIS RIYSHAQSLA QCRKWLDAHY PRAERIAVSS NAEAARRIKG
EWNSAAIAGA MAADLYGLTS HAQNIEDQPD NSTRFLIIGA ESVGASGEDK TSIVVSMKNE
PGALHNLLEP FHQHGIDLTR VETRPSPTGA WNYVFFIDFA GHASEPVAKK VLEEVGRRAS
DLKILGSYPK GVL