Gene Sala_1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1189 
Symbol 
ID4080837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1229244 
End bp1231103 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content66% 
IMG OID638009550 
Productdihydroxy-acid dehydratase 
Protein accessionYP_616238 
Protein GI103486677 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTCTT ATCGTTCGCG CACCACAACC CATGGCCGCA ACATGGCCGG TGCCCGCGGC 
CTGTGGCGTG CGACGGGGAT GAAGGACAGC GACTTCGGCA AGCCCATCAT CGCGGTGGTC
AACAGCTTCA CCCAGTTTGT GCCGGGCCAT GTTCACCTGA AAGACCTGGG CCAGATGGTC
GCGCGCGAGA TCGAGGCAGC CGGAGGCGTT GCCAAGGAGT TCAACACGAT CGCGGTCGAT
GACGGCATCG CGATGGGACA CGACGGGATG CTCTATTCGC TGCCCAGCCG CGACCTGATC
GCCGACAGTG TCGAGTATAT GGTCAATGCG CATTGCGCCG ATGCGATGGT GTGCATTTCC
AACTGCGACA AGATCACGCC GGGGATGTTG ATGGCGGCGC TGCGCATCAA CATCCCCGTG
GTGTTCGTGT CGGGCGGGCC GATGGAGGCG GGCAAGGTCG TGCTGAAGGG CAAGGAGGTC
GCGCTCGACC TGGTCGACGC GATGGTCGCC GCCGCCGACG AAAAATACAG CGACGAAGAA
GTGCTGGCAA TCGAACAGGC GGCGTGCCCG ACATGCGGAT CATGTTCGGG CATGTTCACC
GCCAATTCGA TGAACTGCCT GACCGAAGCG TTGGGGCTGT CGCTGCCGGG CAATGGCTCG
ACGCTGGCGA CCCACGCCGA CCGCAAGGAG CTGTTCCTGC GCGCCGGGCG GATCGTCGTC
GAAATGTGTC GCCGTCATTA TGAGGAAGGC GACGACAGCG TCCTGCCGCG CAATATCGCG
ACGTTCGAGG CGTTTGAAAA TGCGATGAGC CTCGACATCG CGATGGGCGG ATCGACCAAC
ACCGTGCTCC ACCTGCTCGC CGCCGCGCAT GAAGCGGGCG TCGATTTCAC GATGGAAGAC
ATCGACCGGC TGTCGCGTCG CGTGCCGTGC CTGTCAAAGG TCGCGCCGGC CAAGAGCGAC
GTGCATATGG AGGATGTCCA CCGCGCGGGC GGGATCATGG CGATCCTCGG CGAACTCGAC
CGCGCGGGGC TGCTCCACGC GCATCTGCCG ACGGTGCATA GCGCCACGCT GGGCGATGCG
CTCAACAAAT GGGACATTGC GCGCACGAAC GACCCGGAGG TGCAGAAGTT CTTCATGGCC
GCCCCCGGCG GCGTGCCGAC GCAGACGGCG TTCAGCCAGG CCCGGCGCTG GGACAGCCTT
GACCTCGACC GGATAAGCGG TGTGATCCGT TCGGCGGACC ATGCGTTCAG CAAGGACGGC
GGGCTGGCGG TGCTGAGCGG CAATGTCGCG CCCGACGGCT GCATCGTGAA GACCGCGGGG
GTCGATGAAA GCATCCTCAA GTTCAGCGGG CCGGCGAAGG TGTTCGAGAG CCAGGACGCC
GCGGTTGCGG GCATATTGAC CGGGCAGGTC GAGGCGGGCG ACGTCGTCGT CATCCGCTAC
GAAGGGCCGA AGGGCGGGCC GGGGATGCAG GAAATGCTCT ATCCGACCAG CTATCTGAAA
TCGAAAGGGC TGGGCGCCGC CTGTGCGCTC GTCACCGATG GGCGCTTTTC GGGCGGCACG
TCGGGCCTGT CGATCGGCCA TGTCTCGCCC GAGGCGGCCG AAGGCGGGAC GATCGGGCTG
GTCGAGAATG GCGACCTCAT CAACATCGAC ATTCCGTCGC GGACGATCAC GCTCGCGGTG
GCCGACAGCG TGCTCGCCGA ACGCCGCGCC GCGATGGAGG CGAAGGGTGA CGCCGCCTGG
CAGCCCGCCA AGCCGCGCCC GCGCAAGGTT TCGGTGGCGC TTCAGGCCTA TGCCGCGATG
ACGACGAGCG CCGCGCGCGG CGCGGTGCGC GACCTGTCGC AGCTGAAGGG CAAGGGATGA
 
Protein sequence
MPSYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQMV 
AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRDLI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MAALRINIPV VFVSGGPMEA GKVVLKGKEV ALDLVDAMVA AADEKYSDEE
VLAIEQAACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS TLATHADRKE LFLRAGRIVV
EMCRRHYEEG DDSVLPRNIA TFEAFENAMS LDIAMGGSTN TVLHLLAAAH EAGVDFTMED
IDRLSRRVPC LSKVAPAKSD VHMEDVHRAG GIMAILGELD RAGLLHAHLP TVHSATLGDA
LNKWDIARTN DPEVQKFFMA APGGVPTQTA FSQARRWDSL DLDRISGVIR SADHAFSKDG
GLAVLSGNVA PDGCIVKTAG VDESILKFSG PAKVFESQDA AVAGILTGQV EAGDVVVIRY
EGPKGGPGMQ EMLYPTSYLK SKGLGAACAL VTDGRFSGGT SGLSIGHVSP EAAEGGTIGL
VENGDLINID IPSRTITLAV ADSVLAERRA AMEAKGDAAW QPAKPRPRKV SVALQAYAAM
TTSAARGAVR DLSQLKGKG