Gene Sala_1842 details


Gene Information

Locus tag: Sala_1842
Symbol:
ID: 4082020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1935505
End bp: 1936755
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 67%
IMG OID: 638010217
Product: threonine dehydratase
Protein accession: YP_616887
Protein GI: 103487326
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01127] threonine dehydratase, medium form
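
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length of an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption).
    return gene_len_bp // 3 - 1


length = gene_length_bp(1935505, 1936755)
print(length)                     # 1251
print(protein_length_aa(length))  # 416
```

Both values agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields of this record.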


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.53937
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATC ACCTTACCCT GACCTCCGCG GCCGATTTTC CGGCGATCAC ATTGGACGAT 
GTGCGCGCGG CGGCGGGGCG AATTAACGGC GCGGTGGTGC GCACGCCGAC GCTCCATTCG
CAGACGCTGT CCGAGCTGGT CGGCGCCGAG GTGTGGCTGA AGTTCGAGAA TCTGCAATTC
ACCGCGGCCT ACAAGGAACG CGGGGCATTG AACGCGCTGC TGCTGCTCAG CGAAGAAGCC
AGGGCGCGCG GCGTGATCGC GGCGTCGGCG GGCAATCATG CGCAGGGGCT GGCCTATCAC
GGCAAAAGGC TGGGCGTGCC CGTCACCATC GTGATGCCCA GCACGACGCC GCAGGTGAAG
GTGTCGCAGA CCGCGGGCCA CGGCGCGACG ATCGTGCTGC ACGGCGAAAA GTTCGACGAC
GCCTATGCCC ATGCGCGCGA GCTGGAAGAG GAACGCGGGC TTACCTTTGT CCACCCCTTC
GACCATCCGC ACGTCGCGGC GGGGCAGGGG ACGGTGGCGC TCGAAATGCT TGAGGATGTG
CCTGAACTCG ACACGCTGAT CGTGCCGATC GGCGGCGGCG GGCTGCTCGC GGGCATGGGC
ACCGCGGCGC GCGGGATCAA GAACGACATG CGCCTCGTCG GCGTGCAGGC CGAGCTTTAT
CCCTCGATGT ACGCCGAGCT GAACGGCGTC GACATGGCGT GCGAGGGCGA TACGCTGGCC
GAGGGCATCG CGGTCAAGGA ACCGGGGAGC TATACGCGCA AGCTCGTCGC CGAGCTCAAC
GACGACATCG TGCTCGTCGC CGAACGGCAT CTGGAGCGCG CGGTGAGCCT GTTGCTCCAG
ATCGAAAAAA CGGTCGTCGA GGGCGCGGGC GCCGCGGGCC TCGCGGCGAT GCTCGCGCAC
CCCGACGAGT TCGCCGGGCG CAAGGTCGGA CTGGTGCTCA CCGGCGGCAA TATCGACACG
CGCCTCCTCG CGAACGTGCT GCTGCGCGAC CTTGCGCGAT CGGGCCGCAT CGCGCGGCTG
CGCATCCGCT TGCAGGACCG CCCCGGCGCG CTGTTCAAGG TGATGAAGCT GTTCGACGAG
AAACAGGTCA ACATCATCGA AATCTATCAC CAGCGCATCT TCACGACGCT CCCCGCGAAA
GGGCTGATCA CCGACATCGA ATGCGAAGCG CGCGACCGCG AGCATCTCGA CAGCCTCGTC
ACCGCTCTGC GCGACGCGGG CTATATGGTG ACGACGGTCG AGCTGGCTTA G
 
Protein sequence
MTDHLTLTSA ADFPAITLDD VRAAAGRING AVVRTPTLHS QTLSELVGAE VWLKFENLQF 
TAAYKERGAL NALLLLSEEA RARGVIAASA GNHAQGLAYH GKRLGVPVTI VMPSTTPQVK
VSQTAGHGAT IVLHGEKFDD AYAHARELEE ERGLTFVHPF DHPHVAAGQG TVALEMLEDV
PELDTLIVPI GGGGLLAGMG TAARGIKNDM RLVGVQAELY PSMYAELNGV DMACEGDTLA
EGIAVKEPGS YTRKLVAELN DDIVLVAERH LERAVSLLLQ IEKTVVEGAG AAGLAAMLAH
PDEFAGRKVG LVLTGGNIDT RLLANVLLRD LARSGRIARL RIRLQDRPGA LFKVMKLFDE
KQVNIIEIYH QRIFTTLPAK GLITDIECEA RDREHLDSLV TALRDAGYMV TTVELA
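
The relationship between the gene sequence and the protein sequence above can be illustrated with a short script. This is a sketch under stated assumptions, not the method used by the source database: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons), it strips the spaces that split the sequence into 10-character blocks, and it stops at the first stop codon. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon-to-amino-acid assignments of the standard code, in TCAG order.
# Translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon until the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGACCGATC ACCTTACCCT GACCTCCGCG"
print(translate(first_blocks))  # MTDHLTLTSA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field of this record.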