Gene Sala_1398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1398 
Symbol 
ID4081754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1453095 
End bp1454492 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content67% 
IMG OID638009764 
Productthreonine synthase 
Protein accessionYP_616445 
Protein GI103486884 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTATA TCAGCACCCG CGGCTCAGCG CCGACCCTCG ACTTTCGTGC CGCCACGCTC 
GCGGGCCTTG CCAGCGACGG CGGGCTCTAT GTGCCGACCA AATGGCCGCG GATGTCAGCA
GATGAAATCC GCGCGCTCGC AGGCCTCGAT TATGTCGAAA CCGCGGTTCG CATCATGCGG
CCCTTCGTCG AAAGCGTTTT GACCGAGGAC GAACTGCGCG CGCTGTGCCG GGCCGCCTAT
GGCCGGTTCA GCCACGATGC GGTGACGCCG CTGGTCCAGC TCGATCATCG CCACTGGCTG
CTCGAACTCT TCCATGGGCC GACGCTGGCG TTCAAGGACG TCGCGCTGCA ATTGCTCGGC
GAATTGTTCG AGACGTTCCT GAGTGGCGGC GGCACCGACA TCACGATCGT CGGCGCGACC
TCCGGCGACA CTGGATCGGC GGCGATCGAG GCCGTGGCGG GGCGTGAACA TATCCAGATA
TTCATGCTGC ATCCGGAGGG CCGCGTCAGC GACGTGCAGC GGCGCCAGAT GACGACGGTG
CTCGCGCCCA ACGTCCACAA TATCGCGATC GACGGCAGCT TCGACGATGC GCAGGCGATG
GTGAAGCGGC TGTTCGGCGA CGAAGCGGCG CGGGGCCAGG TCAATCTGTC GGCGGTGAAC
AGCATCAACT GGGCGCGGCT GATGGCGCAG ATCGTCTATT ATTTCTATGC CGCCGTCCGC
CTCGGCGGAC CCGACCGCCC GGTGGCGTTC AGCGTACCGA CGGGCAATTT CGGCGATGTG
TTCGCGGGCT ATGTCGCGGC GCAGATGGGC TTGCCGATCG CAAGGCTCGT CGTCGCCACG
AACGTCAACG ACATCCTCCA CCGCGCGCTG ACCAGCGGCG ATTACAGCGC GGGGACGGTG
ACGCCCACCG CGACCCCCAG CATGGACATT CAGGTCAGCA GCAATTTCGA ACGGCTGCTC
TTCGATCTCG CGGGCCGCGA CGGCGCGGCC ATTGCGGGGA TGATGGCCGA GTTCGAGGCG
AAGCGCGCGA TGACCATCCC CGCCGACATG CTGGCCGGCG CGCGCGACCT GTTTTCGAGC
GCGCGCATCG ACGGCGACGC GATGGCGCTC GCGCTGCGCT GGGCGCGGGA ACGCGGCGGA
CAGATCATCG ACCCGCACAG CGCGGTGGGC CTCGCCGCGG CGCGCGCGCT GGAGATCGAC
GCCGAGATTC CGGTCGTCAC GCTCGCGACG GCGCATCCGG CCAAGTTCCG CGAGGCGGTC
GAGCGCGCGA CCGGGGTACG CCCACCACTG CCCGCGCGGC TGGGCAATCT GTTCGAGCGC
GAGGAACGCT ATACGAAGCT TCCCGGCGAC TATGATGTCG TCAAGGCCTT CATCCTCGCG
GAAGCCGCGC GTGGCTGA
 
Protein sequence
MDYISTRGSA PTLDFRAATL AGLASDGGLY VPTKWPRMSA DEIRALAGLD YVETAVRIMR 
PFVESVLTED ELRALCRAAY GRFSHDAVTP LVQLDHRHWL LELFHGPTLA FKDVALQLLG
ELFETFLSGG GTDITIVGAT SGDTGSAAIE AVAGREHIQI FMLHPEGRVS DVQRRQMTTV
LAPNVHNIAI DGSFDDAQAM VKRLFGDEAA RGQVNLSAVN SINWARLMAQ IVYYFYAAVR
LGGPDRPVAF SVPTGNFGDV FAGYVAAQMG LPIARLVVAT NVNDILHRAL TSGDYSAGTV
TPTATPSMDI QVSSNFERLL FDLAGRDGAA IAGMMAEFEA KRAMTIPADM LAGARDLFSS
ARIDGDAMAL ALRWARERGG QIIDPHSAVG LAAARALEID AEIPVVTLAT AHPAKFREAV
ERATGVRPPL PARLGNLFER EERYTKLPGD YDVVKAFILA EAARG