Gene Sala_1201 details

Gene Information

Locus tag: Sala_1201
Symbol:
ID: 4080696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1241820
End bp: 1243028
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 60%
IMG OID: 638009562
Product: saccharopine dehydrogenase
Protein accession: YP_616250
Protein GI: 103486689
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1748] Saccharopine dehydrogenase and related proteins
TIGRFAM ID:
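
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the record states neither convention explicitly, and the function names are illustrative only):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, ASSUMING one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1241820, 1243028)
print(length)                     # 1209
print(protein_length_aa(length))  # 402
```

Under those assumptions the result agrees with the Gene Length (1209 bp) and Protein Length (402 aa) listed in the record.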


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.150944
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGG TTCTGGTGAT CGGCGCAGGC GGCGTCGGTT CGGTCGCGGT GCACAAGATG 
GCGATGAACT CCGACATCTT TCCCGACATC ACCCTCGCCA GCCGCCGCAA GTTCAAGTGC
GACGCGATTG CCGGGTCGGT GAAGGCGCGT ACCGGCGTCA CGATCAAGAC CGCCGAGGTC
GACGCCGACC ATATCGACGC GACCGCGGCG CTGATCCGTC AGATTGGCGC CACGCACGTC
GTCAATCTTG CGCTGCCTTA TCAGGATCTG ACGATAATGG AGGCGTGCCT TTCGACCGGC
GCGCATTATC TCGACACCGC AAATTACGAA CCGCGCGACG AGGCGAAGTT CGAATATCAC
TGGCAATGGG CCTATCACGA CCGCTTCAAG GACGCGGGCC TGATGGCGCT GCTCGGCTCG
GGCTTCGACC CCGGCGTGAC GAGCGTGTTC ACGACCTGGC TTCGCAAGCA TCATTTCGAC
CGCATCGACA CGCTCGACAT CCTCGACTGC AACGGCGGCG ATCACGGCCA GCATTTCGCG
ACCAACTTCA ACCCCGAAAT CAACATTCGT GAAGTCACCG CGGTCGCGCG CCACTGGGAA
AATGGCGACT GGGTCGAAAC GCCCCCGATG TCGGTGAAGC AGCAGTTCCA TTTCGAAGGC
GTGGGGCCGA AGAATATGTA CCTCATGTAT CATGAGGAGA TCGAAAGCCT GAAAACGCAT
TTGCCCGAAA TCAAGCGCAT CCGTTTCTGG ATGACCTTTG GCGACGCTTA TATCCAGCAC
CTTACCGTGC TCCAGAATGT CGGCATGACG CGGATCGATC CGGTGGTCTA CGAGGGCAAG
GAGATCGTTC CGCTCCAGTT CCTCAAAGCC GTGCTCCCCG AACCGGCGAG CCTTGGCGGG
ACGACGAAAG GCAAGACCAA TATCGGCGTC ATCGCGACCG GCCTTGGCAA GGATGGCAAG
GAAAAGACGC TCTACCTCTA CAATATCTGC GACCATGAGG ATGCCTATGC AGAAACGGGC
AATCAGGCGG TCAGCTACAC CACCGGCGTT CCCGCGATGA TCGGCGCCGC AATGATGGTC
ACCGGTACGT GGGGCGGCGC GGGCGTCTTC AACATGGAAC AGATGGACCC CGATCCCTTC
ATGGACATGC TGATGAAACA TGGTCTGCCG TGGCAGGTGA AGGAACTGGA CGCGCCGCTC
GATTTCTGA
 
Protein sequence
MSKVLVIGAG GVGSVAVHKM AMNSDIFPDI TLASRRKFKC DAIAGSVKAR TGVTIKTAEV 
DADHIDATAA LIRQIGATHV VNLALPYQDL TIMEACLSTG AHYLDTANYE PRDEAKFEYH
WQWAYHDRFK DAGLMALLGS GFDPGVTSVF TTWLRKHHFD RIDTLDILDC NGGDHGQHFA
TNFNPEINIR EVTAVARHWE NGDWVETPPM SVKQQFHFEG VGPKNMYLMY HEEIESLKTH
LPEIKRIRFW MTFGDAYIQH LTVLQNVGMT RIDPVVYEGK EIVPLQFLKA VLPEPASLGG
TTKGKTNIGV IATGLGKDGK EKTLYLYNIC DHEDAYAETG NQAVSYTTGV PAMIGAAMMV
TGTWGGAGVF NMEQMDPDPF MDMLMKHGLP WQVKELDAPL DF
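
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below, with hypothetical helper names and no external libraries, strips the layout whitespace and computes a GC fraction of the kind summarized in the GC content field. The demo input is only an excerpt (the first and last lines of the gene sequence), not the full CDS:

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks used for display; return upper case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Excerpt only: first and last lines of the gene sequence shown above.
excerpt = """
ATGAGCAAGG TTCTGGTGAT CGGCGCAGGC GGCGTCGGTT CGGTCGCGGT GCACAAGATG
GATTTCTGA
"""

seq = clean_sequence(excerpt)
print(len(seq))                # 69
print(seq.startswith("ATG"))   # True
print(seq[-3:])                # TGA
print(round(gc_fraction(seq), 2))
```

Pasting the complete gene sequence into `excerpt` instead would let the length be compared against the 1209 bp listed in the record.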