Gene Sala_1521 details


Gene Information

Locus tag: Sala_1521
Symbol:
ID: 4080034
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1586256
End bp: 1587305
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 67%
IMG OID: 638009888
Product: Glu/Leu/Phe/Val dehydrogenase, dimerisation region
Protein accession: YP_616567
Protein GI: 103487006
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0334] Glutamate dehydrogenase/leucine dehydrogenase
TIGRFAM ID:
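
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1586256
END_BP = 1587305
GENE_LENGTH_BP = 1050
PROTEIN_LENGTH_AA = 349


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, for an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1
```

Under these assumptions, `span_length(START_BP, END_BP)` agrees with the listed gene length, and `expected_protein_length(GENE_LENGTH_BP)` agrees with the listed protein length.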


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.448998
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCTG TCTGGGATTT CGCCGATTTC GACGATCACG AACATGTCCA CATGTTCCGC 
GATCGCGCCA GCGGGCTGAC CGCGGTTATC GCCGTCCACT CGACATACCT TGGCCCCGGC
GCGGGCGGCG TGCGATACTG GCATTATCCG CAGCGCGCCG CCGCAATCAC CGACGCGCTG
CGCCTGTCGC GCGGGATGAG CTATAAAAAT GCGATGGCGG GGCTGCCGAT GGGCGGCGGC
AAGGGCGTGA TCCTGGCCGA CGAAGGACAG GAAAAGAGAC CCGAACTGCT CGCCGCATTC
GGGCGCGCTG TCGATTCGCT CGGCGGCGCC TATGTGACCG CCGAGGATGT CGGCATCACC
GACGCCGACA TGGTCGAGAT TTCCAGGCAG ACGAAGCATG TCAGCGGGCT GCCCGTGGCG
AGCGGCGAGG CCGGGGGCGA TCCGGGGCCG TTCACCGCGC TCGGCGTCTA CCTCGGCATC
AAGGCGGCGA TTCGCGAAGG GTTGGGCACC GACAGCGCGA AAGATGTGCG CGTCGCGATC
CAGGGTGTCG GCAGCGTCGG CGGTGGCGTC GCGCGGCGGC TCGCGGCCGA GGGCGCGAAG
CTGACCCTCG CCGACGTCAA CCTGGCGCGC GCGAAGGCGC TCGCCGAGGA ACTGGGCGCC
GAACTCGCCG ATTCGGCGGC GATCATGGAG ATCGAGGCCG ATGTGCTCAG CCCCAATGCG
CTGGGTGCGA TTCTGACCGA ACGGAGCATC GAGAAGTTGA AGGTGCCGAT CGTCGCGGGC
GGCGCAAACA ATCAGCTCGC AACCGCTCTC GACGGCCAGC GCATCCACGA CCGCGGCATC
GTCTATGCCC CCGACTATGT CATCAACGCC GGCGGGATCA TCAATGTCGC GCTGGAGTAT
CTTGGACAGG GAAGCCAGGA CGAGGTCGAA AGCCGTATCC GGCTGATCCC CGGCCGGCTC
GCCGAAATCT GGGCCGAGAG CAAGGCGAGC GGTACCCCCG CCTCGGTCGT CGCCGACCGT
ATGGCGCAAA AACTGATCGG GCGCGGGTGA
 
Protein sequence
MSAVWDFADF DDHEHVHMFR DRASGLTAVI AVHSTYLGPG AGGVRYWHYP QRAAAITDAL 
RLSRGMSYKN AMAGLPMGGG KGVILADEGQ EKRPELLAAF GRAVDSLGGA YVTAEDVGIT
DADMVEISRQ TKHVSGLPVA SGEAGGDPGP FTALGVYLGI KAAIREGLGT DSAKDVRVAI
QGVGSVGGGV ARRLAAEGAK LTLADVNLAR AKALAEELGA ELADSAAIME IEADVLSPNA
LGAILTERSI EKLKVPIVAG GANNQLATAL DGQRIHDRGI VYAPDYVINA GGIINVALEY
LGQGSQDEVE SRIRLIPGRL AEIWAESKAS GTPASVVADR MAQKLIGRG
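
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own pipeline: it builds the standard codon assignments, which translation table 11 shares for elongation codons, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that case does not arise here). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for elongation and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out sequence blocks."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
first_groups = "ATGTCCGCTG TCTGGGATTT CGCCGATTTC"
print(translate(first_groups))
```

Passing the full gene sequence block to `translate` should yield the protein sequence with its spacing removed, and `gc_percent` on the same input can be compared against the GC content field.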