Gene Sare_1914 details

Gene Information

Locus tag: Sare_1914
Symbol:
ID: 5708123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2209975
End bp: 2211090
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 69%
IMG OID: 641271418
Product: alanine dehydrogenase
Protein accession: YP_001536790
Protein GI: 159037537
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0686] Alanine dehydrogenase
TIGRFAM ID: [TIGR00518] alanine dehydrogenase
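
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2209975, 2211090)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1116 371
```

Both results agree with the Gene Length (1116 bp) and Protein Length (371 aa) fields of the record.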


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.225329
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00147074
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAAGGTCG GAATCCCACG CGAGATCAAA AATCACGAGT ACCGGGTGGC GATCACCCCG 
GCCGGCGTCA ACGAGTTCAT CCGCGCCGGC CACCAGGTGT TCGTCGAATC GGGTGCGGGC
GCCGGCTCCA GCACCAGTGA CAGTGACTTC GCCGCCGCCG GCGCCACGAT CCTCACCACC
GCCGACCAGG TGTGGGAAAC CGCCGAACTG GTGCTGAAGG TCAAGGAACC GGTCGCCGAG
GAATACCACC GGATGCAGGC GGGTCAGGTG CTCTTCACCT ACCTGCACCT GGCCGCCTCC
AAAGAGTGCA CCGACGCGCT GCTCGACCGT GGGGTCACCG CCATCGCGTA CGAGACAGTC
GAACTGCCCG ACCAGTCGTT GCCCCTGCTC GCCCCGATGT CCGAGGTGGC CGGTCGACTC
GCCCCACAAG TCGGCGCGTA CCACCTGCAG GCGCAGGGTG GTGGACGCGG CGTGCTGATG
GGCGGCGTCT CCGGCGTGTA CGCGGCCAAG ACCGTCGTCA TCGGCGCCGG CGTCTCCGGC
ATGAACGCCG CCGCCATCGC GCTCGGCCTC CAGGCCGAGG TGCTGCTGCT GGACAAGAAC
GTGGCCCGGC TGCGGCAGGC CGACGCGATC TACCGCGGTC ACCTACAGAC GGTCGCCTCC
AACGCGTACG AGATCGAGCG GGCCGTGGTC GACGCGGACC TGGTCATCGG CGCGGTACTG
GTGCCCGGCG CCAAGGCCCC GACCCTGATC TCCAACGACC TGGTCGCCCA GATGAAACCC
GGCAGCCTGC TGGTCGACAT CTCCATCGAC CAGGGAGGCT GCTTCGAGGA CTCGAATCCC
ACCACGCACG CCGACCCCAC GTACCCGGTG CACCAGTCCG TCTTCTACTG CGTGGCGAAC
ATGCCCGGCG CCGTGCCACA CACCAGTACC TACGCGTTGA CCAACGTGAC CCTGCCCTAC
GCCCTGGAGT TGGCCAACCA CGGCTGGCGG GAGGCGCTGC GCCGGGACCA CGCGCTCGCG
CTGGGCTTGA ACACCCACGA CGGCCACGTC GTCTACGGGC CGGTCGCCGA CGCGCACGGC
ATGGCCACCC TGCCGCTGGC CGAGGTGCTG ACCTGA
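
The gene sequence above is printed in blocks of ten bases separated by spaces. As a minimal sketch, blocked text like this can be joined into one contiguous string and its GC fraction computed as follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a contiguous sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from the record, for illustration only.
first_line = (
    "GTGAAGGTCG GAATCCCACG CGAGATCAAA AATCACGAGT ACCGGGTGGC GATCACCCCG"
)
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Applying the same two steps to the full listing gives a value that can be compared with the GC content field of the record.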
 
Protein sequence
MKVGIPREIK NHEYRVAITP AGVNEFIRAG HQVFVESGAG AGSSTSDSDF AAAGATILTT 
ADQVWETAEL VLKVKEPVAE EYHRMQAGQV LFTYLHLAAS KECTDALLDR GVTAIAYETV
ELPDQSLPLL APMSEVAGRL APQVGAYHLQ AQGGGRGVLM GGVSGVYAAK TVVIGAGVSG
MNAAAIALGL QAEVLLLDKN VARLRQADAI YRGHLQTVAS NAYEIERAVV DADLVIGAVL
VPGAKAPTLI SNDLVAQMKP GSLLVDISID QGGCFEDSNP TTHADPTYPV HQSVFYCVAN
MPGAVPHTST YALTNVTLPY ALELANHGWR EALRRDHALA LGLNTHDGHV VYGPVADAHG
MATLPLAEVL T