Gene Sare_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2081 
Symbol 
ID5706801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2393198 
End bp2394313 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content72% 
IMG OID641271567 
Productsaccharopine dehydrogenase 
Protein accessionYP_001536938 
Protein GI159037685 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1748] Saccharopine dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.484567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCA TCACGACGAT CGGCGTTCTC GGCGGCTATG GAGCCGTTGG TAGCGCGGTC 
GTGCGCCGTC TGCACCAGAG CACGGCCGCC TTACTCCTCG TTGCCGGACG AGACCTCGGC
CGGGCTGAGA AACTCGTCCG CTCCTTGGAT GCGAACGGCG CCCTTGCCGA GCCGGTGGCG
GCCGACCTTG CTGATCCGGC CGCGCTCGAC CGACTCGCGG CCCGGTGTGA CCTCCTCGTC
AACTGCGCCG GTCCGTCGTA CCGGGTTCTT GACACGGTCG CCCGAGCGGC GCTGCGCAAC
GGCGCCGACT ACGTCGACGC CGCCGGCGAC GATCCCACCT TCCTGCGTCT GACCACCGAC
GGCGGCGCGC GCGAGTGGCA GGCAGCCGGG CGGGTCGCCC TGCTGTCGGC CGGGGCTCTC
CCAGGGCTGT CCGGGCTCCT GCCGCGCCAC CTGGCCACCA CCGTCGGACG GGCCAGCCGG
CTCGACGCCT ACCTTGGCGG AGTGGCACCG CTGTCCCCGG CAGCGGCCGG GGACGTACTG
CTCAGTCGCG GGCCCGAACA CGGCACACCC GGGGCCGGCT GGCGGGACGG CGTCGTCCGC
GAACGCAGCC TCGAGCCCCG CCGCCGGCTG TCGCTTGCCG CGTTCCCCCG ACCAGTGGAT
GCCTTTCCCT TCTTGGCCAC CGAGGCCGTC CGACTCGCTC GCGCGCTCCA GATCGGCGAG
GTCAACTGGT ACACCGCCTT CGGCGGCGGC CGGCTTCCCG AGCAACTGGC ACTATCCTGG
GCCCTCGACG ACACGGACAC ATCCGAGGTC GTCAACGCGG CGGCCGAGGA CGTACGGCGC
CACGGCACGT GGTATGGCCA GGAGTTTCAC CTCTGGGACG GGAACGCCGC CGAGACGCCG
CCGCGGGTCC TGTCGCTGGC CTGCGAGGAC TCCTACGAAC TCAGCGGATT CATGGCCGCC
GCGGCGGCGA CCGCCGTCCT CGTGGGCGAG ACGCCCGCAG GAGTGCACTT CGCGGCCGAC
GTGCTCATAC CCACCGAGAT CTTCCAGGCG CTCGCCGTGG ATCCAGCAGC GACGATCAGC
CTTGACGGAC CGGGCGTGCC GGCTCTGCCA CAGTGA
 
Protein sequence
MSRITTIGVL GGYGAVGSAV VRRLHQSTAA LLLVAGRDLG RAEKLVRSLD ANGALAEPVA 
ADLADPAALD RLAARCDLLV NCAGPSYRVL DTVARAALRN GADYVDAAGD DPTFLRLTTD
GGAREWQAAG RVALLSAGAL PGLSGLLPRH LATTVGRASR LDAYLGGVAP LSPAAAGDVL
LSRGPEHGTP GAGWRDGVVR ERSLEPRRRL SLAAFPRPVD AFPFLATEAV RLARALQIGE
VNWYTAFGGG RLPEQLALSW ALDDTDTSEV VNAAAEDVRR HGTWYGQEFH LWDGNAAETP
PRVLSLACED SYELSGFMAA AAATAVLVGE TPAGVHFAAD VLIPTEIFQA LAVDPAATIS
LDGPGVPALP Q