Gene Sros_3887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3887 
Symbol 
ID8667177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4331982 
End bp4333142 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content75% 
IMG OID 
ProductSaccharopine dehydrogenase (NAD(+), L-glutamate- forming) 
Protein accessionYP_003339547 
Protein GI271965351 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.011216 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGACG ACCGCCCGTA CGACATCGTG CTCTTCGGCG CCACCGGGTT CACCGGAGCG 
CTGACCGCGC AGTACCTCGC GCGCAACGCC AGTCCTGGCT GCCGCTGGGC ACTGGCCGGT
CGCAGCCGGA CCAAGCTCGA AGCGGTGAGG GAGCGCATCG GCCTGCCCGA GCTGCCGCTG
CTCCACGCCG ACGCGACGGA TCCGGCCTCC CTGGCGCGGA TCGCCGGGCA GGCCAGGGTC
GTCGCCACCA CCGTGGGCCC CTACGTCGCC TACGGCGAGC CGCTCGTGGC CGCCTGCGCG
GCCGCGGGCA CCCACTACGC CGACATCACC GGCGAGCCGG AGTTCGTCGA CCTCATGTTC
GCCCGGCACC ACGAGAGGGC CAGGCGGAGC GGGGCGAAGA TCGTGCACGC CTGCGGGTTC
GACTCCATCC CGCACGACCT CGGCGCCTAC TTCACCGTCA ACCGGCTCCC CGAGGGGGTG
CCGATCGAGG TGAGCGGGTT CCTCCGGGGG AACGGCCGGC CCTCGGGCGG CACCGTCCAC
TCCGCCCTCG CGGCGGTCTC CCGGGCCCGG CAGACCGCTC GGGCCGCGCT CGCCCGGCGC
GAGGTCGAGG AGCGCCCCCA AGGCCGGCGG GCGCGTGGCA CCGCCGGACC GCCCCGGTAT
GTCGGAGGCT GGGCCCTGCC GCTGCCCACG ATCGACCCGC AGATCGTGGC GCGCTCGGCC
CGCGCGCTGG AGCGCTACGG CCCCGACTTC ACCTACCGCC ACCACATCGC CGTCAGGCGG
CTGCCCGCCG CGCTGGGGCT CGTGGCGGGC GCGGGCGCCC TCGTCGCGCT CGCCCAGATC
CCCCCGGTCC GCTCCTGGCT GCTCGGCCGG ATCTCGCCCG GTGACGGACC CACCCCCGAG
CAGCGGGCCG GGAGCTGGTT CAAGGTCACC TTCCTCGGCC TGGGTGGCGG CGAGCGCGTC
GTCACCGAGG TCGCGGGCGG CGACCCCGGC TACGACGAGA CCGCCAAGAT GCTCGCCGAG
TCGGCGCTCT GCCTCGCCCT CGACGACCTG CCGCCGGTCT CCGGCCAGGT CACCACGGCC
GTGGCGATGG GAGACGCGCT GATCGAGCGG CTCCGGCGGG CGGGCATCAC CTTCACCGTG
CTGAGCGGCC CGCCGAAGTA G
 
Protein sequence
MSDDRPYDIV LFGATGFTGA LTAQYLARNA SPGCRWALAG RSRTKLEAVR ERIGLPELPL 
LHADATDPAS LARIAGQARV VATTVGPYVA YGEPLVAACA AAGTHYADIT GEPEFVDLMF
ARHHERARRS GAKIVHACGF DSIPHDLGAY FTVNRLPEGV PIEVSGFLRG NGRPSGGTVH
SALAAVSRAR QTARAALARR EVEERPQGRR ARGTAGPPRY VGGWALPLPT IDPQIVARSA
RALERYGPDF TYRHHIAVRR LPAALGLVAG AGALVALAQI PPVRSWLLGR ISPGDGPTPE
QRAGSWFKVT FLGLGGGERV VTEVAGGDPG YDETAKMLAE SALCLALDDL PPVSGQVTTA
VAMGDALIER LRRAGITFTV LSGPPK