Gene Sros_1683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1683 
Symbol 
ID8664960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1797479 
End bp1798963 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content73% 
IMG OID 
Productputative aldehyde dehydrogenase 
Protein accessionYP_003337417 
Protein GI271963221 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGG CGACCGGCCG GCTCACCTTC GACTCCCTCG ATCCGGCCAC GGGCGCCGTC 
GTCGGCACCC ACCCGATCCA GGACGCGGCG GCCGTGCGCG CGGCCGTGGC GGAGGCCGGG
GCTGCCGCCG TCCGGTGGGC CGGGCTGGGG TGGGTCGAGC GCAGGCGGCG GCTGCTCGAC
TACAAGGCCG TCATCACCCG GAACATCGCC GGGATCGTCG CCACCGTCCA CGACGAGACC
GGCAAGCCTG AGGCCGACGC CACCCTGGAG GTCGTCCTGA CCATCACCCA TCTCGACTGG
GCCGCGCGGA ACGCGCACCG GGTGCTCGGG CCGAGAAGGG TCTCCCCCGG GATGATGGGC
GCCAACATCT CGGCCACGCT GGAGTACCGG CCGCTCGGGG TGGTCGGGGT GATCGGACCG
TGGAACTACC CGGTGTTCAC CCCGATGGGG TCGATCGCCT ACGCGCTGGC GGCGGGGAAC
GCGGTGGTGT TCAAGCCCAG CGAGCTGACC CCCGGCGTGG GCGTGCTGCT GGCGGAGCTG
TTCAGCGAGG CCGTTCCCGA GCACGGGGTG TTCCGGACGG TCACCGGGCT GGGCGAGACG
GGGGCCGCGC TGGCCGGTGA CCCGGGGGTG GGGAAGATCG CCTTCACCGG CTCGACCGCC
ACCGCCAAGC GGGTGATGGC CGCGTGCGCG GCGAACCTCA CGCCGCTGGT CGCCGAGTGC
GGCGGCAAGG ACGCGCTGAT CGTGGACGAG GGTGCCGACC TGCCGGCCGC GGCGGACGCG
GCCCTGTGGG GCGCGCTGTC CAACGCGGGC CAGACCTGCG TCGGCGTGGA GCGCGTCTAC
GTGGTGGACG CGGTCTACGA CGGCTTCATG GGGGAGCTGA CCCGGCGGGC CCGGAAGGTC
AGGGCCGGGG AGGACTACGG CCCGATCACC ATGCCGGCCC AGCTCGACGT CATCCGGCGG
CACATCCAGG ACGCGGTCGC GGGCGGCAGG GCCGTCCTCG GCGGCCCCGA GTCGGTTCGC
GCGCCCTTCG TGGACCCGGT GATCGTGGAG GACGTGCCGG AGGAGTCCGT GTCGGTCCGC
GAGGAGACGT TCGGCCCGAC CCTGACGGTC AAACGGGTCG CCGACGTGGA GGAGGCACTG
GAGAAGGCCA ACGCCTCCGC GTACGGCCTG GCCGGGACGG TCTTCTCCGG CAACGCGCGG
CGGGCCGTGG AGCTCGCGCG CCGGATGCGC GGCGGGATGA CCGCGATCAA TTCGATCATC
TCCTTCGCCG CGGTCCCCTC GCTGCCGTTC GGCGGGGTGG GCGACTCCGG CTTCGGCCGC
ATCCACGGCG CGGACGGGCT GCGGGAGTTC TCCCGCCCCA AGGCCGTCTC GCGGCAGCGC
TTCGCGCTGC CCGGAATGAA CCTCACCTCG TTCACTCGCG GCCAGAAGGA ACTTGACCGA
CTAGTCAGGC TTGTCACCTT CTTGCATGGA CGACGGAAGC GATAG
 
Protein sequence
MTTATGRLTF DSLDPATGAV VGTHPIQDAA AVRAAVAEAG AAAVRWAGLG WVERRRRLLD 
YKAVITRNIA GIVATVHDET GKPEADATLE VVLTITHLDW AARNAHRVLG PRRVSPGMMG
ANISATLEYR PLGVVGVIGP WNYPVFTPMG SIAYALAAGN AVVFKPSELT PGVGVLLAEL
FSEAVPEHGV FRTVTGLGET GAALAGDPGV GKIAFTGSTA TAKRVMAACA ANLTPLVAEC
GGKDALIVDE GADLPAAADA ALWGALSNAG QTCVGVERVY VVDAVYDGFM GELTRRARKV
RAGEDYGPIT MPAQLDVIRR HIQDAVAGGR AVLGGPESVR APFVDPVIVE DVPEESVSVR
EETFGPTLTV KRVADVEEAL EKANASAYGL AGTVFSGNAR RAVELARRMR GGMTAINSII
SFAAVPSLPF GGVGDSGFGR IHGADGLREF SRPKAVSRQR FALPGMNLTS FTRGQKELDR
LVRLVTFLHG RRKR