Gene Sros_3086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3086 
Symbol 
ID8666373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3368070 
End bp3369380 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content71% 
IMG OID 
Productglutamate-1-semialdehyde-2,1-aminomutase 
Protein accessionYP_003338778 
Protein GI271964582 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.156585 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGTCA TGACCCGCAA TCTGGACGTC AGCCGCGACC TCTACTCCAC CGCGCAGCAG 
TACCTGGCTG GAGGGGTCTC CAGCGACGCT CGCCGTACCA CCGGCGTCCC GCTCTACGTG
GACCGCGCGC AGGGCGCGCG TCTCTGGGAC GTGGACGGCA ACGCCTACGT CGACTACGTC
CTGGGCCAGG GGCCCGCGCT GCTCGGGCAC TGCCCGCCCG AGGTGGTCGA GGCGATCTCG
GACCAGGCCG CGCGGGGCAT GGTCTACTCG GCCCAGCACG CCGCGGAGGT CCGGGTGGCC
GAGCGGCTGT GCGCGATGGT GCCCTCGGCG GAACGGGTCC GTTTCAACAC CGTCGGCTCC
GAGGCGGTCC ACGCCGCCCT CCGGCTGGCG CGCGGCTTCA CCGGCCGTTC CAGGATCCTC
AAGTTCGAAG GGCACTACCA CGGCTGGCTG GACCCCGTCC TCTACAGCGT GCATCCCGCC
CTGGACCTGG CCGGCCCGGC CGACGCTCCC GTCGCGGTGC CCGGTACGGC GGGTCAGCAG
GAGGGCCACG GCGAGGACCT GATCGTCTGC CCGTGGAACG ACCTGGAGAC CCTGACCCGC
CTGATGGACC GCTACGGTGC CGAGATCGCC GCGGTGATCG CCGAACCGGT GCTCTGCAAC
ACCGGGGCGA TCCTGCCGGA CCCCGGATAC CTCGAAGCCG TACGGAAGCT CTGCGACCAG
CACGGCAGCC TGCTGATCTT CGATGAGATC ATCACCGGCT TCCGGCTGGC GCCCGGCGGA
GCGCAGGAGT ACCTCGGTAT CACCCCGGAC CTGTCGGTGT TCGGCAAGGC CATGGCCGGG
GGCATGCAGG TGTCGGCGCT CGTCGGAAAG GCGTCCGTCA TGGACCACAT CTCGACGGGG
AAGGTCGCCC ACGCGGGCAC GTTCAACTCC CAGCCCGTAG GAATCGCGGC GGCCGAGGCG
ACGCTGCGCG TTCTCGACGA GCGGCGTGAC GAGGTCTACG GCACGCTGTT CGCCCGCGGC
TCGGCGTTGA TGGAGGGCAT CAGGGCCGCG GCGGAGAAGG CGGGCGTGCC GCTGCTCGTG
GACGGTCCCG GGCCGGTCTT CCAGACCTAC TTCACCGACG CCGGCGCGGT ACGCGACTAT
CGCGACTTCG CCGCGACCGA CCGGGCGATG ATGGCCCGGC TGCACGCGGC GCTGCTCGAC
CGCGGCGTCA ACATGGTGCC GCGTGGCCTG TGGTTCCTGT CCACCGCGCA CACCGAGTCC
GACATCGACG CCACCGTCGA CGCCTTCGCC GGGGCGCTCC GGGCGCTCTG A
 
Protein sequence
MVVMTRNLDV SRDLYSTAQQ YLAGGVSSDA RRTTGVPLYV DRAQGARLWD VDGNAYVDYV 
LGQGPALLGH CPPEVVEAIS DQAARGMVYS AQHAAEVRVA ERLCAMVPSA ERVRFNTVGS
EAVHAALRLA RGFTGRSRIL KFEGHYHGWL DPVLYSVHPA LDLAGPADAP VAVPGTAGQQ
EGHGEDLIVC PWNDLETLTR LMDRYGAEIA AVIAEPVLCN TGAILPDPGY LEAVRKLCDQ
HGSLLIFDEI ITGFRLAPGG AQEYLGITPD LSVFGKAMAG GMQVSALVGK ASVMDHISTG
KVAHAGTFNS QPVGIAAAEA TLRVLDERRD EVYGTLFARG SALMEGIRAA AEKAGVPLLV
DGPGPVFQTY FTDAGAVRDY RDFAATDRAM MARLHAALLD RGVNMVPRGL WFLSTAHTES
DIDATVDAFA GALRAL