Gene Sros_3951 details


Gene Information

Locus tag: Sros_3951
Symbol:
ID: 8667241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4397733
End bp: 4398833
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003339604
Protein GI: 271965408
COG category:
COG ID:
TIGRFAM ID:
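
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database tool.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_len, prot_len = cds_lengths(4397733, 4398833)
print(gene_len, prot_len)  # 1101 366
```

Under these assumptions the computed values agree with the Gene Length (1101 bp) and Protein Length (366 aa) fields in the record.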


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0100165
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0185065
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCGGC TGGACGAGCT GCGCGGACCG GTACCGTCCC GCGCGCACGA CGCCCGCGCC 
ATCGCGGCGC TGGCCGCCAA CCCCGGCTGC GCCCGCCGGG CGCTGATGGA CGCCGCTGGG
GTGGACAAGG ACCGTACGGC CCGGCACCTC GGCTTCCCGG CCCCCTTCGG CCAGTCGCAG
TTCGCGATCA CGCGCGGCAA CGTGTTCGAG GCGCTGGTGA AGGAGAACGG CTGCGCCGAG
CTGCTGCGGC TCCTGCGCGA GCTGCTCGGC CTGCCCGTGG CCCAGGTCGG CTACCAGGAC
GTGGAGAGCG TCGGCTCCCA CCTGCGCCAC TCCCACACCC GGACGCTGAT CGACCGGGCG
GCGCGCGAGA ACGACGACGC GGCGGTCTTC TACGACCACC CGCTGTTCAG CCTGGAGATC
GCCGGGCACA CCTCCTACCT GGAGCCCGAC GTGGTGGCCT TCCAGCTCGG GGGGCGCTTC
CGCATCGTGG AGATCAAGTC GTTCGGCGTG ATCGACGGCC AGGCCGAGCC CGAGAAGGTC
GCCGCCGCGG CCAGGCAGGC CGCAGTCTAC GTCCTGGCGC TGCGCACGCT CCTGGCCGAC
CTCGGGCACG ACCCCGAGCG CGTCTCCCAC GACGTGGTGC TCGTCTGCCC GGAGAACTTC
GCCAACCGGC CGACCGCGAC GCTGGTGGAC GTGCGCAAGC AGCTCGCCGT GCTCAAGCGG
CAGCTCGCCC GGATGACCCG GGTGGACCGC CTGCTCGAAG GGCTCCCCCA GGGGCTCACC
TTCGACCTGG CCCCCGACGC GGACGGCGTG CCCACCCGGT CGGCGGAGGA GCTGGCCGGC
GCGCTGTGCC AGGTGCCCGC CCGCTACGCG CCCGACTGCC TGTCCACCTG CGACATGTGC
ATGTTCTGCC GTGACGAGGC CCGTGGCTGC GGCTCCACCG ACCTGCTGGG CCGCCAGGTC
CGCGACCAGC TCGGCGGCGT CTCCCTGATG ACCGAGGCCC TCGGCCTGGC CGAGGGCACC
GTCGAACCCG CCGAGGGCCA GGAGGAGGTC GCCCGCCTGC TCCGCCTGGC GGACCGCCTG
CGCGAGGAGT GCCTGAATTG A
 
Protein sequence
MNRLDELRGP VPSRAHDARA IAALAANPGC ARRALMDAAG VDKDRTARHL GFPAPFGQSQ 
FAITRGNVFE ALVKENGCAE LLRLLRELLG LPVAQVGYQD VESVGSHLRH SHTRTLIDRA
ARENDDAAVF YDHPLFSLEI AGHTSYLEPD VVAFQLGGRF RIVEIKSFGV IDGQAEPEKV
AAAARQAAVY VLALRTLLAD LGHDPERVSH DVVLVCPENF ANRPTATLVD VRKQLAVLKR
QLARMTRVDR LLEGLPQGLT FDLAPDADGV PTRSAEELAG ALCQVPARYA PDCLSTCDMC
MFCRDEARGC GSTDLLGRQV RDQLGGVSLM TEALGLAEGT VEPAEGQEEV ARLLRLADRL
REECLN