Gene Sros_3551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3551 
Symbol 
ID8666839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3935486 
End bp3937498 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content75% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003339229 
Protein GI271965033 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0236032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGAAT TCACCGTGGA CGAGGTGATC GCGGCCGCCT CCGTCTCAGC GGGCTGGCTC 
CTCGATGCGG TCGCCCGGGA ACGCGAGCTG GAGAGCCGTT CGAATCCGGG GGCCCGCCTG
GCCGGGTGGG AGGAGGGGCA CGGCGGGAGG ACGAGCCGTC AGGCCAGGCG GGTCCGGGCG
GACATGGTGA GGCGGCTGTC CCGGGAGTCC GGCGCCGCGC GGCGGGCCGC GGCGATCTGC
CTCGCGCGCC CCTCGCTGAG CGGTTACTCC GAGGAGCTCG ACCTCGCCCT GTCCGGTTCG
AAGGGGTGGA CCGCGGTGGA GCTGGCCGCG CTGCTCACCG TCGTGCGGTC ACGGGAACTC
TGCTACTGGG ACGACGTGTG GCTGTCGCGG GTGACGGAGC TCGCGCTGGG GTTCGACGCC
GGGGAACAGG CGGTGCTGCG GGAGCCGATC ACGTCGCTGA TGTGGGCCGT GGCCGACAGC
GGCATCGAGG TCGTCAGGCG ACGTGAGCTG GAGCGTGCGA TCGGGGAGGT CCTGCGGAGG
GACCCCGGCG AGCTGTCGCC GTCGGTGCTC ATCCCGCTGG ACGGGTGGGC GGTGATCCTG
CGCGGCCACC TCGGCGACAT GCCGCCGCCG CACCTGGTGC GGCTCGTCGA CCACCTCTGT
GAGGTCCGCG GCCCGCGGCC CGCGAAGAAG TGGCGCGCGC GATGTTTGGA ACTGCTCCGG
CCCGCCGACG CCGGGGAGCT CGTGCGGGTG GCGCTCGCCG CCTTCGACCA CGTCTCCGGA
GGGGGCGATC CCAGCGGAGG AGACCTGCGG CCGATCGTCC TGGACTCCAA CGTGGACGCG
GCCCGGGGAT TCGTCTGGGC CGCCGTCCTG CTGCGGACCG GCGGCCTCGT CCCGGCGCTG
ACGGAGCTCG CCCTGCGGGC GGGCGGGGTC CGTCCGGGGG TGAGGGAGGA CCTGAAGCTG
GCGGGGGCCG CGATCAACGC GCTTGGTGAC TGCGAGGGCC CGGACGCCAT GACGGCCCTG
TGGCGGCTGC AGCGGTCGAT CCGGCATCGG GCACTGCGCA GGCAGGTCGG CACCGCGCTC
GACGCCGCCG CCGGCCGGCA GGGGATCACT CCCGGGCAGC TGCTCGAACG CGGCGTCCCC
GATCACGGGC TCGCCCCCGA CGGGACGCTC ACCCGCACGA TCGGCGACTG GACCGCCGTG
CTCGCCGTCG AGGACGCGAT GACCGTACGG CTCGGCTTCC GGGCTCCGGA CGGCACCACG
GTCCGCGCCG TGCCCGCCGA GCTCGGGGAG AGCGGCGACC TCCGGGCGCT CAAGGCCGTG
CGGAAGGAGA TCCGCCGGAC GCTGTCCGCC GAGCGCGCGC GACTGGAGGG GCTGCTCACC
GCCGACCGGA CGTGGACGTA CGAGGAATGG GCCCGCCACT ACCGGGACCA CCCGATCACC
GGCGCGGTCA CGCGCGCGCT GATCTGGGAG GCCGGAGGGG AGGGCCACCT GTCCGGCGGG
GCCGTACCGG ATGCCACGCT CCGGCTGTGG CATCCCGCGC GTGCCCGCCC GGCGGAGGTG
ACGGCCTGGC GGGAGGAGGT GACGGAGCGG CGGCTGCGCC AGCCGTTCAA GCAGGCCTTC
CGCGAGGTCT ACCTGATCAC TCCCGCCGAG GAGGAGACCC GGGTCCACTC CGACCGGTTC
GCCGCCCGCA TTGTCGACGA TCCCCGGCTG TACGCGCTGC TCAAGGAGCG CGGCTGGCGG
ACGGGCCTGC TGGGGTCCTT CGGCGGCGGC CACGGCGCCG AGGCGGCGAA GGAGCTGGCC
GAGGGCGCCT GGCGGGTCCG GTTCGGCTAC GAGACGGCGG GTGCCGGCGA GAGGTACGAG
GTGACGCGGG CGGTCATCGG CCAGGTCCGT TTCGAGCGCC GCGACGGGCG CTCCTGGCGC
GGGACCGAGC TGGCCCGGGT GCCGCCGCCG GTGTTCAGCG AGGGCATGCG GGATGTGGAC
CTGTTCGTCA CGGTCGCCGC CGTCCCCGAA TGA
 
Protein sequence
MHEFTVDEVI AAASVSAGWL LDAVAREREL ESRSNPGARL AGWEEGHGGR TSRQARRVRA 
DMVRRLSRES GAARRAAAIC LARPSLSGYS EELDLALSGS KGWTAVELAA LLTVVRSREL
CYWDDVWLSR VTELALGFDA GEQAVLREPI TSLMWAVADS GIEVVRRREL ERAIGEVLRR
DPGELSPSVL IPLDGWAVIL RGHLGDMPPP HLVRLVDHLC EVRGPRPAKK WRARCLELLR
PADAGELVRV ALAAFDHVSG GGDPSGGDLR PIVLDSNVDA ARGFVWAAVL LRTGGLVPAL
TELALRAGGV RPGVREDLKL AGAAINALGD CEGPDAMTAL WRLQRSIRHR ALRRQVGTAL
DAAAGRQGIT PGQLLERGVP DHGLAPDGTL TRTIGDWTAV LAVEDAMTVR LGFRAPDGTT
VRAVPAELGE SGDLRALKAV RKEIRRTLSA ERARLEGLLT ADRTWTYEEW ARHYRDHPIT
GAVTRALIWE AGGEGHLSGG AVPDATLRLW HPARARPAEV TAWREEVTER RLRQPFKQAF
REVYLITPAE EETRVHSDRF AARIVDDPRL YALLKERGWR TGLLGSFGGG HGAEAAKELA
EGAWRVRFGY ETAGAGERYE VTRAVIGQVR FERRDGRSWR GTELARVPPP VFSEGMRDVD
LFVTVAAVPE