Gene Sros_8297 details


Gene Information

Locus tag: Sros_8297
Symbol:
ID: 8671625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 9160210
End bp: 9161568
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003343688
Protein GI: 271969492
COG category:
COG ID:
TIGRFAM ID:
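
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 9160210
end_bp = 9161568

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1359
print(protein_length)  # 452
```

Under these assumptions the computed values match the listed Gene Length (1359 bp) and Protein Length (452 aa).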


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.512635
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACGCG AGCCCAACCG ACTGCTTCAG CGACTCATCG CTGAAGCGGG TTTCACGCAC 
AAGGGGCTGT CCCGCAGGCT CAACGATCTC GGTGTGGTCC GGGGCCTGCC GGGTCTGAAG
TATGACCACA GTTCCGTTCT GCGCTGGATC GGCGGCCAAC GGCCGAGAAA TCCGGTGCCG
GGACTGCTCG CAGAGATTTT CGCACACCGG CTCGGCAGGA CGGTCAGCTC GGAAGACCTC
GGTCTGCCGG TCGTCGCGAC ACCTCCCGAC CTCGGACAGG AGTTCACACA CACGTGGCAG
GAGGGGATCG CGACCGTGAC GGCACTGTGG CGGGGAGATG TTGAGAGACG CAGGTTCCTG
ATCGACTCGA CCTTCGCGAT CGGGGCCGGG GCCACCGGCG CCTTGCGCTG GCTGACCCTT
CCCCTGGAGG GCCGTCCCGT CGCGGGCGGC GCCCGGCGGG TGGGCATGGC CGACATCGCC
GCGATCCGGG AGGTCACCCG GTCCTTCGGC GAGCTGGACA ACAGGTTCGG CGGCGGCAGG
GTCCGCTCGG CCGTCGTGAA ATATCTGGAC ACGGCGGTCG CGCCGCTGCT CAGCGAGGGC
TCCTACGGCG AGGGCACCGG CAGGGCGCTG GCGTCCGCCG CGGCCGAGCT GACCCGGCTG
GCCGGGTGGA TGGCCTACGA CCTGGAGCAG CACGGTCTGG CCCAGCGCTA CCTGATCCAG
GCGCTGCGCC TGGCCCGGGG GGCGGGGGAC CACGGGCTCG GCGGGGAGAT CCTCGCCGGG
ATGAGCCATC AGGCCCTATA TATAGGACAG CCGGCCCACG CTCTCGACCT GGCGCGGGCC
GCCCAGCTGT CGGCCCGCCG CGCCGGGGTC TACGCCCTGC TGGCGGAGTC GCACGTGCTG
GAGGCCCACG GCCACGCCCT GATGGACGAC CGGGGAGCGT GCGCCAACTC CCTGCATGCG
GCCGAGCTGG CCTTCGACCA GCGCGAGGCC GGTGAGGAGC CCGACTGGAT CGCCTACTTC
GACGAGGCGT ACCTGTCGGC CAAGTTCGCC CACTGCTTCC GCGATCTGGG CGACGGGCCG
GGCACCGTAC GGCACGCGAC GCGGTCGCTG GACATGGACG GGCGCTACGT CCGCGGCCGC
ATGTTCAACC TCTCGCTGCT GTCGGCGGGG CTGCTCGGGT GCGGCGAGCT GGAGCAGGCC
TGCGTGGCGG CCGGCCAGGC GCTGGAGCTG GCCGGGGGGC TGCAGTCGGC CCGGACCCAG
TCGTACGCGT CCGACCTGCG GCGGCGGCTG GACCCCTTCG CCGGCGAGCC CGCCGTGAGG
GAGCTGAACG AGCGGGCCAG GGAGCTGAGC CCCGCCTGA
 
Protein sequence
MEREPNRLLQ RLIAEAGFTH KGLSRRLNDL GVVRGLPGLK YDHSSVLRWI GGQRPRNPVP 
GLLAEIFAHR LGRTVSSEDL GLPVVATPPD LGQEFTHTWQ EGIATVTALW RGDVERRRFL
IDSTFAIGAG ATGALRWLTL PLEGRPVAGG ARRVGMADIA AIREVTRSFG ELDNRFGGGR
VRSAVVKYLD TAVAPLLSEG SYGEGTGRAL ASAAAELTRL AGWMAYDLEQ HGLAQRYLIQ
ALRLARGAGD HGLGGEILAG MSHQALYIGQ PAHALDLARA AQLSARRAGV YALLAESHVL
EAHGHALMDD RGACANSLHA AELAFDQREA GEEPDWIAYF DEAYLSAKFA HCFRDLGDGP
GTVRHATRSL DMDGRYVRGR MFNLSLLSAG LLGCGELEQA CVAAGQALEL AGGLQSARTQ
SYASDLRRRL DPFAGEPAVR ELNERARELS PA
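
Both sequences above are printed in space-separated groups of ten characters, so the grouping has to be stripped before any analysis. The following is a minimal sketch, not the database's own tooling: it removes the whitespace, computes a GC fraction, and translates DNA with the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the printed sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# Standard codon assignments, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence codon by codon, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First printed line of the gene sequence above (60 nucleotides).
first_line = "ATGGAACGCG AGCCCAACCG ACTGCTTCAG CGACTCATCG CTGAAGCGGG TTTCACGCAC"
print(translate(first_line))  # MEREPNRLLQRLIAEAGFTH
```

The translated fragment matches the first twenty residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` is one way to compare against the GC content reported in the record.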