Gene Sros_8226 details


Gene Information

Locus tag: Sros_8226
Symbol:
ID: 8671554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 9074655
End bp: 9075878
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003343618
Protein GI: 271969422
COG category:
COG ID:
TIGRFAM ID:
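
The start and end coordinates, the gene length, and the protein length listed above are consistent with one another. The short Python sketch below walks through that arithmetic. The variable names are illustrative, and the 1-based, inclusive coordinate convention is an assumption (the record does not state it), though it is the one that reproduces the listed 1224 bp.

```python
# Values copied from the record above.
start_bp = 9074655
end_bp = 9075878

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon and does not
# contribute a residue to the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1224 0 407
```

A remainder of 0 confirms that the gene length is a whole number of codons, as expected for a CDS that is neither spliced nor a pseudogene.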


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCTCGGC GCGCGCACAT AGCCATGGTC AGCATCCCGG CCCCCGGCCA CGTCAACCCC 
AGCCTTGAGG TGATCAGGGA GCTGGCGGCC CGGGGCCACC GGGTGACCTA CGCCAACGAC
CCGTCGTTCG CCGAGGTGAT CGAGGGGGCC GGGGCCGAGC CCGTGCCGTA CGCCTCGACG
CTGCCCATGA ACGATCCCGA CGGCTGGCCG GAGGACACGA TCGCCCAGCT CGACGTCTTC
CTCAACGACT CGATCTCCAT GCTGCCGCAG CTCCGCGCCG CCTACGGCGA CGACCGGCCC
GACCTGTTCC TCTACGACAT CGCCGGCTAC TCCGCGCGGA TTCTCGCCGA GAACTGGGGG
ATCCCGGCGA TGCAGCTCTC ACCGTGCTAC GTGGCGTGGG AGGGCTACGA GGAGGACACG
GCGCCGATGG TCGAGCAGCT GAAAAAGGCG CCCGGCGGGG CCGAGCACTA CCGGCGGTTC
GAGCAGTGGC TGGTGGACAA CGGCATCACC GGGACCGACT CCCAGGCGTT CGTCGGCTCT
CCGGAGAGGG CGCTCGCGCT GATCCCGAGG ATGATGCAGC CCAACGCCGA CCGGGTCGAC
CCGGCGCGGA TCACCTTCAC CGGGCCGTGC CTCAGCGCCC GCACCCACCA GGAGGCCTGG
ACCCGGCCCG AGAGCGCCGA GAAGGTGCTG CTGGTCTCCC TGGGTTCCGC CTTCACCAAC
CTGCCCGGCT TCTACCGCTC CTGCCTGGCC GCCTTCGGTG ACCTGCCCGG CTGGCACGTC
GTGCTCCAGA TCGGCAAGTT CGTCGACCCG GCCGAGCTCG GTGAGATCCC CGCCAACGTG
GAGGTGCACA GCTGGGTGTC GCAGCTGTCG ATCCTGGAGC AGGCCGACGC GTTCGTCACC
CACGCGGGCA TGGGCGGCAC CCAGGAGGGC CTGTACTGCG GCGTCCCGAT GATCGCGGTC
CCGCAGGGCG CCGACCAGTT CGACAACGCC GACAAGATCG TGGAGCTCGG CGCCGGCCGC
CGGATCGACG CCGGGCAGGC CACCCCCGAG GCGCTGCGGA CGGCCCTGCT CGAACTCACC
TCCGACCCGG AGGTCGCGCT GCGGCTGGAG AGGATCAGCG CCGAGGTCCG TGCCGAGGGC
GGCACCACCC GCGCCGCCGA CCTCGTCGAG CGACTGCTGG AGACGGCCGC GCCGACGCCG
GAGGGCAGCC TGCCTGCCGC GTGA
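
The GC content reported in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper name, and the sketch assumes GC content means the share of G and C among the A/C/G/T characters, ignoring the spaces and line breaks used for layout.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    # Keep only nucleotide letters; this drops the spaces and newlines
    # that the record uses to group bases in blocks of ten.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above (60 nt). Paste the full sequence
# here to compare the result with the GC content listed in the record.
first_row = "ATGCCTCGGC GCGCGCACAT AGCCATGGTC AGCATCCCGG CCCCCGGCCA CGTCAACCCC"
print(f"{gc_percent(first_row):.1f}%")
```

How the record rounds the figure it reports is not stated, so a small difference in the last digit would not by itself indicate a discrepancy.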
 
Protein sequence
MPRRAHIAMV SIPAPGHVNP SLEVIRELAA RGHRVTYAND PSFAEVIEGA GAEPVPYAST 
LPMNDPDGWP EDTIAQLDVF LNDSISMLPQ LRAAYGDDRP DLFLYDIAGY SARILAENWG
IPAMQLSPCY VAWEGYEEDT APMVEQLKKA PGGAEHYRRF EQWLVDNGIT GTDSQAFVGS
PERALALIPR MMQPNADRVD PARITFTGPC LSARTHQEAW TRPESAEKVL LVSLGSAFTN
LPGFYRSCLA AFGDLPGWHV VLQIGKFVDP AELGEIPANV EVHSWVSQLS ILEQADAFVT
HAGMGGTQEG LYCGVPMIAV PQGADQFDNA DKIVELGAGR RIDAGQATPE ALRTALLELT
SDPEVALRLE RISAEVRAEG GTTRAADLVE RLLETAAPTP EGSLPAA
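
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below checks this for the first and last rows of the gene sequence using only the Python standard library. The names `CODON_TABLE` and `translate` are illustrative. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, the standard assignments are enough here.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G order
# (shared by the standard code and translation table 11); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide string, ignoring whitespace."""
    nt = "".join(seq.split()).upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))


# Rows copied from the gene sequence above. Every full row holds 60 nt,
# so each row starts on a codon boundary.
first_row = "ATGCCTCGGC GCGCGCACAT AGCCATGGTC AGCATCCCGG CCCCCGGCCA CGTCAACCCC"
last_row = "GAGGGCAGCC TGCCTGCCGC GTGA"

print(translate(first_row))  # MPRRAHIAMVSIPAPGHVNP
print(translate(last_row))   # EGSLPAA*
```

The two outputs match the first 20 and last 7 residues of the protein sequence; the trailing `*` is the TGA stop codon, which is not part of the 407 aa protein.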