Gene Sros_9022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_9022 
Symbol 
ID8672364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9966572 
End bp9968638 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003344396 
Protein GI271970200 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGAT CCTCTCCCTC CCCGCCCGGC GACGGCTCGC GAGGGCCGGA CGGGGATCCG 
TCCGGCCTCA CCCCCGCGAA GCTCAGGGAG ATCGCGCTCA CCGCCCTGAT CGGCGGCGGC
GTCACCGCCC TTTTCACCCT GCTGGTGGAG GGCATGCGGG CACTGGGAAC GGGTCTCACC
CTGCTCATCA GCGCGGTGAT CGGCCTGGCC GTGGCCGTTC TCCTGACCCT GGGCCATTTC
TCCAGGTTCC TCCGACGGCT GTTGGAGCCG GTGAAGCGGG TCGTCACGCG CCGCCGTGCC
GCGGCGACGG CCGGGACCGG CGGCCCGGAT CGGCCGCCGA GCCGGAGACG TCAGCGGCTG
GAACGGGCCG TGCTCAGGCT GTCGGCCGGT TACGTCGGCG TCCTGCTCGG CGCCGCCGTC
GTCATGGTCC CGTGGGGAGC CGTCAAGGGC GGGGCGTGGC TGAGCCACAG AATCCAGCCC
CCGGACTGCG AGCGGCCGCT GGAACTGCGC GTCATCACCG CTCCCGAGAA CGTTCTCGCC
CTGCGCGAGA GCGTCGCCCT GTTCATCCGG AGCAGGCTGG TGGACGGCTG CGCGCCGTAC
GCGATCTCGG TGGGGGTCGC GCCGTCGATC GGCGAGCTGG CCTACGCCCT CGGCGACAAC
TGGTACCGCG ACGACGTGCG CCAGGAGGGG GGCGAGCCGT TCCGCCGCCT GTACGGCCCG
CGTGCCGACG CCTGGATCGC CACGAGCACC GGTGAGGCCG ATCTGGTCTC CGACGAGATG
AGGGCGGGTG GCGCCACGCT CCGCATCGGC CCGGCGGTCG CGTCCGACCG CCTGGTGATC
GGCATGATCA GCGGGCGGGC CGAGGACATC AGGGAGAGGC TCCCGCTGGA ACGGCCCGGC
AGTCACAGCC TGCGGGACCT GTGGACCGCC ATCCGATCGG AGGCCGGCAT GTCGGTGAGC
TACCCGCAGC CCGAGCTCTC CACCGCCGGG CTCGCCGCCG TCTCCGACCT CCTGCTCCTC
GACGGCGAGG CGGACGGCAC GGCGGACGGC GAGGCGGACG ATGCGGCGGA CGACGTGGTG
GATCGGGAAC AGCGGGGGAT CGTCGCCGAG AGCGTCAGCT CCCTGCTCTG CCAGTTCAAG
ACCGCGGCTC AGGGAACCGG GGGCGAGAAC CTCGCCCGGA GCCTGGCCGT GGTGGTCCCC
TTTCACAGCC TCACGGACTA CAACAACGGA AGGTTCAACG ACCCCCGCTG CCCGGAGGGG
GCGTCGAGCG GAAGGAACAA GCTCCGTGAG TTCTCCTCTC CGGGGCTCTC CAGGCTGGAC
TACCCGTTCG TCACGATCGA CTGGCCGAAG GAGCGGAGCG GGGAGCGCGC GGCGGGGCTC
GACCTGTTGC GCGGATGGCT GACCACCCAC CCGCTCTTCG GCGATGGTGC CGGGGGGAGA
CCGGAGACCG GAGCGGCGGT GCGCGGCCTG AAGGATCTGA GAGACGCCCA GAACGAGTTC
TTCGACCTCC TTCCCCGCGT CGACGCCGAG ATCGCCCTCG ACGTGTCCGG CTCGATGGCC
GCCCCGCCCC GGTCCCTGCT CGTCCGGCTG CGTGAGGCGT TTCCCGACGT CAAGCCCGTG
ATCACGCCGC GAGACGACCT CACGCTGAGC TCCTTCTCCA GGGTCGGTGG GAAGACCCGG
GTCAGGGAGC TCCAGCCGTC GGTGAGCCGT GAGGAGTTCG ACGAGCTCAC CAAGAGCGTC
ATCGGCGCGA CGGGGCGCGG CTCCGACGCC CCCGTATCGG ACATGATCAT GAATCTGAAC
GGGCGTGCCG GGTCACCCGG GCGCGCTCTC GTCGTCGTGA CCGACGGAGG CGTATTCGAC
AACGAGCGGC CGGGAGAGAG CGTCGGCAGG ACGCTCGCCC GCGCATCCCA CGTCACCGAC
CTCTACGTCC TGGCCCTCGG CGACAACGGC TGCGACCGCT CTCCCCCGCG GCGCGGGAAA
TACCGGGCCT GCGTCGAGAC CGGCACCGAC ATGCAACAGG CGCTGAGACA TCTGATCTCC
AGCATGCGCG GAGAGGCCCG GCGATGA
 
Protein sequence
MAGSSPSPPG DGSRGPDGDP SGLTPAKLRE IALTALIGGG VTALFTLLVE GMRALGTGLT 
LLISAVIGLA VAVLLTLGHF SRFLRRLLEP VKRVVTRRRA AATAGTGGPD RPPSRRRQRL
ERAVLRLSAG YVGVLLGAAV VMVPWGAVKG GAWLSHRIQP PDCERPLELR VITAPENVLA
LRESVALFIR SRLVDGCAPY AISVGVAPSI GELAYALGDN WYRDDVRQEG GEPFRRLYGP
RADAWIATST GEADLVSDEM RAGGATLRIG PAVASDRLVI GMISGRAEDI RERLPLERPG
SHSLRDLWTA IRSEAGMSVS YPQPELSTAG LAAVSDLLLL DGEADGTADG EADDAADDVV
DREQRGIVAE SVSSLLCQFK TAAQGTGGEN LARSLAVVVP FHSLTDYNNG RFNDPRCPEG
ASSGRNKLRE FSSPGLSRLD YPFVTIDWPK ERSGERAAGL DLLRGWLTTH PLFGDGAGGR
PETGAAVRGL KDLRDAQNEF FDLLPRVDAE IALDVSGSMA APPRSLLVRL REAFPDVKPV
ITPRDDLTLS SFSRVGGKTR VRELQPSVSR EEFDELTKSV IGATGRGSDA PVSDMIMNLN
GRAGSPGRAL VVVTDGGVFD NERPGESVGR TLARASHVTD LYVLALGDNG CDRSPPRRGK
YRACVETGTD MQQALRHLIS SMRGEARR