Gene Sros_1079 details


Gene Information

Locus tag: Sros_1079
Symbol:
ID: 8664354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1100230
End bp: 1103697
Gene Length: 3468 bp
Protein Length: 1155 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: DNA-directed RNA polymerase
Protein accession: YP_003336821
Protein GI: 271962625
COG category:
COG ID:
TIGRFAM ID:
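
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes one terminal stop codon; neither convention is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1100230
end_bp = 1103697

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (3468 bp) and Protein Length (1155 aa).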


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.715503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.276189
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCAGCCT CGCGCAACGC CTCCGCCGTA CCCGCTGGTC CCCGTCGTGT GTCTTTCGCA 
CGAATTCAGG AGCCGCTCGA AGTTCCTGAT CTTCTCGCTC TCCAGACCGA GTCCTTCGAC
TGGTTGCTCG GCAACGAGAA GTGGAAGGGG CGGGTCGAGG CGGCTCGCCA GGCCGGGCGC
AAGGACGTTC CGGCCCAGTC GGGTCTCGAA GAGATCTTCG AAGAGATCAG TCCCATCGAG
GACTTCTCCG GGACCATGTC CCTGTCGTTC CGGGATCACC GGTTCGAGCC GCCCAAGTAC
TCAGTCGATG AGTGCAAAGA CAAGGACATG ACCTACTCCG CCCCGATGTT CGTCACGGCG
GAGTTCATCA ATAACACCAC TGGTGAGATC AAGAGCCAGA CCGTGTTCAT GGGCGACTTC
CCGCTCATGA CCGGCAAGGG CACTTTCATC ATCAACGGCA CCGAGCGTGT CGTGGTCTCC
CAGCTGGTCC GGTCCCCGGG CGTCTACTTC GACCGCAGCG TCGACAAGAC CTCCGACAAG
GACCTCTACG GCTGCAAGGT GATCCCCTCC CGGGGCGCCT GGCTCGAGTT CGAGATCGAC
AAGCGTGACA GTGTCGGCGT CCGCATCGAC CGTAAGCGCA AGCAGGCCGT CACGGTCCTG
CTGAAGGCGC TCGGGTGGAC CAACGACCAG ATCCTTGAGC GTTTCGGACA GTACGAGTCC
ATGCGCGCCA CCCTGGAGAA GGACCACACG GCCGGCCAGG ACGACGCCCT GCTGGACATC
TACCGCAAGC TGCGTCCGGG CGAGCCGCCG ACCAAGGAGT CGGCACAGAC GCTGCTGGAG
AACCTGTACT TCAACCACAA GCGTTATGAC CTCGCCAAGG TCGGCCGCTA CAAGATCAAT
AAGAAGCTCG GCGTCGACAG CGAGATCACG CAGGGGACGC TGACCGAAGA GGACATCGTC
GCCACGATCG AGTACATCGT CAAGCTGCAC GCCGGCGAGA CCTCGATGGC GGGCGCCAAC
GGTGAGATCG TCGTCGAGAC CGACGACATC GACCACTTCG GCAACCGTCG CCTGCGCACG
GTCGGCGAGC TCATCCAGAA CCAGGTCCGC CTGGGTCTGG CCCGTATGGA GCGCGTCGTC
CGCGAGCGGA TGACCACTCA GGACGTCGAG GCGATCACGC CGCAGACCCT GATCAACATC
CGTCCGGTCG TCGCGTCGAT CAAGGAGTTC TTCGGAACCT CCCAGCTGTC GCAGTTCATG
GACCAGACCA ACCCGCTGGC CGGCCTGACG CACAAGCGGC GTCTGTCCGC GCTGGGCCCC
GGTGGTCTGT CCCGTGAGCG GGCCGGCTTC GAGGTCCGTG ACGTCCACCC CTCGCACTAC
GGCCGCATGT GCCCGATCGA GACGCCGGAA GGACCGAACA TCGGTCTGAT CGGCTCGCTG
GCCTCCTTCG GCCGGGTCAA CTCCTTCGGC TTCGTCGAGA CGCCGTACCG CAAGGTCCTC
GACGGCCGGG TCACCGACAC GGTCGAGTAC CTCACCGCGG ACGAGGAGGA CCGTTACGTC
ATCGCCCAGG CGAACACGCC GATCGGTTCC GATGGCACGT TCCTTGAGGA CCGCGTGCTC
GTCCGCCGTA AGGGCGGGGA GTTCGAGTCC CTGCGGGCCA ACGAGGTCGA CTACATGGAC
GTGTCGGCGC GCCAGATGGT GTCCGTCGCG ACCGCGATGA TCCCGTTCCT GGAGCACGAC
GACGCCAACC GCGCGCTCAT GGGCTCCAAC ATGCAGCGCC AGTCGGTGCC GCTGCTCAAG
AGCGAGGCGC CGCTGGTCGG CACCGGCATG GAGTACCGTG CCGCGACCGA CGCCGGCGAC
GTCATCACCG CCGACAAGGC GGGCGTGGTG GAGGAGGTCT CCGCCGACTA CGTCACCGTG
ATGAACGACG ACGGCACCCG CACGACCTAC CGTGTCGCCA AGTTCAAGCG CTCCAACCAG
GGCACCTGCT TCAACCAGAA GCCGATCGTC AAGGAAGGCG ACCGGATCGA GGTGAACCAG
GTCGTCGCCG ACGGTCCCTG CACCGACGAC GGTGAGATGG CGCTCGGCAA GAACCTGCTC
GTGGCGTTCA TGCCGTGGGA GGGTCACAAC TACGAAGACG CGATCATCCT GTCCCAGCGT
CTGGTCCAGG ACGACGTCCT CTCCTCGATC CACATCGAGG AGCACGAGGT CGACGCCCGT
GACACCAAGC TGGGCCCCGA GGAGATCACC CGGGACATCC CGAACGTCTC CGAGGAGGTC
CTGGCCGACC TCGACGAGCG CGGCATCATC CGCATCGGCG CCGAGGTCGT CCCCGGCGAC
ATCCTCGTCG GCAAGGTCAC GCCCAAGGGC GAGACCGAGC TGACCCCCGA GGAGCGGCTG
CTGCGCGCGA TCTTCGGTGA GAAGGCCCGC GAGGTCCGTG ACACCTCCCT GAAGGTGCCG
CACGGCGAGC AGGGCAAGGT CATCGGTGTC CGCGTGTTCA GCCGCGAGGA GGGCGACGAG
CTCCCTCCGG GCGTCAACGA GCTGGTCCGC GTCTACGTGG CCCAGAAGCG TAAGATCACC
GACGGCGACA AGCTGGCCGG CCGTCACGGC AACAAGGGCG TCATCTCCAA GATCCTTCCG
GTCGAGGACA TGCCGTTCCT TGAGGACGGC ACGCCGGTCG ACATCATCCT CAACCCGCTG
GGCGTGCCCG GCCGTATGAA CGTCGGCCAG GTCCTGGAGA CCCACCTGGG GTGGATCGCC
GCCCGAGGAT GGGACATCTC GGGGATCGAG GAGGCGTGGG CCGAGCGGCT GCGCGACAAG
GGCTTCGCCG AGGTCGACCC GCGCACCAAC ATGGCCACCC CGGTGTTCGA CGGTGCCAAT
GAGGAAGAGA TCGTCGGTCT GCTCGACAAC ACCCTGGTCA ACAGGGACGG CGGGCGCATG
GTCGGTGCCA ACGGCAAGGC CCAGCTGTTC GACGGCCGCT CCGGCGAGCC GTTCCCGCAC
CCGATCTCGG TCGGCTACAT CTACATCCTG AAGCTGCTCC ACCTGGTCGA CGACAAGATC
CACGCTCGTT CGACCGGCCC GTACTCCATG ATCACCCAGC AGCCGCTCGG TGGTAAGGCA
CAGTTCGGTG GACAGCGCTT CGGTGAGATG GAGGTGTGGG CGCTGGAAGC GTACGGCGCC
GCCTACGCCC TGCAGGAGCT GCTGACGATC AAGTCCGACG ACGTCCTCGG CCGGGTGAAG
GTCTACGAGG CCATCGTCAA GGGCGAGAAC ATCCCCGAGC CGGGCATTCC GGAGTCGTTC
AAGGTCCTCA TCAAGGAAAT GCAGTCGCTG TGCCTGAACG TCGAGGTGCT CTCCAGCGAC
GGCATGTCCA TCGAGATGCG CGACACCGAC GAGGACGTCT TCCGCGCCGC GGAAGAGCTC
GGCATCGACC TGTCTCGGCG TGAGCCGAGC AGCGTCGAAG AGGTCTGA
 
Protein sequence
MAASRNASAV PAGPRRVSFA RIQEPLEVPD LLALQTESFD WLLGNEKWKG RVEAARQAGR 
KDVPAQSGLE EIFEEISPIE DFSGTMSLSF RDHRFEPPKY SVDECKDKDM TYSAPMFVTA
EFINNTTGEI KSQTVFMGDF PLMTGKGTFI INGTERVVVS QLVRSPGVYF DRSVDKTSDK
DLYGCKVIPS RGAWLEFEID KRDSVGVRID RKRKQAVTVL LKALGWTNDQ ILERFGQYES
MRATLEKDHT AGQDDALLDI YRKLRPGEPP TKESAQTLLE NLYFNHKRYD LAKVGRYKIN
KKLGVDSEIT QGTLTEEDIV ATIEYIVKLH AGETSMAGAN GEIVVETDDI DHFGNRRLRT
VGELIQNQVR LGLARMERVV RERMTTQDVE AITPQTLINI RPVVASIKEF FGTSQLSQFM
DQTNPLAGLT HKRRLSALGP GGLSRERAGF EVRDVHPSHY GRMCPIETPE GPNIGLIGSL
ASFGRVNSFG FVETPYRKVL DGRVTDTVEY LTADEEDRYV IAQANTPIGS DGTFLEDRVL
VRRKGGEFES LRANEVDYMD VSARQMVSVA TAMIPFLEHD DANRALMGSN MQRQSVPLLK
SEAPLVGTGM EYRAATDAGD VITADKAGVV EEVSADYVTV MNDDGTRTTY RVAKFKRSNQ
GTCFNQKPIV KEGDRIEVNQ VVADGPCTDD GEMALGKNLL VAFMPWEGHN YEDAIILSQR
LVQDDVLSSI HIEEHEVDAR DTKLGPEEIT RDIPNVSEEV LADLDERGII RIGAEVVPGD
ILVGKVTPKG ETELTPEERL LRAIFGEKAR EVRDTSLKVP HGEQGKVIGV RVFSREEGDE
LPPGVNELVR VYVAQKRKIT DGDKLAGRHG NKGVISKILP VEDMPFLEDG TPVDIILNPL
GVPGRMNVGQ VLETHLGWIA ARGWDISGIE EAWAERLRDK GFAEVDPRTN MATPVFDGAN
EEEIVGLLDN TLVNRDGGRM VGANGKAQLF DGRSGEPFPH PISVGYIYIL KLLHLVDDKI
HARSTGPYSM ITQQPLGGKA QFGGQRFGEM EVWALEAYGA AYALQELLTI KSDDVLGRVK
VYEAIVKGEN IPEPGIPESF KVLIKEMQSL CLNVEVLSSD GMSIEMRDTD EDVFRAAEEL
GIDLSRREPS SVEEV
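
The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is a minimal sketch, not the method used to generate this record. It assumes the codon assignments of translation table 11 match the standard genetic code, and it assumes the first codon (TTG here) acts as an initiation codon and is read as methionine. The helper names `clean`, `gc_percent`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table in the usual TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading the first codon as methionine."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if protein:
        protein[0] = "M"  # assumption: the first codon is an initiation codon
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


# First and last display lines of the gene sequence above.
first_line = "TTGGCAGCCT CGCGCAACGC CTCCGCCGTA CCCGCTGGTC CCCGTCGTGT GTCTTTCGCA"
last_line = "GGCATCGACC TGTCTCGGCG TGAGCCGAGC AGCGTCGAAG AGGTCTGA"

print(translate_cds(first_line))
```

Applied to the first line of the gene sequence, this reproduces the first 20 residues of the protein sequence (MAASRNASAV PAGPRRVSFA). Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.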