Gene Sros_6451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6451 
Symbol 
ID8669760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp7065752 
End bp7069627 
Gene Length3876 bp 
Protein Length1291 aa 
Translation table11 
GC content78% 
IMG OID 
Productexonuclease SbcC 
Protein accessionYP_003341908 
Protein GI271967712 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0663249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0228854 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCCGC ACCGTCTGTG GATCAGCGCG TTCGGCTCCT TCCCCGCCGA GGAGGAGGTC 
GACTTCGACG CCCTGGCCGA GGCCGGGCTG TTCCTCATCC ACGGCCCCAC CGGCGCGGGC
AAGACCACCG TGCTGGACGC GCTCTGCTAC GCCCTGTACG GCAGGGTGCC GGGCAAGCGC
GACAGCGCCA AGAGCCTGCG CTGCGACCAC GCGCCCCCGG GCCGGGGGCC CAGCGTGGCG
CTGGAGGTCA CGATCAGGGG CCGGCGGCTG AAGATCACTC GATCGCCGGC CTGGCAGCGC
CCTAAGCTCA GGGGCACCGG CACCACGAGG GAGAACGAGA AGGCCCTGCT GCAGGAGCTC
ACGCCTTCGG GGGAGTGGAC CGGGCTGACC ACGCGGGTCG ACGAGGCCGG AGAGCTCATC
GGCACCCTGC TGGGCATGAA CGCCGAGCAG TTCTTCCAGG TCGCGATGTT GCCCCAGGGG
GATTTCGCGA AGTTCCTGCG CGCCGACGGC GAGGACCGCC GCAGGGTCCT CGAACGGCTC
TTCTCCGTGA GGATCTACGC CGCCGCCGAG TCCTGGCTCG CCGACCGGCG GACCGAGGCC
CACCGCGAGC AGCAGGCCCT CCGCAAGGAG GTGGACTTCG CGGTCAAGCG CCTGGAGGAG
GCCGCGGGCC CCGGCCTCCT CGCGTCGCTG GCAGGGGCCG CCTCCCCGCA GCCCCCCGGT
TCCGCGGAGG CCGGAGCGGC CGGGCTGTTC GAGCTCCCCG AGGGCGACGC CGGGCCGTCC
GCGGTGGCGG TCTCCGCGGA GGACGATCCG CTGGAGTGGG CCCGGACGCT GGAGGAGCTG
GCCGCCGAGG AGGTCGCCGG GATGAGCCGG GGCCACGGCG AGGGCGAGGA GGCGGTACGG
CAGGCACGCG CGAGGCTGGC GCACGGCGTC GCGCTGGCCG AGAGGCGGCG CCGTCACGCC
GAGGCGCTGA CGCGGAGCCG GGCGCTGGAG GAGAGCGCCG AGGAGCGGGC GGATCTGGAG
ACCATCCTGG CGGAGGCGGC CCGCGCCGAC CGGGTGCTGC CCCTGATCCA GGCCGCCGAG
CAGCGGGCCG AGGCGGCGGC CAAGGCCCAC CGGCTGGCCG CCGACGCCGT CGCCCGTGCC
CTCCCCCTCC TCCGTGACGG CCGCGACACC ACCCCGGACC GTCTGGCCAC GCTGGAGCGC
GACCGGCACC GCGAGATCAC CCGGCTGGGC GAGCTGCGCG CCGAGGAGTC ACGCCTGGCG
AAGATCCTCC AGGACCGGGA GGAGTCCGGC CGTGAGATCA CCGGGCTGAC CGCCGCGCAG
GCCGGGACCG ACGCCCGCCT GGCGGTCCTT CCCGGGCTCC GCCGCGACGT GGACGCCCGC
CTCACGGCCG CCCGCCTGGA CGCCGCCCGC ATCCCGGCCG CCGAGTCCGC CGGGGAGGGC
GCGGCCGCCC GCCTGCGGGC CGCCGAGCAC CGCGACGGCC TCGCCGCCGC GCTGGCCGCC
GCCCGCCACG ACCTGGCTGC CAGGCTCCGC GCCCTGCCTG CCCTCGCGGA CGGCGCCGGC
GTGCCCGCCG TCCGCGAGGG GGCCGAGGTC CGCGAGCTGC TGGCCTCCTG GGAGCGCGAG
CGCAGGGAGG AGAGCGCCGG GCTGGAGGGG TTGCGCGCCG ACGAGGCCCG GCTCGCCGAG
CTGGCCGGGC TGCTCGCCGC GCTCGACGTC GAGCTGCGGG AGGCGGCCGG GCAGGAGTCC
GCGCTCCGCG AGGCCCAGGA GGCGCTGCCC GCCGCACTGG CGGACGCCTC GGCGCGCCTG
GCGGCGGTCA GAGCGCAGGC GGCCGGGATC CCCGCCGCCC AGGCGGCCGC CGACGCCGCC
GCCGCCGGGC TCGACGCCGC CCGGCGGCGT GACGCCCTCC GCGTCGAGCT GGAGTCCGCG
CGCGCGGCCC AGACCGAGGC CACCGACCAC GCCCAGCGTC TCCGCGACCG CCACCTGGAC
CTCCGCCAGG CCAGGATCGA CGGCATGGCC GCCGAGCTCG CCCTCAAGCT CGCCCCGGGC
GAGCCCTGCG CGGTCTGCGG CTCACCCGAC CACCCGGCCC CCGCCGCCCC CGCCGACGCG
GCGCCCACCG CCGAGGACGA ACGCGCCGCC CAGAGCGCCT ACGACACCGC CACGGACCGG
CGCCGGACCG CCGAGAGCGC CGTCGCGGCC CTGGCCTCGC GGCTGGAGGA GGCGCTCGCC
GTCGCGGGCG AGCTGGGCGC CGACCACGCG CGGGAGGCTC TCGCCGAGGC CGGACGGGAG
CTGGCGGCCC TGACCGCCGC CGCCGGCACC GAGGCGGCGC TCAGCGCGGA GGCGGACCGC
GTCGCGGCCG AGCTGGAAGG CGTGAAGGCA CGGGCCGCGG AGACGGCCCG GCTGCTGGCC
GAGGGCCGTA CCCGCCGGGC CGGATGGCTG GCCGAGCGCG ACCGGCTGGC CGGCCGTCTG
GACGAGGCCC GTGGCGCCGA CCCCACCGTC CAGGCCCGCC GCGACCGCCT CACCGGGGAG
GCGTCCCTCC TCGCCGCCGC CCTCACCGCT GCCGTCCGCA CCGCCGAGCT GGAATCCGGC
CACCGAGAGG CGGCCGAGAG GACCGATCTC ACCCTGGAGC TGGCGGCACG CGAGCTCCGG
GAGGCCGAGC TGGCCCTGGC CAGGCTGCGG GAGTCGGCCG GGGCCGAGCC CGCGCTGGTG
GCCGAGGCCG ACCGGATCGC GGCCGAGCAG GGAGAGCTGG AGGAGCGCTC GCGCGAGTTC
GCGGTCACGC TGGCCGCCCG CCGTACGGAC GTGGACAGGC TCACCGCCGA GACCGAGCGG
CTGACCGCGC GCATCGACGG GGCCCGGGGC GAGGACCCGA CCCTGGCCGC CAGGCTGGAG
CGGCTGGCGG ACGAGGCCGA GCTGCTCAAG GAGGCGGTCG AGGCCACCCG GGAGGAGCTG
ACCACCGCCA CCGAGCTGGC GGCGGCCCGG GACCGGGCCG AGGCCGCCGT CGCCGAGGCG
GGCTTCCTCG GGCCCGACGA CGCCCGCGCG GCGGCCCGCC CTCCCGCCGA GCAGGAGGAG
AAGGCCGAAC GGCTCCGCGA GCTCGACAAG GAGCGCGCCG CGGTCGCCGC GGTGCTGGCC
GATCCGGAGC TGATCGCCGC CGCGGCCGAG GACGAGCCCG ACCTCGACGC TCTGCAGCGC
GCGCGGGACG CCGCCGACGC GGCCCACGCC GGCCTGCTCT CCGCCCGGCA CCAGGCCGAG
ACCCGCCGGG CCCGGCTGGC CGCGCTCCGC GCCGAGCTGG CCGAGTGCCT GGAGCGCTGG
CTGCCCGCCG CCGGACGGCA CCGCCTCGCC GACCGGCTCG CGGCCCTCAC GGGGGGAAAC
TCCACCGACA ACCAGTGGAA CATGCGGCTG TCGTCCTACG TGCTCGGCGA GCGCCTGCGC
CAGGTGGTCG AGTCGGCCAA CGAGCGGCTC GACCACATGT CGGGCGGACG CTACCTCCTC
GAACACCACC TCAGCCGTTC GGCGGGAGAC CGGAGCAAGT CGGGCGGAGG GCTCGGCCTG
CGGATCCTGG ACGGCTGGAC CGGGGTGGAC CGCGATCCGG CCACCCTGTC CGGCGGCGAG
AGCTTCGTCA CCTCGCTGGC GCTCGCCCTC GGCCTGGCCG ACGTGGTGAC CGCCGAGGCC
GGGGGAGTGG AGCTCGGCAC GCTCTTCGTC GACGAGGGCT TCGGCACCCT GGACGAGGAC
ACCCTCGACG GGGTGCTCGA CATCCTCGAC GGCCTGCGCG ACGGCGGCCG GGCCGTCGGC
ATCGTCAGCC ACGTCGCCGA GCTGCGCACC CGCGTCCCCG CCCAGCTGCG GGTCCGCAAG
GAACGCCACG GCTCCACCCT GTCCACGGTC ACCTGA
 
Protein sequence
MRPHRLWISA FGSFPAEEEV DFDALAEAGL FLIHGPTGAG KTTVLDALCY ALYGRVPGKR 
DSAKSLRCDH APPGRGPSVA LEVTIRGRRL KITRSPAWQR PKLRGTGTTR ENEKALLQEL
TPSGEWTGLT TRVDEAGELI GTLLGMNAEQ FFQVAMLPQG DFAKFLRADG EDRRRVLERL
FSVRIYAAAE SWLADRRTEA HREQQALRKE VDFAVKRLEE AAGPGLLASL AGAASPQPPG
SAEAGAAGLF ELPEGDAGPS AVAVSAEDDP LEWARTLEEL AAEEVAGMSR GHGEGEEAVR
QARARLAHGV ALAERRRRHA EALTRSRALE ESAEERADLE TILAEAARAD RVLPLIQAAE
QRAEAAAKAH RLAADAVARA LPLLRDGRDT TPDRLATLER DRHREITRLG ELRAEESRLA
KILQDREESG REITGLTAAQ AGTDARLAVL PGLRRDVDAR LTAARLDAAR IPAAESAGEG
AAARLRAAEH RDGLAAALAA ARHDLAARLR ALPALADGAG VPAVREGAEV RELLASWERE
RREESAGLEG LRADEARLAE LAGLLAALDV ELREAAGQES ALREAQEALP AALADASARL
AAVRAQAAGI PAAQAAADAA AAGLDAARRR DALRVELESA RAAQTEATDH AQRLRDRHLD
LRQARIDGMA AELALKLAPG EPCAVCGSPD HPAPAAPADA APTAEDERAA QSAYDTATDR
RRTAESAVAA LASRLEEALA VAGELGADHA REALAEAGRE LAALTAAAGT EAALSAEADR
VAAELEGVKA RAAETARLLA EGRTRRAGWL AERDRLAGRL DEARGADPTV QARRDRLTGE
ASLLAAALTA AVRTAELESG HREAAERTDL TLELAARELR EAELALARLR ESAGAEPALV
AEADRIAAEQ GELEERSREF AVTLAARRTD VDRLTAETER LTARIDGARG EDPTLAARLE
RLADEAELLK EAVEATREEL TTATELAAAR DRAEAAVAEA GFLGPDDARA AARPPAEQEE
KAERLRELDK ERAAVAAVLA DPELIAAAAE DEPDLDALQR ARDAADAAHA GLLSARHQAE
TRRARLAALR AELAECLERW LPAAGRHRLA DRLAALTGGN STDNQWNMRL SSYVLGERLR
QVVESANERL DHMSGGRYLL EHHLSRSAGD RSKSGGGLGL RILDGWTGVD RDPATLSGGE
SFVTSLALAL GLADVVTAEA GGVELGTLFV DEGFGTLDED TLDGVLDILD GLRDGGRAVG
IVSHVAELRT RVPAQLRVRK ERHGSTLSTV T