Gene Sros_6042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6042 
Symbol 
ID8669336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6620911 
End bp6622860 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content69% 
IMG OID 
Productexcinuclease ABC, C subunit 
Protein accessionYP_003341518 
Protein GI271967322 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGCCG ATAGGCTTTC TGGTGTGGCG AGATCTGTGG TTGGTCCGGC GAGTTTCCGG 
CCGAGTCCGG GGTCGATCCC CGAATCACCC GGCGTCTACC GGTTCAGGGG CGCCGAGGGC
CGGGTCATAT ACGTCGGCAA GGCCAAGAAC CTGCGCCAGC GGCTGAACTC CTACTTCGCC
GACTTCGCCG GGCTCCATCC GCGCACCCAG ACCATGCTCA CGACCGCCGT CGACGTCGAC
TGGACGGTCG TCGGCACCGA GGTCGAGGCG CTCCAGCTCG AATACTCCTG GATCAAGGAG
TACGACCCGC GGTTCAACGT CAAGTACCGC GACGACAAGT CCTACCCCTA CCTCGCGGTC
ACCATGGGGG AGGAGTTCCC CCGGGTCCAG GTGTTGCGCG GTGCCAAGCG CAAGGGCACC
CGCTACTTCG GCCCCTACTC CCACGCCTGG GCGATCAGGG AGACGGTCGA CCTGCTGCTG
CGGGTCTTCC CGATCAGGAG CTGCTCGGCG GGGGTGTTCC GGCGTGCCGG GCAGATCGGC
CGGCCCTGCC TGCTGGGCTA CATCGACAAG TGCTCGGCGC CCTGCGTCAG CAGGGTCACC
GCCGTGGAGC ACCGGACGCT GGCCGAGGAC TTCTGCGACT TCATGGCGGG CAACACCGGC
CGCTTCATGA AGCGGCTGGA GCGCGAGATG CGCCAGGCCG CCGCCGATCA GGAGTACGAA
AGGGCCGCCC GGCTGCGCGA TGACATCCAG GCGCTGCTGA GAGCCATGGA GAAGCAGGCC
GTGGTGCTGG GGGAGGGCAC CGACTGCGAC GTGATCGCAC TGGCGGAGGA TCCGCTTGAG
GCCGCGGTGC AGGTGTTCTA CGTCCGCGGC GGGCGCATCA GGGGCCAGCG CGGATGGGTG
GTCGACAAGG TCGAGGAAGC CGGGCCCGGC GAGCTGGTCG AGCAGTTCCT GCTGCAGATG
TACGGCGACG CGACCCCCGA GGCGCTGCCC AGGGAGGTGC TGGTGCCCGT GCTCCCGCCC
GACGCGGAGG CGGTCGCCGA GCTGCTCAGC GAGCACCGCA AGGCCCGGGT GGAGATCCGG
GTCCCGCAGC GCGGAGACAA GAAGTCGCTG ATGGAGACCG TCTGGCGCAA CGCCATGGAG
TCCCTGAAGC AGCACAAGCT CAAACGGGCC AGCGACCTGA CCACCCGGAG CAAGGCCCTG
CAGGAGGTCG CCGACGCGCT CGACCTGGAC CAGGCGCCGC TGCGCGTCGA GTGCTACGAC
GTCTCCCACA TCCAGGGCAC CGACGTGGTC GCCTCCATGG TGGTGTTCGA GGACGGCCTG
GCCCGCAAGA GCGAATACCG CAGGTTCTCC GTCCGCTCCA CCGAGGGCGA CGTCGCCTCG
ATCTACGAGG TGATCTCGCG CCGGTTCAAG CGGTATCTGG AGGAGCGCTC GGCCACCGGT
GAGCTCGACG CCGAGCCTGG CGCCGAACCG GGCATCGACC CGGAGACCGG CAGACCGCGC
AAGTTCGCCT ACCCGCCCAA CCTGGTCGTC GTCGACGGCG GTCCGGCCCA GGCGGCGGCC
GCCCAGCGCG CGCTGGACGA GCTGGGCGTG GACGACGTGG CGGTCTGCGG CCTGGCCAAG
CGGCTGGAGG AGGTCTGGCT TCCCGGCGAG GACCTGCCGG TAATCCTTCC CCGGTCGAGC
GAGGCCCTTT ACCTCCTGCA GCGCGTGCGT GACGAGGCCC ATCGATTCGC CATCACCTAC
CATCGTTCGA AGCGGTCGAA GAGCGTCAAG GAAAGCGCCC TGGACGAGGT TCCCGGGCTC
GGTCCGGCCC GCAGGCAGGC CCTGATCAAG CATTTCGGAT CGGTGAAGCG GTTACGGGAG
GCGACCGCGG CCCAGATCTG CGAGATTCCC GGGATCGGCC CCGCTACGGC CGAGACGATC
GTGTCGGCCT TGAAGGGGGA GAACCCATGA
 
Protein sequence
MKADRLSGVA RSVVGPASFR PSPGSIPESP GVYRFRGAEG RVIYVGKAKN LRQRLNSYFA 
DFAGLHPRTQ TMLTTAVDVD WTVVGTEVEA LQLEYSWIKE YDPRFNVKYR DDKSYPYLAV
TMGEEFPRVQ VLRGAKRKGT RYFGPYSHAW AIRETVDLLL RVFPIRSCSA GVFRRAGQIG
RPCLLGYIDK CSAPCVSRVT AVEHRTLAED FCDFMAGNTG RFMKRLEREM RQAAADQEYE
RAARLRDDIQ ALLRAMEKQA VVLGEGTDCD VIALAEDPLE AAVQVFYVRG GRIRGQRGWV
VDKVEEAGPG ELVEQFLLQM YGDATPEALP REVLVPVLPP DAEAVAELLS EHRKARVEIR
VPQRGDKKSL METVWRNAME SLKQHKLKRA SDLTTRSKAL QEVADALDLD QAPLRVECYD
VSHIQGTDVV ASMVVFEDGL ARKSEYRRFS VRSTEGDVAS IYEVISRRFK RYLEERSATG
ELDAEPGAEP GIDPETGRPR KFAYPPNLVV VDGGPAQAAA AQRALDELGV DDVAVCGLAK
RLEEVWLPGE DLPVILPRSS EALYLLQRVR DEAHRFAITY HRSKRSKSVK ESALDEVPGL
GPARRQALIK HFGSVKRLRE ATAAQICEIP GIGPATAETI VSALKGENP