Gene Sros_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0939 
Symbol 
ID8664212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp961112 
End bp962548 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content75% 
IMG OID 
Producthydrolase of the alpha/beta superfamily-like protein 
Protein accessionYP_003336686 
Protein GI271962490 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGACA TCCTGCGAGG CGTCCGTTCC GCCGCCCTCG GCAACTCCCG TGACGTCTTC 
GTCTACTCGC CGCCCGGCTA TGACCCCGCC GCCGGGCCGT ACCCCGTGGT CTACCTCCAC
GACGGCCAGC ACGTGTTCGC CGGGGCCGGG CTTGAGCCGA GCTGGCGGCT CGACGAGACG
CTGGACGAGC TGATCGGCGG CGGCGCGCTG CCGCCGCTGG TCGCGGTCGG GGTGTCCAAC
GCCGGTGACG AGCGCGGCGC GGAGTACTCC CACGCCGTGC CCTACCCCCG CGACCCTCGC
GGCCTGTCCA AGGGCGAGCT GTACGAGCGG TTCCTCATCG AGGAGCTGAT GCCGCTCATC
CGGTCCCGGT ACGCGGTGAC CGGCTCCGCC GGGAGCACCG CCGTCATGGG CTCGTCGATG
GGCGGGCTGG TCAGCTACCA CCTGGCCTTC CGGCGTCCCG ACGTGTTCGG GCTGGCGGCG
ATCCTCTCGC CGTTCCTGGT GTTCGTCGAC CCCGAGACGC TCACGGAGAC GCCCGTCTAC
CGGCGCTTCA CCGAGCGCGG GCCCGGCCGG GTATGGGTCG ACATCGGCGG GATGGAGGGG
TTGATCACGG TCCGCCATGC CCGTGAGCTC GCCGCGCAGC TCGTCGGGCT CGGGTACGCG
CCCGATACGG AGCTGCGCTA CCGGCACGAG CCGGACGCGC CGCACCACGA GAGCGCGTGG
CAGGCGCGGG CGGCCAGCGC GCTGCTGCAC CTGTTCGGGG CCGAAGGGCC GCCGGTCGAG
CTCACCGCCG CCCCCGAGGC GCTGAGGGCC GGCGTGGGTG AGGCGATCGA CGCCGCGCCC
GTCGCCGTGC GCGCGGACGG CTGCGTCTAC TCGGCGCTCA GCGCCCAGGT GGCCTGGCAC
CCCGAGCACC GGCTCAAGCA CGCGGGTACC GCACTGCTCA CGGCGACCGA TCCCGGTACC
GCCGAGGTCA CGGCCACCGT GGGCGCGCTC ACGGCACGCC GGGTGATCAC CGTCGTGGAC
GGCGGGCCGA CCGCGACGCT CGACGTCACC GTCGTCACCC CGCCCGGCAC CCCCGAGGGG
GACACCGTCT ACTTCTCCGG CCTCGTCACC ACCCGCGTCG CCCCCGGCGT CCACCACGGC
CGGTGGCGGC TGCCCCGGGG GCTCGGGCTC AACGGCACCG TCGGCAGGGG CTGGCGCTGC
GACGGGCTGG ACGCGGACGG CCTCCCGATC CGCGGGCCGC TCAGGCACGA CGCCGACCGC
AGCCTGTCCG TCAGGGTCGA AGGCTGGAGC GATCCGGAGC GGGACGACCC GCACAAGGGC
TCGGACCCCG CGGACGATCC GGAGCGGGAC GGCCCGCACA GGGGCTCGGA CCTCGCTGAC
GGGCCCGGCA TGCCCGGCCC GGACTCCCAG ACCGACACCC CCCACTCCGG CTCATGA
 
Protein sequence
MIDILRGVRS AALGNSRDVF VYSPPGYDPA AGPYPVVYLH DGQHVFAGAG LEPSWRLDET 
LDELIGGGAL PPLVAVGVSN AGDERGAEYS HAVPYPRDPR GLSKGELYER FLIEELMPLI
RSRYAVTGSA GSTAVMGSSM GGLVSYHLAF RRPDVFGLAA ILSPFLVFVD PETLTETPVY
RRFTERGPGR VWVDIGGMEG LITVRHAREL AAQLVGLGYA PDTELRYRHE PDAPHHESAW
QARAASALLH LFGAEGPPVE LTAAPEALRA GVGEAIDAAP VAVRADGCVY SALSAQVAWH
PEHRLKHAGT ALLTATDPGT AEVTATVGAL TARRVITVVD GGPTATLDVT VVTPPGTPEG
DTVYFSGLVT TRVAPGVHHG RWRLPRGLGL NGTVGRGWRC DGLDADGLPI RGPLRHDADR
SLSVRVEGWS DPERDDPHKG SDPADDPERD GPHRGSDLAD GPGMPGPDSQ TDTPHSGS