Gene Sros_3852 details


Gene Information

Locus tag: Sros_3852
Symbol:
ID: 8667142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4288397
End bp: 4290685
Gene Length: 2289 bp
Protein Length: 762 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: putative dehydrogenase
Protein accession: YP_003339513
Protein GI: 271965317
COG category:
COG ID:
TIGRFAM ID:
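The length fields above follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are hypothetical and are not part of the record.

```python
# Coordinates copied from the record above.
START_BP = 4288397
END_BP = 4290685


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming a complete CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2289
print(protein_length(length_bp))  # 762
```

Under these assumptions the results agree with the Gene Length and Protein Length fields.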


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0172698
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAGGA AGGCTCCGCG CGAGGATGTC ACCGAGGATG ATCTCGAGGT CGGCCCGCCC 
AAGGAGTGGG CGGCGGGCAT GCCGGGAGTG ACCCGCTCGC TGATGACCTC CTACGCCCAG
ATGGGTGTCG GCCGCACCCT CCTGACCCTG GTCCGCGTCA ACCAGAAGGA CGGCTTCGAC
TGCCCGGGCT GCGCCTGGCC CGAGGGGGAG CACCGCAGCC CGGCGGAGTT CTGCGAGAAC
GGTGCCAAGG CGGTCGCCGA GGAGGCCACC ACCCGCCGGG TCACCCGCGA CTTCTTCGCC
GGGCACACCG TGGGAGAGCT GGCGGGGCGT ACCGACTACT GGCTGGGCCA GCAGGGACGG
CTCACCGAGC CGATGCACAA GCCGGCCGGC TCCGACCACT ACGTGCCGGT GACGTGGGAT
GAGGCGTTCG CCATCGTCGC CCGGGAGCTG CGCGCGCTCG GCGGCCCGGA CGAGGCGGTG
TTCTACACCT CCGGCCGCAC CTCGAACGAA GCCGCGTTCG CCTACCAGCT CATGGTGCGC
CGGTTCGGCA CCAACAACCT GCCGGACTGC TCCAACATGT GTCACGAGTC CAGCGGCTCG
GCGCTCAACC AGACGCTGGG CATCGGCAAG GGCACGGTGT CGCTGGAGGA TCTGCACCGC
GCCGACCTCG TCTTCGTCGT CGGCCAGAAT CCCGGCACCA ACCATCCGCG CATGCTGTCG
GCGCTGGAGA AGGCCAAGCG GAACGGGGCG CGGATCGTCG CGGTCAACCC GCTGCCCGAG
GCGGGGCTGC TGCGCTTCAA GAACCCGCAG CGGCCGTCCG GCGTGGTGGG GCGCGGTACG
ACTCTGGCCG ACCGGTTCCT GCAGATCCGG CTCAACGGCG ACCTCGCGCT CTTCCAGGCC
CTGTCGCGGC TGCTGCTGGA GGCCGAGGAG GCGGACCCCG GCTCGGTGGT CGACCACGAG
TTCGTCCGGG CCCACACCCA CGGCTTCGAC GAGTGGGCCA AGAGCGTCCG CGACCTCGAC
TGGGCGGACG TCGAGGAGGC GACCGGGCTG GAGCGGCCCG CCATCGAGGA GACGGTCAGT
GAGGTGCTCG GGGCGCGCTC GGTGATCGTG TGCTGGGCGA TGGGCCTCAC CCAGCACAAG
AACTCCGTCG CCACCATCCG CGAGGTGGTC AACTTCCTGC TGCTGCGCGG GAACGTGGGG
CGGCCGGGTG CCGGGGTCTG CCCGGTGCGC GGGCACTCCA ACGTCCAGGG CGACCGCACG
ATGGGCATCT ACGAGAGACC CGCGGAGCGC TTCCTCGACG CCCTGCGTGA GGAGTTCGGT
TTCGAGCCGC CCCGGCACCA CGGCCTGGAC ACCGTGGCGG CGATCAGGGC GCTGCGCTCG
GGGGAGGCGA AGGTGTTCTT CGCGATGGGC GGCAACTTCG TGGCCGCGAC CCCGGACACG
GCGGTGACGG AGGCGGCGAT GCGCCGGGCC CGGCTGACGG TGCAGGTGTC GACCAAGCTG
AACCGCTCCC ACACCGTCTG CGGCGAGCAG GCGCTCATCC TCCCGACGCT GGGGCGCACC
GAGCGCGACG GCGACCGGTT CGTCACGGTC GAAGACTCCA TGGGCCTGGT CCACGCCTCC
CGGGGCCGGC TCCGCCCCGC CTCGCCGGAC CTGCTCCCCG AGGTGGCGAT CGTCTGCCGG
CTGGCCCGGG AGGTCTTCGG CGCCGACCCC CACGTGCCCT GGGAGGAGTT CGAGGCCGAC
TACGACACGA TCCGCGACCG CATCGCCCGG GTGGTGCCCG GGTTCGGCGA CTTCAACGCC
CGGGTCCGCG CACCCGGCGG CTTCGCCCTG CCCAACGCGC CCCGCGACGA GCGGCGCTTC
CCGACCGCGA CGGGGAAGGC CAACTTCACC GTCAACGCCC TGGAGGTGCT GCGCGTCCCC
GCCGGGCGGC TCCTGCTGCA GACGGTCCGC AGCCACGACC AGTACAACAC CACGATCTAC
GGCATGGACG ACCGCTACCG GGGGGTGAGC GGCGGCCGCC GCGTCGTCTT CGTCCACCCC
GGCGACCTGG CCGAGCGGGA CCTGGCCGAC GGCGACCTGG TGGACCTGGT CAGCGAGTGG
CCCGACGGGG AGCGCAGGGC CGAGGCGTTC CGCGTCATCG CCTACCCGAC CGCGCGCGGG
TGCTGCGCGG CGTACTTCCC CGAGACCAAC GTGCTCGTGC CGCTGGACTC GGTCGCGGAG
ACCTCCAACA CCCCCACGTC CAAGAGCGTC GTGGTGAGGC TCAGCCGGTG TGCCCCAGGC
GGTGGCTGA
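
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any calculation. A minimal sketch of how a GC percentage such as the one in the GC content field could be computed follows. The helper names are hypothetical, and the record does not state how its own value was derived or rounded.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full gene sequence block in place of this first line.
first_line = "ATGGCGAGGA AGGCTCCGCG CGAGGATGTC ACCGAGGATG ATCTCGAGGT CGGCCCGCCC"
print(round(gc_percent(clean_sequence(first_line)), 1))
```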
 
Protein sequence
MARKAPREDV TEDDLEVGPP KEWAAGMPGV TRSLMTSYAQ MGVGRTLLTL VRVNQKDGFD 
CPGCAWPEGE HRSPAEFCEN GAKAVAEEAT TRRVTRDFFA GHTVGELAGR TDYWLGQQGR
LTEPMHKPAG SDHYVPVTWD EAFAIVAREL RALGGPDEAV FYTSGRTSNE AAFAYQLMVR
RFGTNNLPDC SNMCHESSGS ALNQTLGIGK GTVSLEDLHR ADLVFVVGQN PGTNHPRMLS
ALEKAKRNGA RIVAVNPLPE AGLLRFKNPQ RPSGVVGRGT TLADRFLQIR LNGDLALFQA
LSRLLLEAEE ADPGSVVDHE FVRAHTHGFD EWAKSVRDLD WADVEEATGL ERPAIEETVS
EVLGARSVIV CWAMGLTQHK NSVATIREVV NFLLLRGNVG RPGAGVCPVR GHSNVQGDRT
MGIYERPAER FLDALREEFG FEPPRHHGLD TVAAIRALRS GEAKVFFAMG GNFVAATPDT
AVTEAAMRRA RLTVQVSTKL NRSHTVCGEQ ALILPTLGRT ERDGDRFVTV EDSMGLVHAS
RGRLRPASPD LLPEVAIVCR LAREVFGADP HVPWEEFEAD YDTIRDRIAR VVPGFGDFNA
RVRAPGGFAL PNAPRDERRF PTATGKANFT VNALEVLRVP AGRLLLQTVR SHDQYNTTIY
GMDDRYRGVS GGRRVVFVHP GDLAERDLAD GDLVDLVSEW PDGERRAEAF RVIAYPTARG
CCAAYFPETN VLVPLDSVAE TSNTPTSKSV VVRLSRCAPG GG
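
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code, differing only in which codons may serve as starts. It does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGCGAGGA AGGCTCCGCG CGAGGATGTC"))  # MARKAPREDV
print(translate("GGTGGCTGA"))                         # GG
```

Both fragments match the start and end of the listed protein sequence.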