Gene Sros_4246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4246 
Symbol 
ID8667540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4729638 
End bp4731281 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content67% 
IMG OID 
ProductCholesterol oxidase 
Protein accessionYP_003339891 
Protein GI271965695 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.191663 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA ACACCTCGGG CAGCACCGAC TCCAAAGGGA TCTCACGTCG TGGATTCATC 
GCTGGAACCG GTTCCATCTT GGGGGTCGCG GCCCTTACGG GTCGCGCCAC CGCAGCCCAG
GCGGCGGCCC TCCCCGCGGC CGCTCCGATC AGCAGTGGGG CACACGTCCC GGCCCTGGTG
ATCGGCACCG GATACGGCGG CTCCGTCGCC GCCCTGCGTC TCGCCCAGGC GGGCGTCGAC
GTGCACATGA TCGAGATGGG CATGGCCTGG GACACTCCCG GCTCCGACGG CAAGATCTTC
TGCAACACGC GCGAGCCGGA CTACCGGTCC TACTGGCTGC GCACCAAGAG CAAGGCGCCC
CTCAACTACT TCCTCGGCTT CCCGATCGAC AGGAACATCC CCCGCTACAC CGGGATCCTG
GACGCCGAGG ACTTCAGCGG CATCACGGTC TACCAGGGCC GCGGCGTCGG CGGCGGCTCG
CTGGTCAACG GCGGCATGGC GGTCACCCCC AAGCGCGAGA ACTTCGGCGC CGTCCTCCCG
TCGGTGAACG CCGCCGAGAT GTACGACATC TACTATCCGC GCGCCAACGC CGGGCTCGGG
GTCAGCTCCA TCGATCCGGC CTGGTTCGAT TCCACCGCCT GCTACCAGTA CGCCCGGGTC
GGCCGCAAGC ACGCCCAGCG TTCCGGCTTC CCGTTCGTCT TCGTGCCCGA CGTCTACGAC
TGGGACTACA TGAAGCAGGA GGCGGCCGGG ACCGTCACCA AGTCGGCGCT GGCCGGAGAG
ATCCTCTACG GCAACAACCA CGGCAAGAAA TCGCTGCAGC AGACCTACAT CGCCCGGGCC
AAGGCCACCG GCAGGGTCGC CATCTCGCCG CTGCACAAGG TCACCTCGGT CGCTCCGGCG
GCCGGCGGCG GCTACACGGT CGTCATCGAC CAGATCAACA CCAACGGCGA CACCACGGCC
ACCAAGACCG TGACCGCGGA CAGGGTGTTC TTCGCCGCCG GCAGCGTCGG CACCAGCAAA
CTGCTGGTCA AGCTGAAGGC CACCGGCGCA CTGCCCAACC TCAACGACGA AATCGGCAAG
GGCTGGGGCG ACAACGGCAA CGTCATGTGC GGCCGCGCCA ACCACATGTG GGACCCGACC
GGCAGTCTCC AGTCGGCCAT CCCCTGCGCC GGCATCGACA ACTGGGCCGC CGGCGGCGCG
TTCGCCGAGG TCGCGCCACT GCCCACCGGG ATCGAGACCT ACGCCTCGTT CTACCTGTCG
ATCACCAAGA ACCCGAACCG CGCCCAGTTC TCCTGGAACG CCGCGACCGG CAAGGTCGAC
CTGAACTGGC AGACCTCCTG GAAGCAGCCG TCCATCGACA TGGCCAAGAC GATCTTCGAC
AAGATCAACT CGAAGGAGGG GACGATCTAC CGGACCGACC TCTTCGGCAC CTACAAGATC
TGGGGCGATC ACCTCACCTA CCACCCGCTC GGCGGCGCGG TCCTGAACAA GGCCACCGAC
AACTACGGCC GTCTCGCCGG CCATCCCGGC CTGTATGTCA TCGACGGCTC GCTGATCCCC
GGCAACACCA GCGTCAACCC GTTCGTCACC ATCACGGCGC TCGCCGAACG GAACATCGAA
AAGATCATAG CCACCGATCT GTGA
 
Protein sequence
MSDNTSGSTD SKGISRRGFI AGTGSILGVA ALTGRATAAQ AAALPAAAPI SSGAHVPALV 
IGTGYGGSVA ALRLAQAGVD VHMIEMGMAW DTPGSDGKIF CNTREPDYRS YWLRTKSKAP
LNYFLGFPID RNIPRYTGIL DAEDFSGITV YQGRGVGGGS LVNGGMAVTP KRENFGAVLP
SVNAAEMYDI YYPRANAGLG VSSIDPAWFD STACYQYARV GRKHAQRSGF PFVFVPDVYD
WDYMKQEAAG TVTKSALAGE ILYGNNHGKK SLQQTYIARA KATGRVAISP LHKVTSVAPA
AGGGYTVVID QINTNGDTTA TKTVTADRVF FAAGSVGTSK LLVKLKATGA LPNLNDEIGK
GWGDNGNVMC GRANHMWDPT GSLQSAIPCA GIDNWAAGGA FAEVAPLPTG IETYASFYLS
ITKNPNRAQF SWNAATGKVD LNWQTSWKQP SIDMAKTIFD KINSKEGTIY RTDLFGTYKI
WGDHLTYHPL GGAVLNKATD NYGRLAGHPG LYVIDGSLIP GNTSVNPFVT ITALAERNIE
KIIATDL