Gene Sros_6008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6008 
Symbol 
ID8669302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6582877 
End bp6584016 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content74% 
IMG OID 
ProductSoxB2 
Protein accessionYP_003341485 
Protein GI271967289 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00127478 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.130354 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGACG TCGTGGTGAT CGGCGCCGGA GTCGTCGGGG CCGCGTGCGC GTACTACGCG 
GCGCGTGCCG GGCTGGACGT GGTCGTCGTC GACCGCGGGC CCGTGGCGGG CGGCACGACC
GGCGCGGGAG AGGGAAACGT CCTGGTCTCC GACAAGGAGC CGGGTCCCGA GCTCGACCTC
GCCCTGCTCT CCAACGGCCT CTGGCGCGAG CTGGCGGAGC TCGACGGCTT CGAGTTCGAG
GCCAAGGGCG GCCTGGTGGT CGCCGAGACC GGCGAGGTGC TGGAGGCGCT CACCGGCCTG
GCGGGCAAGC AGGGGGTCGA GCACACGGTG GTCGCCTCCG GCGGGCTCAA CGACTACGAG
CCCCACCTGG CAGGTGGCTT CGCCGGAGGC GTGTTCTACC CGCAGGACGC CCAGGTCCAG
CCCATGCTGG CGGCGGCCAG GCTGATCCGG CGGGGCGCCG ACAGCTTCGG CCGCGGCGCC
CTGATGCTGC GCACCGGTGT CACGGTCACC GGCTTCCTGC GCGACGGCGA CCGGATCGGC
GGCGTCACGA CCGACCACGG CGACATCCTC GCCGGAGCCG TCGTCAACGC CGCCGGGACC
TGGGGCGGCG AGGTGGCCGC CATGGCCGGC GTGCACGTCC CGATCCTGCC CCGGCGCGGC
TTCATCCTGG TCACCGAGCC GTTCGACAGG CCGCTGATCA GGCACAAGGT CTACACCGCC
GCCTATGTCA CCAACGTGGC CAGCGACTCG GAGGGCCTGG AGACGTCCGC CGTCGTGGAG
GGCACCCCCT CGGGGCCGGT GCTCATCGGC GCCAGCCGCG AGCGCGTCGG CTTCGACCGC
ACGGTCTCCG TACCGGTGCT GGAACGCCTC GCCCGCCAGG CCGTGGAGCT GTTCCCGGCG
CTGGCCGACC GCAGGGCGAT CCGGGCCTAC TGCGGCTTCC GGCCCTACTG CCCCGACCAC
CTGCCGGTGA TCGGTGAGGA CCCCCGGGCC CCCGGCCTCC ACCACGCCTG CGGCCACGAG
GGGGCGGGCA TCGGCCTGGC CCCCGCCACC GGCCACCTGA TCGCCCAGTC GCTGGCCGGT
CTCCGCCCCG ACCTCGACCT CACGCCCTTC CGCCCGGACC GCTTCGAGGA GCGCCGATGA
 
Protein sequence
MPDVVVIGAG VVGAACAYYA ARAGLDVVVV DRGPVAGGTT GAGEGNVLVS DKEPGPELDL 
ALLSNGLWRE LAELDGFEFE AKGGLVVAET GEVLEALTGL AGKQGVEHTV VASGGLNDYE
PHLAGGFAGG VFYPQDAQVQ PMLAAARLIR RGADSFGRGA LMLRTGVTVT GFLRDGDRIG
GVTTDHGDIL AGAVVNAAGT WGGEVAAMAG VHVPILPRRG FILVTEPFDR PLIRHKVYTA
AYVTNVASDS EGLETSAVVE GTPSGPVLIG ASRERVGFDR TVSVPVLERL ARQAVELFPA
LADRRAIRAY CGFRPYCPDH LPVIGEDPRA PGLHHACGHE GAGIGLAPAT GHLIAQSLAG
LRPDLDLTPF RPDRFEERR