Gene Sros_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0472 
Symbol 
ID8663741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp471527 
End bp472726 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content67% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003336241 
Protein GI271962045 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.891873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGTG ATCTACTCGA TCTCATCCTG ATCGCCCTGA TGGTGGCCTT CGCGGTGTCG 
GGATACCGCC AGGGGTTCAT CATCGGGGCC CTGAGCTTTG TCGGATTCGT CGGTGGTGGG
CTGCTGGGCA TCTTCATCGC CCCGCCGATC GCCGGCGCTT TCGTCGACGG CGAGACCGAG
CGGGCGCTGC TCGCCATCGT CATCGTCTTC CTGACCGCGA CGATCGGGCA GTTCGCGTCC
TCGACCATCG GCGCGGTGGT CCGCAGCCAC GTCACCTGGG AGCCCGCCAA GGTGGTCGAC
GCGGTCGGCG GCACCTTCGC CAGCGCGTTC TCGGTGCTGA TCATCGCCTG GCTGATCGGC
TCGCTGATCT CCTCCTCCCA GTTCACCCTG CTGAGCGAGC AGGTCAACAA GTCACTGCTG
ATCAGCACCG TCGACCAGGC GATGCCGAAG GCGGCCAAGG ACTTCCAGAA GCCCTTCAAG
GACTTCATCG ACACCTCCGG CTTCCCCAAG GTGTTCGACG CCATAGGGGC CGGCCAGCTC
GTCGAGGTCG AGCCGCCCGA CAAGAGCGTG CCCAAGGGGG CGCAGCTCTC ACGGGCCCGC
CGGGGCATCG TCAAGGTCCA GGGCGTCGCC TCCAGCTGCC GCCGGCACAT CGAGGGCACC
GGCTTCGTCT ACTCCCAGAA CAAGATCATG ACGAACGCCC ACGTGGTCGC GGGTGTCGAC
CAGGAGCTGC AGGTCACCGA CTACCTCAAC AACGCCCACG CGGCCAAGGT CGTGCTCTAC
AACCCCGACA GGGACATCGC GATCCTCCAC GTCCCCGGAC TGAACATGCC GATCCTGCGC
TTCGACGGCA CCGCCAAGAA GGGCGACGAC GCCATCGTCG CCGGCTTCCC GCACGGCGAG
GGCTTCACCA TGAACGCGGC CCGCATCCGG GTGCAGCAGA AGGCCAGGGG GCTCAACATC
TACGAGCGCA AGACCGTCGT CCGCGACGTC TACGCCATCC GCGGCCTGGT CCGGCAGGGC
AACTCCGGCG GGCCGCTGCT CACCCCCGAC GGCCGGGTCT ACGGGGTCGT CTTCGCGGCC
GCGCTCGACC AGCAGGAGAC CGGCTACGTC CTGACGGCGG CCGAGGTGTC ACCGGACGCC
GAAGACGGCT CCAAGCTGTT CAACCGGGTC GACACCCAGG AGTGCGACCA GAACAGCTAG
 
Protein sequence
MSGDLLDLIL IALMVAFAVS GYRQGFIIGA LSFVGFVGGG LLGIFIAPPI AGAFVDGETE 
RALLAIVIVF LTATIGQFAS STIGAVVRSH VTWEPAKVVD AVGGTFASAF SVLIIAWLIG
SLISSSQFTL LSEQVNKSLL ISTVDQAMPK AAKDFQKPFK DFIDTSGFPK VFDAIGAGQL
VEVEPPDKSV PKGAQLSRAR RGIVKVQGVA SSCRRHIEGT GFVYSQNKIM TNAHVVAGVD
QELQVTDYLN NAHAAKVVLY NPDRDIAILH VPGLNMPILR FDGTAKKGDD AIVAGFPHGE
GFTMNAARIR VQQKARGLNI YERKTVVRDV YAIRGLVRQG NSGGPLLTPD GRVYGVVFAA
ALDQQETGYV LTAAEVSPDA EDGSKLFNRV DTQECDQNS