Gene Sros_0531 details


Gene Information

Locus tag: Sros_0531
Symbol:
ID: 8663800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 537091
End bp: 538221
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003336298
Protein GI: 271962102
COG category:
COG ID:
TIGRFAM ID:
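
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(537091, 538221)
print(length)                  # 1131
print(protein_length(length))  # 376
```

Both results agree with the Gene Length and Protein Length fields of this record.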


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGAGA TCACCCTGCT GGGGGCCGCC CTGCCGGACG CCCCGCCACC CACACCCGAG 
GCCGTCGCCC GCGCCCGCGC CCGCCTGACC GCCCACGGGG TACGGCGCCG CCGTCACCCC
ACCTGGACAC TGATCATTGG AGCCTCCATG GCCACCGCCG CCGTCATCAC CGCAGTCGCG
CTGGCCGCGA CCCTCCTGGC CCCGGCGCCG CCCTCGGTGC TGGAGACACC GAAGACCGGT
GAGCACCTGC TCCGGGAACT CGCGGACAGG GTGGAGAAGC TCTCACCCGG GACCGGCGCC
TACTGGCGCG TCCAGGGGAC TCGCGTCAAC CGGTATGCGG TCGGCACCGG ACCCACGCGC
TACTGGATCG CGTCCAGAGG GGAGGTGCGC CAGTGGACAC CGCGGAAGCC GGGAGCCTTG
TACGTCCAGG AGACCGAGTT GTCCGGCATC CGGCCGGACA CACCGCGGGA CGAGAAGATC
TGGCGGAAGC AGGGCTCGCC CGACCGCTGG CGCCTGCCCA AGTGCGAGAG CTCCTCCCCT
CCCTGCGCTC CGACCGCCCT CGCCGACAAG CGGTCGCGAC GCGAGTACCG GATCATGGGA
GACGTCCCCG ACCCCGGCCT GGGAGGTCTC ACCATCGCCG AGTTGGACGC CCTCCCGACC
GATCCGGCGC GGCTGCGGGA GCGCCTTGAG GGCTACCGCA AGGCCGAGCA GAAGCGGGGC
CTCAAACGGT CCTGGGAGGA GTTCCTCAAG GCGGCCGTGC GCGATATGAC GGTCACGCCG
GTCAGTCCCG GACTCCGGGC GGCGCTGCTG CGCCTGTACG TGGAACAGCC CGGGGCCGAG
GTGGCGCGGG AGGACAGCGA TCCGCTGGGC CGTCCCGCCA TCGCCATCGA CCTCGAGACC
AAGGGCTACT TCCAGCTGGG CACCCGTATG GTGCCGATCA CGAAAGAGAT CCTCCTCGAC
CCTCGGACCG GTGAGGGCAT GGCCGAGAGG TCTGTCACGA CGGACGCCGA AGGCGGGTTC
CCGAAGGGCA CCGTGGCCCA CTACGTGGTC GTCGAGAAGA TGGGCTGGAC CGATGAGCGG
CCCAAGCTTC CCTCGGGCTG CCGGCTGAAG GCCGGCGTCA CCTGCCGCTG A
 
Protein sequence
MDEITLLGAA LPDAPPPTPE AVARARARLT AHGVRRRRHP TWTLIIGASM ATAAVITAVA 
LAATLLAPAP PSVLETPKTG EHLLRELADR VEKLSPGTGA YWRVQGTRVN RYAVGTGPTR
YWIASRGEVR QWTPRKPGAL YVQETELSGI RPDTPRDEKI WRKQGSPDRW RLPKCESSSP
PCAPTALADK RSRREYRIMG DVPDPGLGGL TIAELDALPT DPARLRERLE GYRKAEQKRG
LKRSWEEFLK AAVRDMTVTP VSPGLRAALL RLYVEQPGAE VAREDSDPLG RPAIAIDLET
KGYFQLGTRM VPITKEILLD PRTGEGMAER SVTTDAEGGF PKGTVAHYVV VEKMGWTDER
PKLPSGCRLK AGVTCR