Gene Sros_5902 details


Gene Information

Locus tag: Sros_5902
Symbol:
ID: 8669196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 6470963
End bp: 6472051
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Zn-dependent protease-like protein
Protein accession: YP_003341380
Protein GI: 271967184
COG category:
COG ID:
TIGRFAM ID:
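
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count of a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Sros_5902.
length_bp = gene_length(6470963, 6472051)
print(length_bp)                  # 1089
print(protein_length(length_bp))  # 362
```

Both results match the "Gene Length" and "Protein Length" fields of the record.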


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0267336
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.154235
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCGGC CGTTCGGCAT CCCGGTGTAC GTCTCGCCGA CCTGGTTCAT CGTGGCGGCG 
TTCATCACCT TCACCTACCA GCCGATCGTG ACCGGCCGGC TGCCCGAGCT GAGCGTCCCG
GTGACGTATC TCGTCTCGTT CGTCTTCGCG GTGCTGCTGT ACGTCTCGGT GCTGCTGCAC
GAGCTGGCGC ACTGCGTGGT CGCCAGGATG TACGGCCTGC CGGTCCGCCG CATCACCCTC
TACCTGCTCG GCGGCGTCTC GGAGATCGAG CGCGAGCCGG AGACCCCAGG CCGGGAGTTC
ATGGTGGCCT TCGCCGGGCC GCTGCTCTCG CTCGGGCTGG CCGCGATTGG TTTCGCCGCC
TCCCTGGTCA TCGACCCGGG CACCGTCCTC GGCGTGCTGA CCTTCCAGCT CTGGTTCGCC
AACCTGATCG TCGGGGTCTT CAACCTCCTG CCGGGGCTGC CGCTCGACGG CGGGCGCATG
CTGCGCGCCG GTGTCTGGAA GGCCACCCGC AACCCGGGCT CGGGCACGAT CGCCGCGGCC
TGGGTGGGCC GCGTGCTGGC CGTCGTCATG GTCGCGGTGC CCGTCGGCCT GGCGCTGATG
AGCGGGCAGG CGCCGGGCTG GGAGCTCATC TGGTCGGTGC TGCTGGCCTC CTTCATCTGG
TTCGGCGCCA CCCAGGCGCT GCGCGGCGCC CGGGTGCGCG CCCGCATCCC GCAGGTGAAC
GCCCGCGCCC TGGCCAGGCG GGCCATCGCG GTGACCGGCG ACGTGCCGCT GGCCGAGGCC
CTGCGCCGGG CCGCGGAGGC CCAGGCGGGG GCGATGGTGG TGGTCGACCA CGAGGGCCGC
CCGACCGGGA TCGTCAACGA GGTCGCGGTG GAGGCCACCC CGGAGAACCG CCGCCCCTGG
GTGACGGCCG GCTCGCTGGC GCGCGGCCTC GAACCGTCGC TGGTGCTGGC CGCCGACCTG
TCGGGGGAGT CCCTGATCGA CGCCATGCGC GAGGCCCCAG CGGGGGAGTA CCTCCTCGTG
GAGCGCGGCG GCGAGATCTT CGGGGTGCTG GCCACCTCGG ACGTGAACCG GGTGTTCAGC
GGGGTCTGA
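
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence can be analyzed. The sketch below shows one way to do that and to compute the GC fraction of the result. It assumes the sequence is available as a plain string, the helper names are hypothetical, and only the first two blocks of the sequence are used for illustration.

```python
def normalize_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, as displayed in the record.
excerpt = "ATGGGGCGGC CGTTCGGCAT"
seq = normalize_sequence(excerpt)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the "GC content" field of the record.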
 
Protein sequence
MGRPFGIPVY VSPTWFIVAA FITFTYQPIV TGRLPELSVP VTYLVSFVFA VLLYVSVLLH 
ELAHCVVARM YGLPVRRITL YLLGGVSEIE REPETPGREF MVAFAGPLLS LGLAAIGFAA
SLVIDPGTVL GVLTFQLWFA NLIVGVFNLL PGLPLDGGRM LRAGVWKATR NPGSGTIAAA
WVGRVLAVVM VAVPVGLALM SGQAPGWELI WSVLLASFIW FGATQALRGA RVRARIPQVN
ARALARRAIA VTGDVPLAEA LRRAAEAQAG AMVVVDHEGR PTGIVNEVAV EATPENRRPW
VTAGSLARGL EPSLVLAADL SGESLIDAMR EAPAGEYLLV ERGGEIFGVL ATSDVNRVFS
GV