Gene Sros_1214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1214 
Symbol 
ID8664489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1245434 
End bp1246564 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content68% 
IMG OID 
Productputative hydrolase 
Protein accessionYP_003336955 
Protein GI271962759 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.260991 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTCA TGAACGTTGA CATCACCCCG TTCCGCATCG AGATCCCGCA AGCCGACCTT 
GACGATCTGC GGGAGCGGCT GCGCCGTACC CGCTGGAGCG GGGAGATCGG CGGGCAGGGC
TGGAGCCGCG GAGTGCCCGT CGACTACCTC AGGCAGCTCG CCGACTACTG GGCTGACGGC
TACGACTGGC GCAAGCAGGA GGCCAGGCTG AACGACCTGC CCCAGTTCAC CACTGAGATC
GACGGGCAGC GCCTGCACTT CGCGCACGTC CGCTCGGCCA ACCCCGACGC CGTCCCGTTG
CTGCTCACCC ATGACTGGCC GGGCTCGTTC GTCCTGTTCC TCCAGGCTGT CGAGCCGCTC
TCGCGGGACT TTCACCTGGT CCTGACCACC CTGCCCGGCA TCGCCTTCTC CGGGCCGCTG
GCTGGGCCGG GCTGGAACAC CGGCAAGATC GCGGGCGCGT TCGTCGAGTT GATGGGGCGC
CTCGGCTACG ACCGCTATGG CGTTCAGGGG TCCGGCGGCG GCGCTGCGGT CGCCATCGAG
ATGGGCCGCC AGGCACCGGA GCAGGTGATC GGCGTCCACG GCAACGGTCA CATCACCTTC
CCCTCGGACG ACCCGGCCGA CTTCGCCGAC CTGACCGAGG CCGAGCAGCA GCGCCTGGCC
AGGCTGCAGA ACTTCCGCGA CGACAAGATG GGTTTCAACG TCATCTCCGC CACCAGGCCG
CAGACCCTGG CCTACGGCCT GCACGACTCC CCGGTCGGCC AGCTGGCCTG GATCACCGAG
AAGTTCAAGG AGTGGACCGA CGACTCGGCC GATCTGCCCG AGGACGCGGT CGACCGGGAC
ATCCTGCTGA CCAATGTCAG CCTGTACTGG TTCACCGGCA CCGCGGGCTC GTCCGCCAAC
CTGTACTACG AGGCGTCCCA CGACCCCGAC GCCTGGACGC CCAAGCCACG CAGCGGCGTT
CCGACCGGCT TCACGGTGGC CATGAGCACC GACGTGACCA TCCGCCGCTT CGCCGAACGC
GACAGCGACG TGGTCCACTG GAGCGAACTC GAACGCGGCG GCAACTTCCT CGCCCTCGAA
CAGCCGGCCG CCTACGCCGC GGATGTCAAG AAGTTCTTCG ACAGCCTCTG A
 
Protein sequence
MNVMNVDITP FRIEIPQADL DDLRERLRRT RWSGEIGGQG WSRGVPVDYL RQLADYWADG 
YDWRKQEARL NDLPQFTTEI DGQRLHFAHV RSANPDAVPL LLTHDWPGSF VLFLQAVEPL
SRDFHLVLTT LPGIAFSGPL AGPGWNTGKI AGAFVELMGR LGYDRYGVQG SGGGAAVAIE
MGRQAPEQVI GVHGNGHITF PSDDPADFAD LTEAEQQRLA RLQNFRDDKM GFNVISATRP
QTLAYGLHDS PVGQLAWITE KFKEWTDDSA DLPEDAVDRD ILLTNVSLYW FTGTAGSSAN
LYYEASHDPD AWTPKPRSGV PTGFTVAMST DVTIRRFAER DSDVVHWSEL ERGGNFLALE
QPAAYAADVK KFFDSL