Gene Sros_4215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4215 
Symbol 
ID8667509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4697307 
End bp4698947 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003339860 
Protein GI271965664 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0566809 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000593788 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACTCTC TCGACGTGCT GATCGTGGGC GCCGGGCCTA CCGGGCTGAC CCTGGCCTGC 
GACCTGATCC GCCGCGGCCT CACCTGCCGG ATCGTCGAGC AGGCCCCCAC CCCCCAGACC
GGCTCCCGGG GCTTCACCCT CAAGCCCCGC AGCCTGGAGA TCCTTGACGA CCTCGGCGCC
GCCCACCGTG TCCTGGCCGC CGCCCAGGTG CAGTCCCGGA CCCGTTTCCA CCTGGGCGAG
CCGCTGCTGT TCGACCTGCG GGTGCCGCCC GCGGCGCCCG ACCCGCGGCG TCCCCATCCC
AACTCGCTGG CCATTCCCCA GTGGCGGACC GAGGCCATCA TGCGCGAGCG GCTGGCCGAG
CTGGGCGGCG TGGTCGAGTT CGGGCGCCGG CTGACCGGCT TCCGCTCCGG CGACGGCCAG
GGCGAGGACG CGGGAGTGAC CGCGACCCTG CGACGCGACG GCGTCACCGA GACCGTGCGC
GCCTCCTACC TGGTCGGGGC GGACGGCGGC CGCAGCACCG TCCGCCGCCG TCTGGGCCTG
GCCTTCTCCG GTTCCACCGA CGGCGACGCC CGCGCGTTGA TCGCCGACGT GCACGTCGAC
GGACCGCGTC ACCGCGACGC GGTGCACCTC TGGATGGCCG CCGACGGCCA CATCGTGGTG
CTGCGGCCCA CCCCCCACGC CCCGACCTGG CAGGTCGTGG CCTCACTGGC CCCTGACACG
GACGGCACCT GGCCCGAAGC CTCGCTGGAA CACCTGCAGC GGGCGGTGAC CGAGCGCACC
GGCCGCGACG ACATCCGGCT GAGCGAGCCG GCCTGGCTGT CGGTCTGGCG CTACAACCTG
CGCATGGTCG ACACCTACCG AGTCGGCCGG GTGTTCCTCG CCGGCGACGC CGCTCACGTG
CACAGCCCGT TCGGCGGGTT CGGCATGAAC ACCGGCATCC AGGACGCCTA CAACCTCGGC
TGGAAACTCG CCCTGGTCCT GCGGGGCGCG GCCGGCGACG CCCTGCTGGA CACCTACCAG
GCCGAGCGGC TCCCCGTCGC CCGCGCGATC CTCGCCGAAA GCGACAGGCG CTTCGCCGCC
GCCACCCCGC CCCGCCTGAT CCGGCCGTTG CTGCGGTTCG TGCTCAAGCC GTTCTTCGCC
CGGCAACAGC TCAGCGACCG AAACGACCAT CCCACCTACC GCACCAGCCC GCTGAGCCTG
GACCTGACCG GCCGCCGCAG CCCCCTCCGC GCCGGCGACG TCGCCCCCGA CGGCCCCGTC
CAGCTTGACG CCGACGCCTC CCGCGCCCGG CTGTTCGACC TGTTCCGCGG GCCGCACTTC
ACCGTCCTGA CCTTCGGCGC CCAGCACGCC CGCGCGGCCG CGCACGCCAC CCGCGGCCTC
GCGGACCACC TGCGCGTCTG CACGGTCATC GCGCCAGGAC AGCAGCGGCC CCTCACCGGC
ACCACCGTCC TGATCGACAC CGAGGGCCGC ATCCGCCGCG CCTACGGCGC CCGCGACAGC
ACCCTGATCA TCGTGCGACC CGACGGCTAC GTCGGGCTCA TCGCGCACCG GCCCGCGGAA
TACACCCTGC GCGACTACCT CGCCCAGGCC CACCTGCGCC CCGTCCCCGC AGCGCCCCCT
CGCGCGGCCG CCTCGCGATG A
 
Protein sequence
MNSLDVLIVG AGPTGLTLAC DLIRRGLTCR IVEQAPTPQT GSRGFTLKPR SLEILDDLGA 
AHRVLAAAQV QSRTRFHLGE PLLFDLRVPP AAPDPRRPHP NSLAIPQWRT EAIMRERLAE
LGGVVEFGRR LTGFRSGDGQ GEDAGVTATL RRDGVTETVR ASYLVGADGG RSTVRRRLGL
AFSGSTDGDA RALIADVHVD GPRHRDAVHL WMAADGHIVV LRPTPHAPTW QVVASLAPDT
DGTWPEASLE HLQRAVTERT GRDDIRLSEP AWLSVWRYNL RMVDTYRVGR VFLAGDAAHV
HSPFGGFGMN TGIQDAYNLG WKLALVLRGA AGDALLDTYQ AERLPVARAI LAESDRRFAA
ATPPRLIRPL LRFVLKPFFA RQQLSDRNDH PTYRTSPLSL DLTGRRSPLR AGDVAPDGPV
QLDADASRAR LFDLFRGPHF TVLTFGAQHA RAAAHATRGL ADHLRVCTVI APGQQRPLTG
TTVLIDTEGR IRRAYGARDS TLIIVRPDGY VGLIAHRPAE YTLRDYLAQA HLRPVPAAPP
RAAASR