Gene Sros_5868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5868 
Symbol 
ID8669162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6430936 
End bp6432294 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content68% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003341346 
Protein GI271967150 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.459714 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGTC GCATCTTCGG GCTTGAGAAC GAGTACGGCG TCACCTGCAC GTTCAGGGGG 
CAGCGGCGGC TGTCACCCGA CGAGGTCGCG CGTTATCTAT TCCGCCGGGT CGTCTCCTGG
GGCCGATCGA GCAACGTCTT TCTGCGCAAC GGCGCCCGCC TCTACCTGGA CGTGGGCAGC
CACCCCGAGT ACGCCACCCC CGAGTGTGAC AACGTCGTGG AACTCGTCAC CCACGACAAG
GCGGGGGAGC GCATCCTGGA GGGACTCCTC GTCGACGCCG AGAAGAGGCT CCGCGAGGAG
GGCATCGCCG GCGACATCTA CCTGTTCAAG AACAACACCG ACTCGGCCGG AAACTCCTAC
GGCTGCCACG AGAACTACCT GGTCGGGCGG CACGGCGAGT TCGGCCGCCT GGCGGACGTG
CTCATCCCCT TCCTGGTGAC CCGGCAGATC GTCTGCGGCG CCGGGAAGGT GCTGCAGACC
CCCCGCGGAG CGGTCTACTG CGTCTCCCAG CGGGCCGAGC ACATCTGGGA GGGCGTCTCC
AGCGCGACCA CCCGGTCGCG CCCCATCATC AACACCCGTG ACGAGCCGCA CGCCGACGCC
GAGCGCTTCC GCCGCCTGCA CGTCATCGTC GGCGACTCCA ACATGAGCGA GACGACCATG
CTGCTCAAGG TCGGCGCCAC CGACCTGGTG CTGCGCATGA TCGAGGCGGG CACGGTGATG
CGCGACCTGT CGCTGGAGAA CCCGATCCGG GCGATCCGGG AGGTCTCCCA CGACATGACC
GGGCGGCGGC GCGTGCGGCT GGCCAACGGC CGGGAGGCGT CCTCGCTGGA GATCCAGCAG
GAATACCTCT CCAAGGCCCG CGACTTCGTC GACCGCCGCG GCGGCGACGA GATCAGCCAC
CGGGTGCTGG AGCTGTGGGA GCGGACGCTC AACGCCGTCG AGACCGGCAA CCTGGACCTG
GTCGCCAGGG AGATCGACTG GGTGACCAAA TACCAGCTCA TCGAGCGCTA CCGCAAGAAG
TACGACCTTC CCCTGTCGAG CCCCCGGGTC GCCCAGCTCG ACCTGGCCTA CCACGACGTG
CACCGCCGCC GTGGCCTGTT CTACCTCCTG CAGAAGCGCG GCGCGGTGGA GCGGGTGGCC
TCCGACCTGA AGATCTTCGA GGCCAAGTCC GTGCCGCCGC AGACCACCCG GGCGCGGCTG
CGCGGGGAGT TCATCCGCAA GGCGCAGGAG AAGCGCCGCG ACTTCACCGT CGACTGGGTG
CACCTGAAGC TCAACGATCA GGCGCAGCGC ACCGTGCTGT GCAAGGACCC CTTCCGCAGC
GTGGACGAGA GGGTGGACAA ACTTATCGCC GGCATGTGA
 
Protein sequence
MDRRIFGLEN EYGVTCTFRG QRRLSPDEVA RYLFRRVVSW GRSSNVFLRN GARLYLDVGS 
HPEYATPECD NVVELVTHDK AGERILEGLL VDAEKRLREE GIAGDIYLFK NNTDSAGNSY
GCHENYLVGR HGEFGRLADV LIPFLVTRQI VCGAGKVLQT PRGAVYCVSQ RAEHIWEGVS
SATTRSRPII NTRDEPHADA ERFRRLHVIV GDSNMSETTM LLKVGATDLV LRMIEAGTVM
RDLSLENPIR AIREVSHDMT GRRRVRLANG REASSLEIQQ EYLSKARDFV DRRGGDEISH
RVLELWERTL NAVETGNLDL VAREIDWVTK YQLIERYRKK YDLPLSSPRV AQLDLAYHDV
HRRRGLFYLL QKRGAVERVA SDLKIFEAKS VPPQTTRARL RGEFIRKAQE KRRDFTVDWV
HLKLNDQAQR TVLCKDPFRS VDERVDKLIA GM