Gene Sros_4099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4099 
Symbol 
ID8667393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4557959 
End bp4560025 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003339750 
Protein GI271965554 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.296122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.569899 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTGC CCGGAGGCCT GCAGACCGTC GTCGTGACCG GACAGTATCT GACCCCCGAC 
GGCGAGCCCC GCCGGGGCAG CGTGCTCATC GAGCCGGAGC CCGACGCCCT CACCAGCGCC
GAGCACGGCC TCATCGTCCT CGGCGGGACC GAGGCGCAGC TCGACGACTC CGGCCGCTTC
AGCCTCGAAC TGCTGGCCAC GGACGCCGCC GGGGTCACCC CCTCCAGCTG GGCCTATCGC
GTCACCGAGC GGTGGCCCGA CGCTCCCGAC CGCAGCTACT CCCTGGAGCT GCCAGCCGCC
GAGCCGGCCG TCTCCCTGCC CGAGGCCGCT CTCTCCGGCA GGGCGCGCGC CGCCGGCGAC
GACCCCGCCG GAGACCCTCC GGCGGGTACG GCCGGCGCCC GGGGACGCGC GGGCGACGGG
AGGAGGCCCG CGGCCGGAGC CATGGACTCC CTCGTCTACG ACGCCCGCGA GCACGGCCTC
ACCGGTGACG GGGTCACGAA CGACCAGCCG GCCCTGGCGG AGCTGGTCGA CGTGGCGGGC
GCGGCCTGCG CCGCCGACGG CCGGGCGCGC GTCATCCACT GCCCGCCCGG CGTCTACTCC
ATCCGGGACT CGGGCACCGT GTGGCGGAGC GGGGTGTCGC TCATCGGCGC GGGCGCGGCC
GCGACCCGGT TCGTCCTGTC GAACTCCGGA AATCCCGCCG ACCCGACACC GCTGGCCTTC
TTCACCGCCA TCCAGCACGG CGCCGGCCCC GACAACCACC TCGCCGACTG CACCTTCGCC
GACTTCGAGA TCGACGGCTC CGGCGTCGCC CTCGCGGAGT ACGACGTGCT CGCCAAGGGA
CTGGGCCTGC AGTACGTGCT CCGGGGGCGG TTCCGCAACC TGTACATCCA TCACACCGCC
GCCTCGGGCT TCGGCTGCGA CTTCCTGCAG GACTCGGTGG TGGAGAGCAT CGTCGCCATC
GGCTGCGGGC GGCTGGACAG CGGCGAGCAG ATAGGCGGCG CCGGTCTCGG CATCGGGATC
GGCGGCTGGG GCGCGGTCGA ACGCCTCACC ATCATCGGCT GCACCGCCGT CGGCAACGGG
ACCAACGGCG TCTTCCTGGA ACTCCAGGAT CGGGAGTGGA CCCCGCCGCG CGGCATCCGC
ATCACCAACT GCCACGCCGA GGGCAACAGG TACGGCATCT CCGACTGGGG TGCCGACGGG
CTGATCGTCG CCGCCTGCAC CATGATCGGC AACCAGGTGG CCGGCTACGA CGTCTCCGGC
CTGGGCACGA CCTCCGTCGC CGGGAGGGGA GGCATCGTCA CCGGCTGCGT CGTCGACGGC
AACGTCCGCG ACGGCATCAG CATCGGCAAC ACCCCCGGCC GCTACACCGT CGAGGGCAAC
CGCATCAGCC GTAACGGCCG CCATGGCTAC CGGCAGCACA ACCTGCCCGG CGGGCCCGCG
CACGCCTCCA TCGAGATGGT CCTCGACGGC AACGACATCT GGGGCAACGC CCTCGACGGC
GTCCGGGTCG ACGGCACCCT CGTCGACGCC GCCCTGCTCG GCAACCGGAT CCGCGACAAC
GGGTGCCGGG CGGCCCCCGA GGCGTCCGGC GGGGGCGCCG GCGTCACCTA CACCGCCACC
TCCCTGACCG ACGCCGCGGC CGCCTGGCCG CCCGACGGGC ACCGCGGCAA GATACTCACC
GCCGGCGCCC GCACGGCGGT CGTCACCGCG AACACCGCCA CCGAACTCGT CCTCGCCCCC
TTCCGCCCCG GCGTGACGAC GGCCTGGATC GGCGATCCCC CCGCGCCGGG CACCCCGTAC
AGCCTGCCCG GCTCTCCCGC GGTCCGGGCC GGTATCAGCC TGAACGCGCC GACCCTCAGT
CCCACCGTCC GGGGCAACCG TGTCTGGGAC AACCAGGATC CCAAGACGCA GACCCACGGG
CTGTGGATCA CCGCCGAGGG CAGCTGCGTG TCGGGCGCGG TGGAGGACAA CGACCTGGCG
GGCAACGCCG TCGCCGCGGT CCGTTTCGAC ACCGCCCCCT CCGGCGGCCG CTGGGAACGC
GACCACGGTC TCGACGGCCG CTCCTGA
 
Protein sequence
MSLPGGLQTV VVTGQYLTPD GEPRRGSVLI EPEPDALTSA EHGLIVLGGT EAQLDDSGRF 
SLELLATDAA GVTPSSWAYR VTERWPDAPD RSYSLELPAA EPAVSLPEAA LSGRARAAGD
DPAGDPPAGT AGARGRAGDG RRPAAGAMDS LVYDAREHGL TGDGVTNDQP ALAELVDVAG
AACAADGRAR VIHCPPGVYS IRDSGTVWRS GVSLIGAGAA ATRFVLSNSG NPADPTPLAF
FTAIQHGAGP DNHLADCTFA DFEIDGSGVA LAEYDVLAKG LGLQYVLRGR FRNLYIHHTA
ASGFGCDFLQ DSVVESIVAI GCGRLDSGEQ IGGAGLGIGI GGWGAVERLT IIGCTAVGNG
TNGVFLELQD REWTPPRGIR ITNCHAEGNR YGISDWGADG LIVAACTMIG NQVAGYDVSG
LGTTSVAGRG GIVTGCVVDG NVRDGISIGN TPGRYTVEGN RISRNGRHGY RQHNLPGGPA
HASIEMVLDG NDIWGNALDG VRVDGTLVDA ALLGNRIRDN GCRAAPEASG GGAGVTYTAT
SLTDAAAAWP PDGHRGKILT AGARTAVVTA NTATELVLAP FRPGVTTAWI GDPPAPGTPY
SLPGSPAVRA GISLNAPTLS PTVRGNRVWD NQDPKTQTHG LWITAEGSCV SGAVEDNDLA
GNAVAAVRFD TAPSGGRWER DHGLDGRS