Gene Sros_4475 details

Gene Information

Locus tag: Sros_4475
Symbol:
ID: 8667769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4988609
End bp: 4991974
Gene Length: 3366 bp
Protein Length: 1121 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003340085
Protein GI: 271965889
COG category:
COG ID:
TIGRFAM ID:
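
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene span includes the terminal stop codon (the listed gene sequence ends in TAG). The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Codon count minus one, assuming the span ends with a single stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4988609, 4991974)
print(length)                              # 3366
print(expected_protein_length_aa(length))  # 1121
```

Both values agree with the Gene Length and Protein Length fields of this record.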


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.161774
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCACG GCAAGCCCTG GCGGCTGCTC CTCGCGGCAT GCGTCGCCGC GAGCGGGCTG 
ACCGCCCTCC CCCCCACCGA GGCCCTGGCG GCCGACACCG TCTTCCACGT GGCCACACAG
GGCGACGACG ACGCCGCGGG CACCGAGGTC GCACCGTTCA GGACGATCAC GCGCGCCCAG
CGCGCCGTAC GCGAGGCCCT GCCCACCGCG ACCGGCCCGC TGCGGGTACG GGTGCGGGGC
GGCGTCTACT ACCTCTCCGA GCCCCTCACC TTCACCCCCG CCGACTCCGG TGCCACCTAC
GAGGCGGCCG CGGGCGAGCC GGTCGTGCTC AGCGGCGGGC GCAGGCTCAC CCCCGCCTGG
ACCACCTACC AGGGCGCCAC CCTGGTCGCC GACATCGGCA AGGACCTCGA CTTCGACGAA
CTCTTCCTGG ACGGCAAGCG GCAGGTCCTG GCCAGGTACC CGAACTTCGA TCCCAAGGTC
GCGGTGCTCA ACGGGTACGC CGCCGACGCG ATCTCCCCGT CGCGGGTGGC CCGCTGGAAG
AACCCCACGA CCGCCCTGGT ACGCGGGCTG CACCAGGGCG AATGGGGCGG CAACTCCTTC
AAGGTGACAG GTGTCGACGG CAACGGCGAC CCGACGCTGC AGTGGGTGGG CGACAACAAC
CGGGGCAGCG GGCTGCACCC GCACAAGCGC ATGGTCGAGA ACGTCCTGGA GGAGCTGGAC
GCGCCGGGGG AGTGGTTCCA CGACAAGGCC GCGGGCAGGC TGTACCTCCA GCCTCCCGCC
GGCGTCGACC CCGCGGCGGT CCGCGTCGAG ACGGCCGAGC GGGAGGAGCT CATCCGCATC
GTGGGCGATT CACCCGCCTC GCCGGTGCAC GACCTCACCT TCGCCGGGTT CACCTTCACC
CAGACGCACC GGACGCTGTT CAGCCGGCCG TACGAAAAAC TGCAGCTGGG TGACTGGGCC
ATCGCCCGCG CGGGTGCGGT CTACCTCAAG AACACCAGAG GGATCACCGT CCGTGACGCC
CGCTTCGACC AGGTCGGCGG CAACGCCGTC TTCATGGACG GATACGCCGA GGGCAACGTC
GTCTCCGGCG GCGACTTCCG CGACTCCGGG GCGAGCGACA TCGCGGTCAT CGGCTCCCAC
GACGCCGTCC GAGAGCGCTC CACCTGGGAC GCCATGCAGC GCACCATCAC CGACACCACG
CCGGGCCCGA AGACCGAGGA CTACCCGCGT GACATCACGA TCACCGGCAA CTACCTGACC
CGCAACGGGC GCTTCGAAAA GCAGACCTCC GGCGTGCAGA TCTCGATGAG CCGCCGGGTC
ACGGTCTCCG GCAACACCGT GCACGACGGG CCGCGCGCCT GCATCGACAT CAACGACGGC
ACCTGGGGCG GGCACGTCAT CGAAGACAAC GACATCTTCG ACTGCGTCAA GGAGACCTCC
GACCACGGGC CGATCAACTC CTGGGGCCGC GACCGCTTCT GGCCCCTGAC CGCAGATGAC
GCCGTGAAGA AGAGTTACGC CAAGCTTGAC GCGATGGAGA CCACGAAGAT CCGGCACAAC
CGGATCTGGC ACTCCTCCCA CTGGGACGTC GACCTCGACG ACGGCTCATC CAACTATGAG
GTCACCGGAA ACCTGCTGCT CAACGGTGGG GTGAAGCTGC GCGAAGGCTT CTTCCGCACG
GTGAGCGGCA ACGTCTTCGT CAACGGCGGC GGGCACTTCC ACGTCTCCTA CGCCGACAAC
GGCGACGTCA TCGAGAAGAA CATCTTCGTC ACCGACGACC CCTACGACTT CATCCAGAGC
GATCCGTCCA CCTCCAAGAC CGTCTACGAC GACAACCTCT TCTGGGACAA CGGCAAGCCC
GTCGCCGACA TCACCGACGC CTGGCGCGCC CGCGGCCTCG ACACCCGCTC GGTCGTCGGC
GACCCCCTGT TCGAGGGGCA GAGCCCCTAC GCCGATCCGG CCAAGCTCGA CTACTCGCTC
AAGGCCGGCT CCCCGGCTCT GGCGCTGGGC TTCAAGCCCT TCGCGATGAC CGGATTCGGC
AAGCCGGGCT CGCCCACGCC GCCGCCGCTC ACCTGGCGCA GGCCGGACAC CGGCATGACG
ATCGGCAACC TGGCCGAGCC GCTCATGGGC GCGACGGTCA CCGAGATCCA CAGCGACGAG
GTGAAGTCCT CCGTCGGGCT CACCGACTAC GACGGCCTGT TCTTCGCCGC CGTGCCCGCC
GACTCCCATG CCTGGCGGCA GGGGCTGCGC ACCGGCGACG TCATCAGGGA GATCGCCGAC
GTCAAGGTGA GCGACCGGAA CAGCTTCTGG AAGGTCTACA ACCGCACCCC AGCCGGACGG
CCGACGGCGC TGAAGGTCTG GCGCAACCAG GCGCTGACGG ACTTGAGCCT GACCAAGGCC
GCCGGCGTGG AGACGATCGA CAACGTCTCC GGCGTCACCT ACACCGGCAC CGGCTGGGAC
TGGAAGAACG ACCAGCGTGG CGGCGCCCGA TCCACCCTCG ACGACCTGCA CGCCACCCAG
ACCGACGGGG ACGCCTTCGA ACTGGCCTTC CACGGCACCG GCGTCGAATA CATCGCCCAG
GTCAACTCCG ACGAGGGCAA GGTCGACCTC TACCTGGACG GCAAGCTCGA CACCACCATC
GACAACCACA GCCCGACCCG CGAGTACCAG AAGGTCGTCT ACACCAGGAC CGGCCTGGCA
CCGGGGCCGC ACACGCTCAA GGGAGTGAAA AAGGACGGCT CCTACTTCAT CGTCGACGGA
TTCAAGATCC ACACTGTTCC CGGGGGCGAC GGCGAGGCGC CGGTGACCAC CGCGTCCGGG
GTGCCCTCAG GCTGGACCTC CCAGCCCGTG CAGGTGGTAC TGACCGCCTC CGACGCGGGC
ACCGGCGTGG AGCGGACCGA ATACCACCTC GGCGACGGCT CCTGGCGGGC CTACACCGGT
CCCGTGCGCG TGGACCGTGA GGGGGAGAGC GTACTGACCT TCCGCAGCGT CGACCGGGCC
GGCAACGTCG AGGAGTCCCG GTCGGTCGCC GTCAGGATCG ACACGACGGC TCCCACGCTG
ACCGTCACCC CCAGCCCCGA CCGGTTGTGG CCGCCGAACC ACAGGCTGGT GCCGGTGGAG
ATCTCGCTGA AGGCGGCCGA CGCCGGTCCC GTCACGGTCA CCCTGGTCTC GATCACCAGC
TCGTCGGCCG GGGCGGAGGA CGACGTCCGC GATGCCTCGT ACGGCACGGC CGACATCTCC
TTCGCGCTGC GTGCGGAGCG GGATGGCGGG TCGGCCCGGG TCTACAGGAT CACGTACCGG
GTGGTGGACC GGGCGGGCAA CGCCACTGTG GCCCACACCC GGGTGAGCGT ACGGCAGGCC
GCCTAG
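
A GC content figure like the one in the Gene Information section can be recomputed from a sequence in this 10-base block layout. The sketch below is a hedged illustration: `gc_content` is a hypothetical helper, and it assumes only the characters A, C, G and T count as bases, with spaces and line breaks ignored.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence above.
print(round(gc_content("ATGCGTCACG") * 100))  # 60
```

Passing the full gene sequence text to the same function gives the gene-wide fraction.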
 
Protein sequence
MRHGKPWRLL LAACVAASGL TALPPTEALA ADTVFHVATQ GDDDAAGTEV APFRTITRAQ 
RAVREALPTA TGPLRVRVRG GVYYLSEPLT FTPADSGATY EAAAGEPVVL SGGRRLTPAW
TTYQGATLVA DIGKDLDFDE LFLDGKRQVL ARYPNFDPKV AVLNGYAADA ISPSRVARWK
NPTTALVRGL HQGEWGGNSF KVTGVDGNGD PTLQWVGDNN RGSGLHPHKR MVENVLEELD
APGEWFHDKA AGRLYLQPPA GVDPAAVRVE TAEREELIRI VGDSPASPVH DLTFAGFTFT
QTHRTLFSRP YEKLQLGDWA IARAGAVYLK NTRGITVRDA RFDQVGGNAV FMDGYAEGNV
VSGGDFRDSG ASDIAVIGSH DAVRERSTWD AMQRTITDTT PGPKTEDYPR DITITGNYLT
RNGRFEKQTS GVQISMSRRV TVSGNTVHDG PRACIDINDG TWGGHVIEDN DIFDCVKETS
DHGPINSWGR DRFWPLTADD AVKKSYAKLD AMETTKIRHN RIWHSSHWDV DLDDGSSNYE
VTGNLLLNGG VKLREGFFRT VSGNVFVNGG GHFHVSYADN GDVIEKNIFV TDDPYDFIQS
DPSTSKTVYD DNLFWDNGKP VADITDAWRA RGLDTRSVVG DPLFEGQSPY ADPAKLDYSL
KAGSPALALG FKPFAMTGFG KPGSPTPPPL TWRRPDTGMT IGNLAEPLMG ATVTEIHSDE
VKSSVGLTDY DGLFFAAVPA DSHAWRQGLR TGDVIREIAD VKVSDRNSFW KVYNRTPAGR
PTALKVWRNQ ALTDLSLTKA AGVETIDNVS GVTYTGTGWD WKNDQRGGAR STLDDLHATQ
TDGDAFELAF HGTGVEYIAQ VNSDEGKVDL YLDGKLDTTI DNHSPTREYQ KVVYTRTGLA
PGPHTLKGVK KDGSYFIVDG FKIHTVPGGD GEAPVTTASG VPSGWTSQPV QVVLTASDAG
TGVERTEYHL GDGSWRAYTG PVRVDREGES VLTFRSVDRA GNVEESRSVA VRIDTTAPTL
TVTPSPDRLW PPNHRLVPVE ISLKAADAGP VTVTLVSITS SSAGAEDDVR DASYGTADIS
FALRAERDGG SARVYRITYR VVDRAGNATV AHTRVSVRQA A
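
The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is an illustrative sketch, not the database's own pipeline. It uses the amino acid assignments of NCBI translation table 11 in the standard compact TCAG ordering. It does not handle alternative start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are assumptions made for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order (table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace; stop at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGTCACG GCAAGCCCTG GCGGCTGCTC"))  # MRHGKPWRLL
```

The output matches the first block of the protein sequence, and the final codons of the gene (GCC TAG) translate to the terminal A followed by a stop.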