Gene Information |
Locus tag | Sros_4475 |
Symbol | |
ID | 8667769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4988609 |
End bp | 4991974 |
Gene Length | 3366 bp |
Protein Length | 1121 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003340085 |
Protein GI | 271965889 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.161774 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCACG GCAAGCCCTG GCGGCTGCTC CTCGCGGCAT GCGTCGCCGC GAGCGGGCTG ACCGCCCTCC CCCCCACCGA GGCCCTGGCG GCCGACACCG TCTTCCACGT GGCCACACAG GGCGACGACG ACGCCGCGGG CACCGAGGTC GCACCGTTCA GGACGATCAC GCGCGCCCAG CGCGCCGTAC GCGAGGCCCT GCCCACCGCG ACCGGCCCGC TGCGGGTACG GGTGCGGGGC GGCGTCTACT ACCTCTCCGA GCCCCTCACC TTCACCCCCG CCGACTCCGG TGCCACCTAC GAGGCGGCCG CGGGCGAGCC GGTCGTGCTC AGCGGCGGGC GCAGGCTCAC CCCCGCCTGG ACCACCTACC AGGGCGCCAC CCTGGTCGCC GACATCGGCA AGGACCTCGA CTTCGACGAA CTCTTCCTGG ACGGCAAGCG GCAGGTCCTG GCCAGGTACC CGAACTTCGA TCCCAAGGTC GCGGTGCTCA ACGGGTACGC CGCCGACGCG ATCTCCCCGT CGCGGGTGGC CCGCTGGAAG AACCCCACGA CCGCCCTGGT ACGCGGGCTG CACCAGGGCG AATGGGGCGG CAACTCCTTC AAGGTGACAG GTGTCGACGG CAACGGCGAC CCGACGCTGC AGTGGGTGGG CGACAACAAC CGGGGCAGCG GGCTGCACCC GCACAAGCGC ATGGTCGAGA ACGTCCTGGA GGAGCTGGAC GCGCCGGGGG AGTGGTTCCA CGACAAGGCC GCGGGCAGGC TGTACCTCCA GCCTCCCGCC GGCGTCGACC CCGCGGCGGT CCGCGTCGAG ACGGCCGAGC GGGAGGAGCT CATCCGCATC GTGGGCGATT CACCCGCCTC GCCGGTGCAC GACCTCACCT TCGCCGGGTT CACCTTCACC CAGACGCACC GGACGCTGTT CAGCCGGCCG TACGAAAAAC TGCAGCTGGG TGACTGGGCC ATCGCCCGCG CGGGTGCGGT CTACCTCAAG AACACCAGAG GGATCACCGT CCGTGACGCC CGCTTCGACC AGGTCGGCGG CAACGCCGTC TTCATGGACG GATACGCCGA GGGCAACGTC GTCTCCGGCG GCGACTTCCG CGACTCCGGG GCGAGCGACA TCGCGGTCAT CGGCTCCCAC GACGCCGTCC GAGAGCGCTC CACCTGGGAC GCCATGCAGC GCACCATCAC CGACACCACG CCGGGCCCGA AGACCGAGGA CTACCCGCGT GACATCACGA TCACCGGCAA CTACCTGACC CGCAACGGGC GCTTCGAAAA GCAGACCTCC GGCGTGCAGA TCTCGATGAG CCGCCGGGTC ACGGTCTCCG GCAACACCGT GCACGACGGG CCGCGCGCCT GCATCGACAT CAACGACGGC ACCTGGGGCG GGCACGTCAT CGAAGACAAC GACATCTTCG ACTGCGTCAA GGAGACCTCC GACCACGGGC CGATCAACTC CTGGGGCCGC GACCGCTTCT GGCCCCTGAC CGCAGATGAC GCCGTGAAGA AGAGTTACGC CAAGCTTGAC GCGATGGAGA CCACGAAGAT CCGGCACAAC CGGATCTGGC ACTCCTCCCA CTGGGACGTC GACCTCGACG ACGGCTCATC CAACTATGAG GTCACCGGAA ACCTGCTGCT CAACGGTGGG GTGAAGCTGC GCGAAGGCTT CTTCCGCACG GTGAGCGGCA ACGTCTTCGT CAACGGCGGC GGGCACTTCC ACGTCTCCTA CGCCGACAAC GGCGACGTCA TCGAGAAGAA CATCTTCGTC ACCGACGACC CCTACGACTT CATCCAGAGC 
GATCCGTCCA CCTCCAAGAC CGTCTACGAC GACAACCTCT TCTGGGACAA CGGCAAGCCC GTCGCCGACA TCACCGACGC CTGGCGCGCC CGCGGCCTCG ACACCCGCTC GGTCGTCGGC GACCCCCTGT TCGAGGGGCA GAGCCCCTAC GCCGATCCGG CCAAGCTCGA CTACTCGCTC AAGGCCGGCT CCCCGGCTCT GGCGCTGGGC TTCAAGCCCT TCGCGATGAC CGGATTCGGC AAGCCGGGCT CGCCCACGCC GCCGCCGCTC ACCTGGCGCA GGCCGGACAC CGGCATGACG ATCGGCAACC TGGCCGAGCC GCTCATGGGC GCGACGGTCA CCGAGATCCA CAGCGACGAG GTGAAGTCCT CCGTCGGGCT CACCGACTAC GACGGCCTGT TCTTCGCCGC CGTGCCCGCC GACTCCCATG CCTGGCGGCA GGGGCTGCGC ACCGGCGACG TCATCAGGGA GATCGCCGAC GTCAAGGTGA GCGACCGGAA CAGCTTCTGG AAGGTCTACA ACCGCACCCC AGCCGGACGG CCGACGGCGC TGAAGGTCTG GCGCAACCAG GCGCTGACGG ACTTGAGCCT GACCAAGGCC GCCGGCGTGG AGACGATCGA CAACGTCTCC GGCGTCACCT ACACCGGCAC CGGCTGGGAC TGGAAGAACG ACCAGCGTGG CGGCGCCCGA TCCACCCTCG ACGACCTGCA CGCCACCCAG ACCGACGGGG ACGCCTTCGA ACTGGCCTTC CACGGCACCG GCGTCGAATA CATCGCCCAG GTCAACTCCG ACGAGGGCAA GGTCGACCTC TACCTGGACG GCAAGCTCGA CACCACCATC GACAACCACA GCCCGACCCG CGAGTACCAG AAGGTCGTCT ACACCAGGAC CGGCCTGGCA CCGGGGCCGC ACACGCTCAA GGGAGTGAAA AAGGACGGCT CCTACTTCAT CGTCGACGGA TTCAAGATCC ACACTGTTCC CGGGGGCGAC GGCGAGGCGC CGGTGACCAC CGCGTCCGGG GTGCCCTCAG GCTGGACCTC CCAGCCCGTG CAGGTGGTAC TGACCGCCTC CGACGCGGGC ACCGGCGTGG AGCGGACCGA ATACCACCTC GGCGACGGCT CCTGGCGGGC CTACACCGGT CCCGTGCGCG TGGACCGTGA GGGGGAGAGC GTACTGACCT TCCGCAGCGT CGACCGGGCC GGCAACGTCG AGGAGTCCCG GTCGGTCGCC GTCAGGATCG ACACGACGGC TCCCACGCTG ACCGTCACCC CCAGCCCCGA CCGGTTGTGG CCGCCGAACC ACAGGCTGGT GCCGGTGGAG ATCTCGCTGA AGGCGGCCGA CGCCGGTCCC GTCACGGTCA CCCTGGTCTC GATCACCAGC TCGTCGGCCG GGGCGGAGGA CGACGTCCGC GATGCCTCGT ACGGCACGGC CGACATCTCC TTCGCGCTGC GTGCGGAGCG GGATGGCGGG TCGGCCCGGG TCTACAGGAT CACGTACCGG GTGGTGGACC GGGCGGGCAA CGCCACTGTG GCCCACACCC GGGTGAGCGT ACGGCAGGCC GCCTAG
|
Protein sequence | MRHGKPWRLL LAACVAASGL TALPPTEALA ADTVFHVATQ GDDDAAGTEV APFRTITRAQ RAVREALPTA TGPLRVRVRG GVYYLSEPLT FTPADSGATY EAAAGEPVVL SGGRRLTPAW TTYQGATLVA DIGKDLDFDE LFLDGKRQVL ARYPNFDPKV AVLNGYAADA ISPSRVARWK NPTTALVRGL HQGEWGGNSF KVTGVDGNGD PTLQWVGDNN RGSGLHPHKR MVENVLEELD APGEWFHDKA AGRLYLQPPA GVDPAAVRVE TAEREELIRI VGDSPASPVH DLTFAGFTFT QTHRTLFSRP YEKLQLGDWA IARAGAVYLK NTRGITVRDA RFDQVGGNAV FMDGYAEGNV VSGGDFRDSG ASDIAVIGSH DAVRERSTWD AMQRTITDTT PGPKTEDYPR DITITGNYLT RNGRFEKQTS GVQISMSRRV TVSGNTVHDG PRACIDINDG TWGGHVIEDN DIFDCVKETS DHGPINSWGR DRFWPLTADD AVKKSYAKLD AMETTKIRHN RIWHSSHWDV DLDDGSSNYE VTGNLLLNGG VKLREGFFRT VSGNVFVNGG GHFHVSYADN GDVIEKNIFV TDDPYDFIQS DPSTSKTVYD DNLFWDNGKP VADITDAWRA RGLDTRSVVG DPLFEGQSPY ADPAKLDYSL KAGSPALALG FKPFAMTGFG KPGSPTPPPL TWRRPDTGMT IGNLAEPLMG ATVTEIHSDE VKSSVGLTDY DGLFFAAVPA DSHAWRQGLR TGDVIREIAD VKVSDRNSFW KVYNRTPAGR PTALKVWRNQ ALTDLSLTKA AGVETIDNVS GVTYTGTGWD WKNDQRGGAR STLDDLHATQ TDGDAFELAF HGTGVEYIAQ VNSDEGKVDL YLDGKLDTTI DNHSPTREYQ KVVYTRTGLA PGPHTLKGVK KDGSYFIVDG FKIHTVPGGD GEAPVTTASG VPSGWTSQPV QVVLTASDAG TGVERTEYHL GDGSWRAYTG PVRVDREGES VLTFRSVDRA GNVEESRSVA VRIDTTAPTL TVTPSPDRLW PPNHRLVPVE ISLKAADAGP VTVTLVSITS SSAGAEDDVR DASYGTADIS FALRAERDGG SARVYRITYR VVDRAGNATV AHTRVSVRQA A