Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_6163 |
Symbol | |
ID | 8669465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 6761418 |
End bp | 6764504 |
Gene Length | 3087 bp |
Protein Length | 1028 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003341636 |
Protein GI | 271967440 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.367103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000303541 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAACGTTC TTACCCGGAT AAGCCTCGCC AACAGGGCGC TGGTGGCACT TCTGGCGATC GCGGTGCTGA TATTCGGAGC GATCGCCACC ATGTCGCTGA AACAGGAGCT GCTCCCGTCG TTCTCCGTGC CCACGGTGAG CGTGACCGCG ATCTATCCCG GGGCCGCGCC GCAGATCGTC GAGCGCAACG TCACCGAGAA GATCGAGGAG GCCGTCGAGG GAGGCGCAGG CCAGAAGAGG ATCACCTCGT TCTCCCGCGA CGGCCTGGCG GGAGTCTCCG TCGAATACGA CCACGGCACC GATCTCGACG GGGCGATCCA GGATCTGCAG CAGCGGATCG GCCGGATCCA GCCGGAGCTC CCGTCCCAGG TGACCCCGCG GGTCGCGCCG GGGATGTCGA CCGAGTTCCC GGTCCTGAGC CTGGCGGCCA CCGGCGACGA CGAGCGCCGG CTGGCCCGGC TGCTGAAGGA ACGCGTCCCC GGCGAGCTGG GCGGCATCGA CGGCGTGAGC CGGGTGCTGG TCAGCGGCGA GCGGGCCGAG ACCGTGGAGA TCCGCCTCGA CCAGGACAGG CTCCGCAAGT GGGGCCTCAC CGCCGACGCG GTCGCCGCCA CGCTGGGCTC CAGCGGGTCG GTCGTGCCCG TCGGCACCGT CACCGCCGGC GGCGAGTCGC TCGCCCTGCA GGTCGGCGAG AGCTTCGACT CCCTCGCGGA CGTGCGCGGC ATCCACCTGG CGCCGAAGGT CACGCTGGAC GACGTCGCCG ACGTGCGCCT CGTCGAGGAG CGTCCGGAGA CGCTGACCCG CACCGACGGC AGGCCCAGCC TGAGCGTCTC GGTCATGGCC AAGCCGACGG GCAACACGGT GGCGATCGCC AAGGAGGTGA AGCAGAGGCT GCCGGAGCTC ACCCGGTCCC TGGGCGGCGC GGCCACCCTG ACCGTCGCGT TCGACCAGTC GGAGTTCATC GAGAAGTCCA TCGGCGACCT GACCACCGAG GGCCTGCTGG GACTGGGCTT CGCCGCCCTG ATCATCCTTG TCTTCCTCTT CTCCTTCCGG TCCACCGTCG TGACGGTGGT CTCGATCCCC CTCTCGCTGG TGGTCGCGCT CATCGGCCTG TGGGTCAACG GCTTCACGCT CAACATGCTG ACGCTCGGCG CGCTGACCAT CGCCGTCGGC CGGGTCGTGG ACGACTCCAT CGTGGTCATC GAGAACATCA AACGCCATCT GGACCACGGC GAGGCCAAGC TCCACGCGGT CCAAGCGGGC ACCCGCGAGG TTGCCGGGGC GGTCACCGCC TCCACGCTGA CCACGGTCGC CGTCTTCCTG CCCATCGCGT TCACGGGCGG GATCACCGGA AGCCTGTTCT CGGCGTTCGC GTTGACCGTC ACCATCGCCC TGCTGGCCTC GCTGCTGGTG TCGCTGACGG TCATCCCCGT GCTCGCCTAC TGGTTCCTCA AGACGCCCAA GGCCGGGGTG GCCCGCGAAG TCGTAGAGGC CAGGGAGCGC AGCGGCGTGC TCCAGCGCGT CTACGTCCCG GTCATCGGGT TCGCGGTGCG GCGGCGCTGG GTGACGCTCC TGGCCGCCCT CGGGATCCTC GTGGCCACCG GGGGCATGGC GGGCCGGCTC ACGACCGACT TCCTGGGCAG CTCCGGACAG GACACCTTCC AGGTGCGGCA GGAGCTGGCC GCCGGCACGA GCCTGGCGGC GGCAGACCAG GCGGCCCGCG CGGTGGAGCG GGCGGTCAAG GACGTGCCGG GCCTGAAGTC CTACCAGGTC ACCATCGGGC CGGACAACAG TGAGGAGGGC GGGGCGCGCA ACGTGGCGAC GTTCTCCGTC ACCGCCGAGG AGGGCGCGGA CGTGGCGGCC CTCCGCGAGG CCGTCCGGAC CCGGGTCACC GAGCTGGCCG CGTCCGACCC GAAGACCGGC AAGGTGACCG TCCAGGGCGA TCAGGGCGGC CTGACCTCCA CCGACCTCGC GGTCACCGTG TCGGCGGGCG ACGATGTCTC CCTGGCACGC GCGGCGGAGC TGGTCGGCGG GGCCATGCGG CAGACGCCCG GCACCTCGGA GGTCAGGTCA GGCCTGGCGG GGACGGCCCG GCAGATCGCG GTGCGGGTCG ACGGCGAGGC GGCCGTCGCA CGCGGGCTCA CCGAGGGCCA GGTAACGCAG GCGGTGGCCC GGGTGACCCA GGGGCAGCGG GTCTCCCGGG TCACCCTGGA CGGCGCCGAG CGTGAGATGT CCCTCCGGGT AGGCGCCCCG GCGGACGACC TGACCGCGCT GCGGAACCTG AGGATCCCCA CGCCGCTCGG CCGTACCGTC AAGCTCGACG ACGTGGCCAC GGTCGAGACC GTGACGGCGC CCACCCGGCT GACGCGGATC GACGGGGAGC GCGCGACCAC GGTGAGCGCG AAGTTCTCCG GCAAGGACCT CGGCGCGGCG AGTTCCACGC TCTCCAGGCG GCTGGGAGAG CTGAAGCTGC CCGCCGGCGC CTCCGCCGAG ATCGGCGGGG TCAGCGAGCA GCAGACCAGC TCGTTCAACA GCCTGTTCGT GGCCCTCGGC GCGGCGATCC TGATCGTCTA CCTGATCATG GTGGCCACGT TCCGGAGCCT GGTGCACCCG CTCATGCTGC TGGTGTCGAT CCCGTTCGCG GCGACCGGCG CCCTCGGTCT GCTGGTGGTC ACCGGCACCC CGCTCGGCCT GCCCAGCCTG ATCGGCATGC TGATGCTGGT CGGCATCGTG GTCACCAACG CGATCGTGCT GCTCGACCTG GTCCGCCAGT ATCGCGACGG CGGCATGAGC GCCCGTGAGG CGGTCATCGA GGGCGGCAGG CACCGGGTCC GCCCGGTCGT CATGACCGCG CTGGCGACCA TCTGCGCGCT GACGCCGATG GCCACCGGCC TGACCGGGGA GGGCGGTTTC CTGTCCAAGC CCCTGGCCGT GGTCGTCATC GGGGGCCTGA CCAGCTCCAC GCTGCTGACC CTGATCCTGC TGCCGACCCT CTACACGATC GTCGAGGACG TCAAGGACCG GTTCCGGGGC TCCCGCCGCC CGGATCCCGC CCCGCTCCCC GAGCGGGAGC CGGCGCACGC GGGCTGA
|
Protein sequence | MNVLTRISLA NRALVALLAI AVLIFGAIAT MSLKQELLPS FSVPTVSVTA IYPGAAPQIV ERNVTEKIEE AVEGGAGQKR ITSFSRDGLA GVSVEYDHGT DLDGAIQDLQ QRIGRIQPEL PSQVTPRVAP GMSTEFPVLS LAATGDDERR LARLLKERVP GELGGIDGVS RVLVSGERAE TVEIRLDQDR LRKWGLTADA VAATLGSSGS VVPVGTVTAG GESLALQVGE SFDSLADVRG IHLAPKVTLD DVADVRLVEE RPETLTRTDG RPSLSVSVMA KPTGNTVAIA KEVKQRLPEL TRSLGGAATL TVAFDQSEFI EKSIGDLTTE GLLGLGFAAL IILVFLFSFR STVVTVVSIP LSLVVALIGL WVNGFTLNML TLGALTIAVG RVVDDSIVVI ENIKRHLDHG EAKLHAVQAG TREVAGAVTA STLTTVAVFL PIAFTGGITG SLFSAFALTV TIALLASLLV SLTVIPVLAY WFLKTPKAGV AREVVEARER SGVLQRVYVP VIGFAVRRRW VTLLAALGIL VATGGMAGRL TTDFLGSSGQ DTFQVRQELA AGTSLAAADQ AARAVERAVK DVPGLKSYQV TIGPDNSEEG GARNVATFSV TAEEGADVAA LREAVRTRVT ELAASDPKTG KVTVQGDQGG LTSTDLAVTV SAGDDVSLAR AAELVGGAMR QTPGTSEVRS GLAGTARQIA VRVDGEAAVA RGLTEGQVTQ AVARVTQGQR VSRVTLDGAE REMSLRVGAP ADDLTALRNL RIPTPLGRTV KLDDVATVET VTAPTRLTRI DGERATTVSA KFSGKDLGAA SSTLSRRLGE LKLPAGASAE IGGVSEQQTS SFNSLFVALG AAILIVYLIM VATFRSLVHP LMLLVSIPFA ATGALGLLVV TGTPLGLPSL IGMLMLVGIV VTNAIVLLDL VRQYRDGGMS AREAVIEGGR HRVRPVVMTA LATICALTPM ATGLTGEGGF LSKPLAVVVI GGLTSSTLLT LILLPTLYTI VEDVKDRFRG SRRPDPAPLP EREPAHAG
|
| |