Gene Information |
Locus tag | Sros_4506 |
Symbol | |
ID | 8667800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5022696 |
End bp | 5024321 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | chaperonin GroEL |
Protein accession | YP_003340114 |
Protein GI | 271965918 |
COG category | |
COG ID | |
TIGRFAM ID | |
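The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is
    counted in the gene length and is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Sros_4506.
length = gene_length_bp(5022696, 5024321)
protein = protein_length_aa(length)
print(length, protein)
```

With the values in this record the check gives 1626 bp and 541 aa, matching the Gene Length and Protein Length fields.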
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.030252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.702116 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGA AAATGATCGC CTTTGACGAG GACGCCCGGC GCGGCCTGGA GCGCGGTATG AACCAGCTCG CGGACGCCGT CAAGGTGACC CTCGGCCCCA AGGGCCGTAA CGTCGTCCTG GAGAAGAAGT GGGGCGCACC CACGATCACC AACGACGGTG TCTCCATCGC CAAGGAGATC GAGCTCGAGG ACCCGTGGGA GAAGATCGGG GCCGAGCTCG TCAAGGAAGT CGCCAAGAAG ACCGACGACG TCGCGGGCGA CGGCACCACC ACCGCCACCG TGCTCGCCCA GGCTCTCGTA CGTGAGGGCC TGCGCAACGT CGCCGCCGGC GCCAACCCGA TGTCCCTGAA GAAGGGCATC GAGGCCGCCG TCGAGCGCGT CTCCGAGGAG CTGTCCAATC TGGCCAAGGA CGTGGAGACC AAGGAGCAGA TCGCCTCCAC CGCCTCCATC TCCGCCGCCG ATCCCGAGAT CGGCTCGCTC ATCGCCGAGG CGATGGACAA GGTCGGCAAG GAAGGAGTCA TCACCGTCGA GGAGAGCAAC ACCTTCGGCC TGGAGCTTGA GCTCACCGAG GGCATGCGCT TCGACAAGGG CTACATCTCC CCCATCTTCA TCACCGACCC CGACCGGCTC GAAGCGGTGC TCGACGAGCC GTACGTGTTG CTCGTGAGCG GCAAGGTCGC CGCCAACAGG GAGGTGCTGC CGGTCCTCGA CAAGGTCGTG CAGTCGGGCA GGCCGCTGCT GGTCATCGCC GAGGACATCG AGGGCGAGGC CCTGGCCACC CTGGTCGTCA ACAAGATGAA GGGTCTCTTC CGGTCGGTCG CGGTCAAGGC GCCGGGCTTC GGTGACCGCC GCAAGGCCAT GCTGGGCGAC ATCGGCATCC TGACCGGTGC CCAGGTCATC AGCGAGGACC TCGGCCTCAA GCTGGAGTCC ACCACGCTGG ACCAGCTCGG CCGCGCCCGC CAGGTCATCG TCACCAAGGA CGAGACCACC ATCGTCGACG GTGGCGGCGA CGCCGAGCAG ATCGCCGGCC GGGTCAACCA GATCCGCGCC GAGATCGACA ACACCGACTC CGACTACGAC CGCGAGAAGC TCCAGGAGCG TCTGGCCAAG CTGGCCGGCG GCGTGGCCGT CATCAAGGCC GGCGCGGCGA CCGAGGTCGA GCTCAAGGAG CGCAAGCACC GCATCGAGGA CGCCGTTCGC AACGCGAAGG CGGCCGTCGA GGAGGGCATC GTCCCCGGCG GTGGCGTGGC CCTGCTGCAG GCCGGCGCCA AGGCGTTCGA CAAGCTGGAG CTGTCCGGCG ACGAGGCCAC CGGTGCCGCG ATCGTGAAGA AGGCTCTTGA GGAGCCGCTG AAGCAGATCG CCGTCAACGC CGGCCTTGAG GGCGGCGTCG TGGTGGAGAA GGTGCGCAAC CTCACCCCGG GTGAGGGCCT GAACGCCGCC ACCGGCGAGT ACGTCAACAT GTTCGAGTCG GGCATCATCG ACCCGGCCAA GGTGACGCGC TCCGCGCTGC AGAACGCCGC TTCCATCGCG GCGCTCTTCC TCACCACCGA GGCCGTCATC GCCGAGAAGC CCGAGAAGGC CGGAGCCGCT CCCGCCATGC CGGGCGGCGG CGACATGGAC TTCTAG
|
Protein sequence | MTAKMIAFDE DARRGLERGM NQLADAVKVT LGPKGRNVVL EKKWGAPTIT NDGVSIAKEI ELEDPWEKIG AELVKEVAKK TDDVAGDGTT TATVLAQALV REGLRNVAAG ANPMSLKKGI EAAVERVSEE LSNLAKDVET KEQIASTASI SAADPEIGSL IAEAMDKVGK EGVITVEESN TFGLELELTE GMRFDKGYIS PIFITDPDRL EAVLDEPYVL LVSGKVAANR EVLPVLDKVV QSGRPLLVIA EDIEGEALAT LVVNKMKGLF RSVAVKAPGF GDRRKAMLGD IGILTGAQVI SEDLGLKLES TTLDQLGRAR QVIVTKDETT IVDGGGDAEQ IAGRVNQIRA EIDNTDSDYD REKLQERLAK LAGGVAVIKA GAATEVELKE RKHRIEDAVR NAKAAVEEGI VPGGGVALLQ AGAKAFDKLE LSGDEATGAA IVKKALEEPL KQIAVNAGLE GGVVVEKVRN LTPGEGLNAA TGEYVNMFES GIIDPAKVTR SALQNAASIA ALFLTTEAVI AEKPEKAGAA PAMPGGGDMD F
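The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below strips the whitespace and computes a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full 1626 bp.

```python
def clean_sequence(display_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(display_text.split()).upper()


def gc_percent(display_text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(display_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two display blocks of the gene sequence in this record.
fragment = "ATGACCGCGA AAATGATCGC"
print(len(clean_sequence(fragment)), gc_percent(fragment))
```

Passing the full gene sequence to gc_percent in the same way would give the whole-gene value to compare against the GC content field.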