Gene Sros_4506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4506 
Symbol 
ID8667800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5022696 
End bp5024321 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content68% 
IMG OID 
Productchaperonin GroEL 
Protein accessionYP_003340114 
Protein GI271965918 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.030252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.702116 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGA AAATGATCGC CTTTGACGAG GACGCCCGGC GCGGCCTGGA GCGCGGTATG 
AACCAGCTCG CGGACGCCGT CAAGGTGACC CTCGGCCCCA AGGGCCGTAA CGTCGTCCTG
GAGAAGAAGT GGGGCGCACC CACGATCACC AACGACGGTG TCTCCATCGC CAAGGAGATC
GAGCTCGAGG ACCCGTGGGA GAAGATCGGG GCCGAGCTCG TCAAGGAAGT CGCCAAGAAG
ACCGACGACG TCGCGGGCGA CGGCACCACC ACCGCCACCG TGCTCGCCCA GGCTCTCGTA
CGTGAGGGCC TGCGCAACGT CGCCGCCGGC GCCAACCCGA TGTCCCTGAA GAAGGGCATC
GAGGCCGCCG TCGAGCGCGT CTCCGAGGAG CTGTCCAATC TGGCCAAGGA CGTGGAGACC
AAGGAGCAGA TCGCCTCCAC CGCCTCCATC TCCGCCGCCG ATCCCGAGAT CGGCTCGCTC
ATCGCCGAGG CGATGGACAA GGTCGGCAAG GAAGGAGTCA TCACCGTCGA GGAGAGCAAC
ACCTTCGGCC TGGAGCTTGA GCTCACCGAG GGCATGCGCT TCGACAAGGG CTACATCTCC
CCCATCTTCA TCACCGACCC CGACCGGCTC GAAGCGGTGC TCGACGAGCC GTACGTGTTG
CTCGTGAGCG GCAAGGTCGC CGCCAACAGG GAGGTGCTGC CGGTCCTCGA CAAGGTCGTG
CAGTCGGGCA GGCCGCTGCT GGTCATCGCC GAGGACATCG AGGGCGAGGC CCTGGCCACC
CTGGTCGTCA ACAAGATGAA GGGTCTCTTC CGGTCGGTCG CGGTCAAGGC GCCGGGCTTC
GGTGACCGCC GCAAGGCCAT GCTGGGCGAC ATCGGCATCC TGACCGGTGC CCAGGTCATC
AGCGAGGACC TCGGCCTCAA GCTGGAGTCC ACCACGCTGG ACCAGCTCGG CCGCGCCCGC
CAGGTCATCG TCACCAAGGA CGAGACCACC ATCGTCGACG GTGGCGGCGA CGCCGAGCAG
ATCGCCGGCC GGGTCAACCA GATCCGCGCC GAGATCGACA ACACCGACTC CGACTACGAC
CGCGAGAAGC TCCAGGAGCG TCTGGCCAAG CTGGCCGGCG GCGTGGCCGT CATCAAGGCC
GGCGCGGCGA CCGAGGTCGA GCTCAAGGAG CGCAAGCACC GCATCGAGGA CGCCGTTCGC
AACGCGAAGG CGGCCGTCGA GGAGGGCATC GTCCCCGGCG GTGGCGTGGC CCTGCTGCAG
GCCGGCGCCA AGGCGTTCGA CAAGCTGGAG CTGTCCGGCG ACGAGGCCAC CGGTGCCGCG
ATCGTGAAGA AGGCTCTTGA GGAGCCGCTG AAGCAGATCG CCGTCAACGC CGGCCTTGAG
GGCGGCGTCG TGGTGGAGAA GGTGCGCAAC CTCACCCCGG GTGAGGGCCT GAACGCCGCC
ACCGGCGAGT ACGTCAACAT GTTCGAGTCG GGCATCATCG ACCCGGCCAA GGTGACGCGC
TCCGCGCTGC AGAACGCCGC TTCCATCGCG GCGCTCTTCC TCACCACCGA GGCCGTCATC
GCCGAGAAGC CCGAGAAGGC CGGAGCCGCT CCCGCCATGC CGGGCGGCGG CGACATGGAC
TTCTAG
 
Protein sequence
MTAKMIAFDE DARRGLERGM NQLADAVKVT LGPKGRNVVL EKKWGAPTIT NDGVSIAKEI 
ELEDPWEKIG AELVKEVAKK TDDVAGDGTT TATVLAQALV REGLRNVAAG ANPMSLKKGI
EAAVERVSEE LSNLAKDVET KEQIASTASI SAADPEIGSL IAEAMDKVGK EGVITVEESN
TFGLELELTE GMRFDKGYIS PIFITDPDRL EAVLDEPYVL LVSGKVAANR EVLPVLDKVV
QSGRPLLVIA EDIEGEALAT LVVNKMKGLF RSVAVKAPGF GDRRKAMLGD IGILTGAQVI
SEDLGLKLES TTLDQLGRAR QVIVTKDETT IVDGGGDAEQ IAGRVNQIRA EIDNTDSDYD
REKLQERLAK LAGGVAVIKA GAATEVELKE RKHRIEDAVR NAKAAVEEGI VPGGGVALLQ
AGAKAFDKLE LSGDEATGAA IVKKALEEPL KQIAVNAGLE GGVVVEKVRN LTPGEGLNAA
TGEYVNMFES GIIDPAKVTR SALQNAASIA ALFLTTEAVI AEKPEKAGAA PAMPGGGDMD
F