Gene Sros_4022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4022 
Symbol 
ID8667316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4476414 
End bp4478315 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content69% 
IMG OID 
Productchaperone protein DnaK 
Protein accessionYP_003339673 
Protein GI271965477 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0107791 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0563214 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGAG CTGTCGGGAT CGACCTCGGC ACGACCAACT CGGTGATCGC CACGATGGAG 
GGTAGCCAGC CCACGGTGAT CCCCAACGCC GAGGGATCCC GGACGACGCC GTCGGTCGTG
GCCTTCACCG AGCAGGGCGA ACGCCTGGTC GGGCAGCTGG CCCGGCGTCA GGCCATCCTC
AACCCCAAGG GCACGATCTA CTCCGCCAAG CGGTTCATCG GGCGCCGCTT CGACGAGGTC
ACCAGCGAGA TGAACGCCGT GTCGTTCGAC GTGGTGCCCG GCCCGGACGG GGCCGTGCGC
TTCGACGTCC ACGGCAAGCT GTACGCGCCG GAGGAGATCG CCGCCCAGGT CCTGCGCAAA
CTGGTCGACG ACGCCTCGAA GTTCCTCGGA GAGAAGATCA CCGAGGCGGT CATCACGGTG
CCCGCCTACT TCAACGACTC CCAGCGCCAG GCGACCAAGG ACGCCGGGAA GATCGCGGGC
CTGGAGGTGC TGCGGATCGT CAACGAGCCG ACCGCGGCCG CCCTCGCCTA CGGCATGGAC
CGCAAGGAGA ACGAGACCGT CCTCGTCTTC GACCTGGGCG GCGGGACGTT CGACGTCAGC
ATCCTCACCA TCGGCGACGG CGTCGTCGAG GTACGGTCGA CCTCGGGCGA CACCCACCTG
GGTGGCGACG ACTTCGACCG GCGCATCGTC GACTATCTCG CCGACGAGTT CCAGCGGGAC
AACGGCATCG ACCTGCGCAA CGACCCCCAG GCGCTGCAGC GGCTGTTCGA GGCGGCGGAG
AAGGCCAAGG TCGAGCTGTC CTCGGTCACC CAGACGCAGA TCAGCCTGCC CTTCATCACA
GCGGACGCCT CCGGCCCCAA GCACCTCAAC ACCACGCTGC GGCGGGCGAC GTTCGAGGAG
ATGACGGCCG ACCTCCTGGA ACGCTGCAAG GGCCCGGTGG AGCAGGCGAT AGCCGACGCC
AAGCTCTCCT CCAACGACAT CGACGAGGTG ATCCTGGTGG GCGGCTCGAC CAGGATGCCC
GCCGTGCAGA ACCTGGTGCG CCGCATGACC GGCGGCAAGG ACCCGAACAT GACGGTCAAC
CCCGACGAGG TCGTGGCGCT GGGCGCCGCG GTCCAGGCGG CCGTCATCAA GGGTGAGCTG
CAGGACGTCG TCCTGCTCGA CGTGACGCCG CTCTCGCTCG GCATCGAGAC GCTCGGCGGG
ATCATGACCA AGGTCATCGA GCGCAACACG ACGATCCCGG CCCGCCGTAC CGAGGTGTTC
AGCACCGCCG AGGACAACCA GAGCGCCGTC GACGTCGTGG TGCTCCAGGG GGAGCGCGAA
CGCGCCGCCG ACAACCGGGC CCTGGGCCGG TTCCGGCTGG AGAACATCAG GTCCGCGCCG
CGCGGCGAGC CCCAGGTGGA GGTGACCTTC GACGTCGACG CGAACGGCAT CGTGAACGTC
TCGGCCAGGG ACAAGGACAC CAACGCCGAG CAGCGCATCA CCATCAGCGA GAGCTCCAAC
CTTGACCAGA GCGAGGTCGA GCGCATGGTC TCCGACGCCG AGCAGCACCG TGAGGAGGAC
GTGCGGCTGC GGCAGGCGGT CGACGCGCGC AACGAGCTCG ACAGCGCCGC CTACCAGGTG
GAACGGCGGC TGAACGAGCT GGGCGAGGCG GTGCCCGTCC ATGAGAAGGC GCGGGCCGAG
ATGCTCGTCA ACGACGCCCG GGACGCGGTC AAGCAGCAGG ACACCCCGGT CGACCGGCTC
CGGTCCCTGA CCTCCGAGCT GCAGCAGGTC TACCAGAGCC TCGCCGTCGC CTCCGCGGGC
CAGCCGGCGG GAGCCGGGCC GGGAGCCCAG GGCTCCCAGG ACGCTCGGGG CGGTGGCGGG
GACGACGACG TCATCGACGC GGAGTTCACG ACCGATGAGT GA
 
Protein sequence
MARAVGIDLG TTNSVIATME GSQPTVIPNA EGSRTTPSVV AFTEQGERLV GQLARRQAIL 
NPKGTIYSAK RFIGRRFDEV TSEMNAVSFD VVPGPDGAVR FDVHGKLYAP EEIAAQVLRK
LVDDASKFLG EKITEAVITV PAYFNDSQRQ ATKDAGKIAG LEVLRIVNEP TAAALAYGMD
RKENETVLVF DLGGGTFDVS ILTIGDGVVE VRSTSGDTHL GGDDFDRRIV DYLADEFQRD
NGIDLRNDPQ ALQRLFEAAE KAKVELSSVT QTQISLPFIT ADASGPKHLN TTLRRATFEE
MTADLLERCK GPVEQAIADA KLSSNDIDEV ILVGGSTRMP AVQNLVRRMT GGKDPNMTVN
PDEVVALGAA VQAAVIKGEL QDVVLLDVTP LSLGIETLGG IMTKVIERNT TIPARRTEVF
STAEDNQSAV DVVVLQGERE RAADNRALGR FRLENIRSAP RGEPQVEVTF DVDANGIVNV
SARDKDTNAE QRITISESSN LDQSEVERMV SDAEQHREED VRLRQAVDAR NELDSAAYQV
ERRLNELGEA VPVHEKARAE MLVNDARDAV KQQDTPVDRL RSLTSELQQV YQSLAVASAG
QPAGAGPGAQ GSQDARGGGG DDDVIDAEFT TDE