Gene Sros_4208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4208 
Symbol 
ID8667502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4688205 
End bp4690466 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content74% 
IMG OID 
ProductHyaluronate lyase 
Protein accessionYP_003339853 
Protein GI271965657 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.431721 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000853972 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAACGGAT ATTCCAGGCG ATTATTTCTG CAGCTCGGCG GAGTCGCCAC GGCTGGGACG 
CTGATCCAGG CGCGTTCCGC GCACGCGGAG GACGATTCCT TCGCCCTGCT GCGCCGCCGA
TGGCGGGACG TGACGGCGGG GTCGGGATTC GACGCGGCGG CCGAACCGTA CCGGACCCGG
CTGGCGGAGC TGGGGACGAC GGCGGCCCGA TACCGGGATA CCATGGCGCC CGCCGACGCG
TCGCTCTGGC CCGGCCTGGC GTTCCCCTCC TTCATCGGCA CCCCCGCGCG GCTGCAGACC
ATGGCCCGCG CGTACGCCCT GCCGGGGACC GGCCTGACCG GCGACGCGGG CCTGGCCTCG
GCCGTCGCCG CCGGCATCGA CCACTACCGG CGGCGGGTGT ACACCGCGGG CGCGGACCCG
GCCGGCAACT GGTGGAACTG GCAGATCGGC ACCCCCGGAA GGCTCCTCGA CGCGGCCGTG
CTGATCGGCC CGCACCTGGC GGACCGGCAG AGCAGGGCAC TGCGGGACGC GGTGGACCAC
TTCCTCCCCG AGTGCCGCCT GAACGACTAC AGCGGCACCA GCACCGGGGC CAACCGGGTG
GACCTGTGCA CGGTGACGCT GCTGCGCGCC ATCCTCGGAT CCGACCCGGG GAAGGCCGCG
CTGGCCGCCT CGGCCCTCTC CCCGGTCTTC CCCTACGTCT CCGAGGGCGA CGGAATCTAC
CGGGACGGCT CGTTCGTCCA GCACACGTCC GTCCCCTACC AGGGCGGGTA CGGGGCGGTG
ATGCTGTCCG GTCTCGCGAC GCTGTTCGCG GTGCTGCGCG GCTCGCGGTG GGAGATCACC
GACGCGGGCA GGCAGATCGT CTTCGACACC GTCGAGAGGT CGTTCGCGCC GTTCGTCCAC
GACGGCTTCT GCATGGACCT CGTCAGCGGC CGCGGGATCG GCAGGGAGCC GTACGGTGAC
CACAGGAGAG GCCGCGGGAT CGCCTCCTCG ATCCTGCTGC TGGGGGACGC GGCCTCCGCC
GCCGAGCGGG CCCGCTGGCA GGGCATGGTG AAGGGGTGGG CGCTGCGGGA CACCTGCCGG
CCCATGCTCG GCGCGGCCGA GAGGAGCGAC CTGGGGTTCC ACGCCCGCCT CGCCGCGGTC
CTGGATGACG ACGCGATCCC CGCGGCGGAC GAGCCCATCG GGCACCGGCT GCTGGCGATG
AGCGCCCGCG CCGTCCATCG CAGGCCGGGC TGGTGCGCGG GGCTCAGCAT GGCCTCCTAC
CGGATCGGCC ACTACGAGCA CGGCAACGGC GAGAACCTCC GGGGCTGGCA CACCGGCTCG
GGAATGCTCT ACTGGTGGGG CGAGGGCCAC GGCGACCAGT ATTCCGACTC CTTCTGGCCC
ACCGTCGACC CCTACCGCCT GCCGGGCACG ACCGTCTCCA CCCGGCGGCT GGCCGACGGC
GCCGGTGAGG GGTGGGGCGA CACCTGCCCG CCAGGCCGCT GGGTGGGTGG CGCCACCGAC
GGCACGTACG CGACGGTCGG CCAGCACCTG AACGGCTTCG AGAGCACGAT GGAGGCCTTC
AAGTCATGGT TCTTCCTCGA CGACGCGGTG GTCTGCCTGG GCGCCGCGAT CACCGGCGAG
GACGGCGTGC CGGTCGAGAC CATCGTGGAC AACCGCAGAA CCGACGCGTT CCTCACCGTG
GACGACAGGG CGGGCTGGGC ACACCTGGAG GGGCACGGCG GCTACGTCGT GCCCTGCGCC
CGCCTGCACA CGCTGCGCGA GGAACGCACC GGCGGCCCGG AGCCGGTCAC CCGGAGCTAT
GTGACCCTCT GGCTCGACCA CGGCGTCGAC CCCCGCTCGG CCGGCTACGT CTACCTGCTC
CTGCCGGGTG CGAGCCTGGC GCGGACCCGG GCCCGCGCCG CCGACCCCGG CTGGGCGCGC
GTGCTCGCCA ACACCGCCCG CCGGCAGGGT GTCCAGGTCC CTTCGCTCGG GATCACCGCG
GTGAACTTCT GGAACGACGG AGCCGCCGGC GGCCTGACCG CCTCCGCCCC GTGCGCGGTC
CTCGTCAGGG AGCGCGGCGA CGGCACGGCC ACGCTGACCG TGTCGGACCC CCGGCGCGAC
CTGGACGGGC TGACCGTGAC CTGGGACCGG CCGGTGACCG GGGTGCTCCG CGGGCACCCG
CTCCTGACGG GCGCCGCGAC CGGCGCGAAG CTCACGCTCA CCTTCGGGCG GCTGTCGGAC
CGGGGCGGCA GCTCGAAGAC GGTCACCGTA CGGCTCGGCT GA
 
Protein sequence
MNGYSRRLFL QLGGVATAGT LIQARSAHAE DDSFALLRRR WRDVTAGSGF DAAAEPYRTR 
LAELGTTAAR YRDTMAPADA SLWPGLAFPS FIGTPARLQT MARAYALPGT GLTGDAGLAS
AVAAGIDHYR RRVYTAGADP AGNWWNWQIG TPGRLLDAAV LIGPHLADRQ SRALRDAVDH
FLPECRLNDY SGTSTGANRV DLCTVTLLRA ILGSDPGKAA LAASALSPVF PYVSEGDGIY
RDGSFVQHTS VPYQGGYGAV MLSGLATLFA VLRGSRWEIT DAGRQIVFDT VERSFAPFVH
DGFCMDLVSG RGIGREPYGD HRRGRGIASS ILLLGDAASA AERARWQGMV KGWALRDTCR
PMLGAAERSD LGFHARLAAV LDDDAIPAAD EPIGHRLLAM SARAVHRRPG WCAGLSMASY
RIGHYEHGNG ENLRGWHTGS GMLYWWGEGH GDQYSDSFWP TVDPYRLPGT TVSTRRLADG
AGEGWGDTCP PGRWVGGATD GTYATVGQHL NGFESTMEAF KSWFFLDDAV VCLGAAITGE
DGVPVETIVD NRRTDAFLTV DDRAGWAHLE GHGGYVVPCA RLHTLREERT GGPEPVTRSY
VTLWLDHGVD PRSAGYVYLL LPGASLARTR ARAADPGWAR VLANTARRQG VQVPSLGITA
VNFWNDGAAG GLTASAPCAV LVRERGDGTA TLTVSDPRRD LDGLTVTWDR PVTGVLRGHP
LLTGAATGAK LTLTFGRLSD RGGSSKTVTV RLG