Gene Sros_3410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3410 
Symbol 
ID8666698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3747853 
End bp3749121 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content68% 
IMG OID 
ProductHNH endonuclease domain protein 
Protein accessionYP_003339090 
Protein GI271964894 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.601947 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.516211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCCGGC ATACCCCGTT CGTCATCCGG CTGAAGGACC GGACCGTCGC CGGCTCGACC 
ATGCAAGGTG TCGAGCTGGG GATCGACCCC GGTAGCAAGC ACACCGGCAT CGCCGCGTTC
TCCGAGCGCG GCGGTAGCCG TATCGGCCTG TACGCACTCC AGCTTGACCA TCGGGGCGGT
CAGATCCGCG ACAAGCTCGC CTCACGGGCG GCGTTGCGCC GTGGTCGCCG GTCACGGAAC
CTGCGCTACC GCGCACCCCG CTTCAACAAC CGCACCAGAC CGCAGGGGTG GATTGCGCCG
TCCCTGCGGC ACCGGGTGGA CGGCACCGTG TCGTGGGTGT CCCGGTTGTC CCGTTGGGCT
CCCGTCACGG CTGTTCATGT GGAGCGGGTC GCCTTTGATA CGCACCTCTT GTCGGCCGGC
AGGCCGCTCG AAGGGGTGGA GTACCGGTAC GGCACCCTGC ACGGCTACGA GGTGCGGGAG
TACCTGCTGG CCAAGTGGGG CCGGGCGTGT GCGTACTGCG GCGCGTCCGG TGTGCCGTTG
AACCTCGACC ACATCCACCC CCGCAGCCGG GGCGGCTCCA ACCGGATCAG CAACTTGTGC
GTGGCGTGCG TCGGCTGCAA CCAGGCCAAG AACGCCACCC CGATCGAGGA GTTCCTCACG
GATCGGCCAG TGGTGCTGGT GAAGATCCTG CAGCAGTCGA AAGCTCCGCT CAGGGACGCC
GCCGCTGTGA ACGCGACCCG GTGGGCGTTG TGGCGGGCCT TGACCGCCAC CGGCCTGCCG
GTCGCTACGG CGTCGGGTGG CCGCACGAAG TGGAACCGAT CACGCACCGG CGCGGCGAAG
TCACACACGT TGGACGCGTT GCACGTCGGC GCACTTGACC ACGTGACCGG CTGGCCGTCC
ATGGTTCTCG TGATCGCGGC AACCGGACGC GGAACGTATG CCCGCACCCG AGCCGACCGG
TACGGGTTCC CCCGGCTGGC GTTGCCCCGC ACCAAACAAC ACCACGGTTT CCAGACCGGA
GACCTTGTCC GGGCGGTCGT CCCGACCGGC AAGAAAGCAG GGGTCCATAC CGGTCGGGTA
GCGGTCCGCT CCACCGGAAA CTTCAACATC CGTACCCGGC ACGGCTCCGT GCGGGGCATC
AGTCACCGTC ATGTCCGTCT GCTCCAGCGA GCCGACGGCT ACGGATACAC CACCCATCCA
GAAGCGCGGA ACCGTGCCGC GTTTCCTCCC CCGCCTGAAG GCGGGGGTAT CCACGCTGGG
GGTAATTGA
 
Protein sequence
MVRHTPFVIR LKDRTVAGST MQGVELGIDP GSKHTGIAAF SERGGSRIGL YALQLDHRGG 
QIRDKLASRA ALRRGRRSRN LRYRAPRFNN RTRPQGWIAP SLRHRVDGTV SWVSRLSRWA
PVTAVHVERV AFDTHLLSAG RPLEGVEYRY GTLHGYEVRE YLLAKWGRAC AYCGASGVPL
NLDHIHPRSR GGSNRISNLC VACVGCNQAK NATPIEEFLT DRPVVLVKIL QQSKAPLRDA
AAVNATRWAL WRALTATGLP VATASGGRTK WNRSRTGAAK SHTLDALHVG ALDHVTGWPS
MVLVIAATGR GTYARTRADR YGFPRLALPR TKQHHGFQTG DLVRAVVPTG KKAGVHTGRV
AVRSTGNFNI RTRHGSVRGI SHRHVRLLQR ADGYGYTTHP EARNRAAFPP PPEGGGIHAG
GN