Gene Sros_3521 details

Gene Information

Locus tag: Sros_3521
Symbol:
ID: 8666809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3900307
End bp: 3902064
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: transposase IS4 family protein
Protein accession: YP_003339200
Protein GI: 271965004
COG category:
COG ID:
TIGRFAM ID:
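The coordinate and length fields in this record are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state explicitly: that the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3900307
end_bp = 3902064

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1758
print(protein_length_aa)  # 585
```

Both results agree with the Gene Length and Protein Length fields listed for this gene.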


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.225727
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGGGG ATACCGGCCG GGTGATCCCG GCGGAGACGG TCCGTGCGGC CTGGGCGGCC 
AATCCTTCTG GAACTCCGGC GATGTGGATC CGGGACCGGC TCGCGGGAGT GTTCGGCGAG
AAGGACTTCG TCGGCTGGTT TCCCGCTGAT GGGCGGCGTG GATTGTCGCC GGTGGTGTTG
GCGCTGGTCA GCGTGTTGCA GTTCGCGGAG AACCTGACCG ATCGGCAGGC GGCGCTGGCG
GTGCGATGCC GGATCGACTG GAAGTACTGC CTCGGGCTGG AGCTGACGGA TCCGGGGTTC
GACCACTCTG TGCTCTCGGA GTTCCGAGAT CGGATGGCCC AAGACGATCG GGCGGACCGA
CTGCTGGCGG TGATGGTGCA ACGGCTGGTC GAGGCGGGGC TGGTCAAGCA GCGGGGCCGG
GTCCGGACCG ATTCCACGCA TGTGCTGGCC GCGGTCCGCA AGCTCAACCG CGTCGAGTTG
GTCGGGGAGA CGCTGCGGGT CGCGCTGGAG GAACTCGCCG CCGCCGATGA ACCCTGGCTG
GCCGCCCTGA TCACCCCGGA GTGGGCCAGC CGCTATGGCC GGCCGGTCCG CTATGACCGG
CTGCCGCGCG GCAAGGATGA TCTGGCCGCG CACGTGCTGC AGATCGGCCA GGACGGGATG
ACGGTCCTGG AGGCGGTGCA TGCGGCCGGG GCGTCGCGTC GGCTGCGGGA TCTGCCGGGG
GTGCAGGTAC TGCGTCAGGT ATGGGTGCAG CAGTACTGGA CAGACTCCTA CGGTGATCTG
GCCTGGCGAG CCGCCAAGTC CAGCCGGGAC CGGCAGAGCC GCCACGGCCG GCCGCGTCGG
TCATCCGGCG AGGAAAGCGG CCAACAGCCG GACCCGGCAC GGGTGCCATG GTCCGGGATC
GAGATCGTCA GTCCGCACGA TCCCGAAGCC CGGTACTGCC GCAAGGAAGG AAAAACGACC
ACGAAAGCTG AGTGGGTCGG CTACCGGGAT CATCAGAGCG AGACCTGCGA CGACAATGTT
CCCAACGTGA TCGTTCACGT CCTCACCCGC CCGGCGCCGG TCCAGGACAT CGATGCCGTG
GACGACATCC ACGCGGGCCT GGCCGCCAGC GGCTTGACCC CGGCCGAGCA TCTCCTCGAC
AGCGGATACG TCACCCCGGA CGTCATCCAC CACACCGCCC AGCAGTGGGG CGTCGCTCTG
ATCGGGCCAG TTCGAGCCGA CCCGCGAGGC CGCCACGGGT TCACCAAGGA AGACTTCCAC
GTCAACTGGG ACGATCACAC CGTCACCTGC CCGCGCGGGG TGACCAGCCC GCCCTTCAAA
CCCACCCTCG GCGATGGCAA GCCTCGCCTG TCGGTGCTGT TCCCCCGCGC GGCCTGCCGG
GCCTGCCCAG ACCGCCAGGC CTGCACCGGT GACGCCAACG GCAAGGGTCG CCACCTCACC
CTGCTGCCCG AGCCGCTGCA GCAGATCCAG ACCCGCAATC GCGCCGACCA GCACACCGAA
CCTTGGAAGG CCCGCTACGC CCTGCGCGCC GGCTGCGAGG CCACCGTCTC CGAAACCACC
CGCGCCCACG GCCTACGCAA TTGCCGCTAC AAAGGCCTCG CCAAAACCCA CGTCCAGCAC
GTCCTGACCG CGGCCGGCAC CAACGTCATC CGCCTCGCCG ACTGCTACAC CCCCGGCATC
ATCCCCGACC GACCGCCACG TCCGATCAGC CCGTTCCAAC AACTCTGCCG ACGGCTGGCC
GCCCAGACCC CAGAATGA
 
Protein sequence
MGGDTGRVIP AETVRAAWAA NPSGTPAMWI RDRLAGVFGE KDFVGWFPAD GRRGLSPVVL 
ALVSVLQFAE NLTDRQAALA VRCRIDWKYC LGLELTDPGF DHSVLSEFRD RMAQDDRADR
LLAVMVQRLV EAGLVKQRGR VRTDSTHVLA AVRKLNRVEL VGETLRVALE ELAAADEPWL
AALITPEWAS RYGRPVRYDR LPRGKDDLAA HVLQIGQDGM TVLEAVHAAG ASRRLRDLPG
VQVLRQVWVQ QYWTDSYGDL AWRAAKSSRD RQSRHGRPRR SSGEESGQQP DPARVPWSGI
EIVSPHDPEA RYCRKEGKTT TKAEWVGYRD HQSETCDDNV PNVIVHVLTR PAPVQDIDAV
DDIHAGLAAS GLTPAEHLLD SGYVTPDVIH HTAQQWGVAL IGPVRADPRG RHGFTKEDFH
VNWDDHTVTC PRGVTSPPFK PTLGDGKPRL SVLFPRAACR ACPDRQACTG DANGKGRHLT
LLPEPLQQIQ TRNRADQHTE PWKARYALRA GCEATVSETT RAHGLRNCRY KGLAKTHVQH
VLTAAGTNVI RLADCYTPGI IPDRPPRPIS PFQQLCRRLA AQTPE
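The gene and protein sequences are printed in space-separated blocks of ten letters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how one might strip that formatting, compute a GC fraction, and translate the DNA. It assumes the sequence contains only A, C, G and T, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch ignores). The function names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons from the first base, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = clean_sequence(
    "ATGGGCGGGG ATACCGGCCG GGTGATCCCG GCGGAGACGG TCCGTGCGGC CTGGGCGGCC"
)
last_line = clean_sequence("GCCCAGACCC CAGAATGA")

print(translate(first_line))  # MGGDTGRVIPAETVRAAWAA
print(translate(last_line))   # AQTPE
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_content` and `translate` would allow the GC content and protein fields of the record to be cross-checked in the same way.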