Gene Sros_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3044 
Symbol 
ID8666331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3317853 
End bp3318983 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content68% 
IMG OID 
Productputative transposase, IS891/IS1136/IS1341 
Protein accessionYP_003338739 
Protein GI271964543 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.396348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGGA TCGTGAAGCG GGCGTTCAAG TTCCGCTTCT ACCCGACCCC GGAGCAGGCT 
TCCGAGCTTG CCCGGACGTT CGGCTGCGCC CGCCTCGTTT ACAACAAGGC CCTCGAAGAG
CGCAGCCGCG CCTACACCCT CGAAGGCCGC AAGGTCTCCT ACGTGCAGTC GTCGGCGGCG
CTGACGCAAT GGAAGCGCAC CGAGGAACTG GCGTTCCTGA ACGAGGTGTC GTCGGTGCCG
TTGCAGCAGG CGCTCAGGCA CCTGCAGGCG GCGTTCGCGA ACTTCTTCGC CAAACGCGCC
AAGTACCCGA CGTTCAAATC CCGTAAGAAG TCCCGGCTCA GCGCCGAGTA CACCCGTTCG
GCGTTCACCT ACCGAGACGG CCGGATCACC CTGGCGAAGA TGGGCGGCCC GCTGAATATC
GTGTGGTCGC GTCCGTTGCC GGAGGGGGCG GACCCGTCCA CGGTGACGGT GTCGAAGGAC
GCGGCCGGGC GGTGGTTCGT GTCCATCCTG TGCGAGGACA CGATCGGCCC TTTGGACCCG
GCTGAGGGCG TGGTCGGCAT CGATGCCGGG ATCACGTCCC TGCTCGTCTT GTCGCGGCCG
ATTCCCGGCC TCACCGACGC GGCGGGAAAG GTCGCCAACC CCCGCCACGA GCGTGCCGAC
CGCAACCGGC TGGCCCGTGC GCAGCGCGCC CTGGCCCGCA CGGAGAAGGG CTCCGGCAAC
CGGGCCAAAG CACGGGTGAA GGTCGCCCGG GTGCATGCTC GCATCACCGA CCGCCGCCGC
GACCACCTGC ACAAGCTCAC CACCTCGATC GTCCGTGAGA ACCAAACGGT CGTGATCGAG
GACCTCACCG TGCGCAACAT GGTGAGGAAT CACTCGCTGG CCCGCGCCGT CTCGGATGCG
AGCTGGCGGG AGCTGCGCTC CATGCTGGAG TACAAAGCGG CGTGGTATGG GCGGGAACTG
GTGGTCGTCG ACCGCTGGTT CCCCTCCTCC AGGCTGTGCT CGGCGTGCGG GGCCATCCAG
CGGTCCATGC CGCTGAACGT CCGTGACTGG GTGTGCGCCT GTGGCGCCGC CCATGACCGG
GATGTGAATG CTGCGAAGAA CATTCTCGCC GCCGGGCTGG CGGAGAGGTA A
 
Protein sequence
MARIVKRAFK FRFYPTPEQA SELARTFGCA RLVYNKALEE RSRAYTLEGR KVSYVQSSAA 
LTQWKRTEEL AFLNEVSSVP LQQALRHLQA AFANFFAKRA KYPTFKSRKK SRLSAEYTRS
AFTYRDGRIT LAKMGGPLNI VWSRPLPEGA DPSTVTVSKD AAGRWFVSIL CEDTIGPLDP
AEGVVGIDAG ITSLLVLSRP IPGLTDAAGK VANPRHERAD RNRLARAQRA LARTEKGSGN
RAKARVKVAR VHARITDRRR DHLHKLTTSI VRENQTVVIE DLTVRNMVRN HSLARAVSDA
SWRELRSMLE YKAAWYGREL VVVDRWFPSS RLCSACGAIQ RSMPLNVRDW VCACGAAHDR
DVNAAKNILA AGLAER