Gene Sros_3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3003 
Symbol 
ID8666290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3273765 
End bp3276143 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content71% 
IMG OID 
Productprotein kinase 
Protein accessionYP_003338699 
Protein GI271964503 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTGG TGGCATCAAC CACCTCGAAG CGGCTGCAGG CACAGGGGCT GCCGGCCGAG 
GTGACCAGTT TCGTGGGACG TCGGCACGAG GTGGCCGAGG TCAAACGGCT GCTGTCCGGA
GCCCGTGTGG TGACCCTCAC CGGCACCGGC GGGGTCGGCA AGACCCGGCT GGCCCTGCGG
GTCGCGGCCG ACGTGCTGCG GGCCTTCCGG GACGGCGTGT GGCTGGTCGA CCTCGCCGCA
CTGGAGTGCC CGGAGCTGCT CGTCCAGGCG GTCAGCGAGG CCCTGAGGAT CCGGGACCAC
TCCACCCGTC CCCCCATGCA GGTGCTCATC GAACACCTGC GGGACAAACA GACCCTGGTG
ATCCTGGACA ACTGCGAGCA CCTCCTGCAC GACTGCGCCG TGCTGGCCGA GACACTGGTG
CGCGCCGCGC CCGAGGTGCG GATCCTCGCC ACCAGCCGGC AGGTGCTGGG CATCGCCGGC
GAGCAGACGC TGGCCGTCCC GACGCTGCCG TTGCCGCTGC CGGACTCCGG CGTCTCGCGC
CCCTCCCTCG AATCGCTGGC GCAGTATGAC GCGATACAGC TGTTCGTCGA GCGGGCCGGG
GCCGTACTAC CGGGTTTCGC CGTCACCGAG TCCAACAGGG ACGCCGTGGA GCGGATCTGC
CGGCGGCTGG ACGGCATCCC GCTCGGGATC GAACTGGCGA CCGTGCGGCT GCGGGCCCTC
TCCGTCGAGC AGTTGCTCGC CCGTTTGGAC GACCGTTTCC AGCTGCTCAC CTCGGGATCA
CGGGCGGCGC TGCCCCGCCA GCAGACCCTC CGTGCGCTGA TCGACTGGAG CTACGCCCTG
TGCACCGAGC AGGAGCGGCT GCTGTGGGCG CGCGCCTCGG TGTTCGCCGA CGGCCTGGAC
CTGGCTGCGG CCGAGGAGGT CTGCTCCGGC GACGGCATCC TCCGTGAGGA GGTCGTCGAC
CTGGTGATCG GACTGGTGGA CAAGTCCGTT CTGATCAGAG ATGACCATCC GTCTTCCCCG
TCGGCGGCCC GGTATCGGCT GCTGGAGACG GTCCGGCAGT ACGGCCGGGA GCGGCTCGCC
ACGACAGGGC AGGCGGCCGC GTTGCAGCGG CGGCACCGTG ACTACTACCG GAAGCTGGCG
GCGGAGGCGC GAGCGCGGCA TTTCGGCCCG TCCCAGGTGG CCTGGTTCAC CACCCTGAAG
ATCGAGCACG CCAACCTCCA CGCGGCCCTG GAGCACTGCT TCTCCAGGCC GGAGGGGGTC
GCGACGGGCC TGGGCATGGC CGCCGACCTG CTCTACCACT GGATCACCAA CTGCTACCTG
CACGAGGGGC GCCGCTGGCT GGACCGGGGG CTCACCGCCC ACACCGCACC GGACGAGACC
CGGGCCCGGG CCCTGTGGAC CGACAGCTGG CTGGCCGTCA TCCAGACCGA CGTCACCTCC
GCCACGGCGA TGCTGGAGGA GGCCCGGGCC CTCGGGGAGC GGCTCGGCCA GGAGCCGGTC
CGCGCCTACG TCGCCCTCTA CTCCGGAATG GTCGCCATGT GCCGGAGGGA CGCCGGATCC
GCGGTCGCGC TCTACGAGGA GGCGGTGACT CGCCACCGCT CCACCGGCGA CCCGGTGGGC
CAGGCGCTGG CGCTCATCCG GCTCTCCCTC GCCCGATCCT TCCTGGGCGA CTCGCCGGGC
GCCGTCTCCG CCGCGGAGGA GTGCCTGGCG GTGTGCGACG CCCACGGAGA GGGCTGGCAC
AGGGCGTACG CGATGATGGC CCTCGGCATC GAGATCTGGC GTCAGGGCGA CACCCCGCGC
GCGGCTGCAC TCGAGAAGGA GAGCCTGCGG TTCAACTGCT CGCTCAACGA CCCGCTGGGC
GCCGGGATCA ACCTGGAGGT GCTGGCCTGG ATCGCCGCCG CGGAGAAGCA GTACCGGAAA
GCGGCCCGGC TGCTCGGAGC CCTGGAGACC ATCTGGCAGG CGATCGGCGC GCCGCTGTCC
GGATTCGGGC ACCTGGCCGG CTACCACGAC GAATGCGTGT CCCGTGCCCG CCGGGCGCTC
GGGGAGCCGG CCTTCAACGC GGCCGTCAGG CGAGGCGCCA GGCTCCCCTA CGAGGAGGCG
CTCGCCTACG CCCTCGAAGA GGGCGCACCC GGGGACGGGT CCCAGGCGGA GGAGGGGCGG
CAGGCGCCGT TGACCCGCAG GGAGATGGAG ATCGCCCAGC TCGTCGCCCA GGGGATGAGC
AACAAGGAGA TCGCGGCCGC GCTGGTGATC GCCCAGCGCA CCGCCGAGGG ACACGTCGAG
CACATTCTGA GCAAGCTCGG CTTCACCTCA CGCGCTCAGG TCGCCGTCTG GATAGGCGAG
TGGAACCGGA GCGTGGACGG TGAGCACGGT TCCGGCTGA
 
Protein sequence
MNVVASTTSK RLQAQGLPAE VTSFVGRRHE VAEVKRLLSG ARVVTLTGTG GVGKTRLALR 
VAADVLRAFR DGVWLVDLAA LECPELLVQA VSEALRIRDH STRPPMQVLI EHLRDKQTLV
ILDNCEHLLH DCAVLAETLV RAAPEVRILA TSRQVLGIAG EQTLAVPTLP LPLPDSGVSR
PSLESLAQYD AIQLFVERAG AVLPGFAVTE SNRDAVERIC RRLDGIPLGI ELATVRLRAL
SVEQLLARLD DRFQLLTSGS RAALPRQQTL RALIDWSYAL CTEQERLLWA RASVFADGLD
LAAAEEVCSG DGILREEVVD LVIGLVDKSV LIRDDHPSSP SAARYRLLET VRQYGRERLA
TTGQAAALQR RHRDYYRKLA AEARARHFGP SQVAWFTTLK IEHANLHAAL EHCFSRPEGV
ATGLGMAADL LYHWITNCYL HEGRRWLDRG LTAHTAPDET RARALWTDSW LAVIQTDVTS
ATAMLEEARA LGERLGQEPV RAYVALYSGM VAMCRRDAGS AVALYEEAVT RHRSTGDPVG
QALALIRLSL ARSFLGDSPG AVSAAEECLA VCDAHGEGWH RAYAMMALGI EIWRQGDTPR
AAALEKESLR FNCSLNDPLG AGINLEVLAW IAAAEKQYRK AARLLGALET IWQAIGAPLS
GFGHLAGYHD ECVSRARRAL GEPAFNAAVR RGARLPYEEA LAYALEEGAP GDGSQAEEGR
QAPLTRREME IAQLVAQGMS NKEIAAALVI AQRTAEGHVE HILSKLGFTS RAQVAVWIGE
WNRSVDGEHG SG