Gene Sros_3414 details

Gene Information

Locus tag: Sros_3414
Symbol:
ID: 8666702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3753219
End bp: 3755048
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Transcriptional regulator-like protein
Protein accession: YP_003339094
Protein GI: 271964898
COG category:
COG ID:
TIGRFAM ID:
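
The gene length and protein length listed above follow from the start and end coordinates. A minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon of the unspliced CDS is a stop codon (the function names are illustrative and do not come from the source database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3753219, 3755048)
    print(length, protein_length(length))
```

With the values from this record, the sketch reproduces the listed 1830 bp and 609 aa.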


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.442542
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.390313
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGATGC ACCTTGGGCA GGACTCGGGG GGTTCCGCCG ATGACGGCGG GAAACGGCGC 
ACCCGGCGGC GCGGCGCGGG CGGCGAGGCG CGGCCCGGCG CCGGCGGCGC GCAGGACACC
CCTGAAACAC AGGCCTCAGG GCTGCCGGAG ACCGGCCGGG GGCGGGAGCC CGAGGCGGCC
GACAACACCG TGCGGGTGGA GAAGCCCGGG AAGGCGCGGG ACGCGGGACA GGACCGCGAG
CCCGCGCAGG CGGCCGACAA CACCGTGCGG GTGAGCAGGC CGGGGCCGGC GCCCGCGGCC
GAGCCCCCGC CGTCCGGCCC GCCCCCGAAG GGCCGGCCGC AGTTCGCCAA GCCGCTGACG
ACCCCCTCGC TCATCGGCTG GACCCTGCTG TCGGCGATCC TGCCCGGCGC GGCGCACCTG
CGCGCCGGGC GGCGCCGTAC CGGCTACATC CTGCTCGGCG TCTTCGGCCT GCTGCTGGTG
ACCGCGCTGG TCGGCGGTCT CACCCTGTAC GGATCGGGCA ACACCGGCGT CGTCACCCGG
GACGGCACGC TTCTCGCCGC GGTGATCGTC GCCGCGCTCG GCGCGCTCGG CTGGTTCCTG
CTGGTGCTGT CGTCCTACAT CTCCCTGAAG CCCAACCGGC TGACCGGCAG GGGGCAGATC
GTCTCCGGCA TCGTGGTCGG CGTGCTGTGC GTGTCGGTGA TGGCGCCGTT CGCGCTGACC
GCCAGCACCG TGCTGGCCGC CAAGGAGACG GCCAACGCCA TCTTCCCGAG CGTCAAGGAC
GACACGGCGG CGACCCCGAT CAAGCACGAG GACCCGTGGG ACGGGCGCGA GCGGGTGAAC
TTCCTGCTCA TCGGCGGTGA CGGGGCCGGC AACCGGGAGG GTGTACGGAC CGACAGCATG
AACGTGGCCA GCGTGAACGT CAAAACCGGC AACACGGTCA TGTTCAGCCT GCCCCGCAAC
CTGCAGCACG TCCACTTCAG GCCCGGCACC CCGCTCGCCA AGCATTTCCC CAACGGCTTC
ATGCGCGAGC TGCCCAACGG CGGCCTGCTC AACGAGGTCT GGCAGTACGG CGAGGACCAT
CCGGAGATCG TGCCGGGCAA GAACGACCAG CGCGGCCCCC GGGCGCTCAT GAACGCCATC
GGCCAGACGC TCAACCTGCA GATCGACTAC TACGCCCTCG TGAACATGTT CGGCTTCGCC
CACCTGGTGG ACGCCATCGG CGGGCTGAAG ATCCGGGTGG ACAACGACGT CAAGTGGGGC
GGCCTCTACG GCACCGCGGG CACGATCAAG GCCGGCTACC AGACGCTGTC CGGCGAGGAG
GCGCTCTGGT ACGGACGCTC CCGCGTCGGC AGCGACGACT TCTCCCGCAT GGCCCGCCAG
CGGTGCGTCA TCGGCGCCTT CGCCCAGCAG GCCACCCCGT CGGTCGTGCT CACCAACTTC
GTCAAGGTCG CGAACGTGGC CAAGCGGATG GCCAAGACCA ACATCCCCCG CGAGCTGCTG
GAGCACGTGA CCGACCTCGC TCTCAAGGTC AAGGACGCCA GGATCACCAG CCTGCAGTTC
GTGCCGCCGG AGTTCTACAC CGGTTCCCCC GACTGGCAGA AGATCCGCAC CGCGACCGCG
AGGGCGCTGC GCCAGTCCTC CCAGCCCTCC CGCCGCGCCC TCGCCGCCGG TGTGACGGCC
TCCCCCGGGG CCAGCGCCTC TCCCGGCGCG AGCGGCGCCA GCCCCACTCC GGACCCGACC
CCCACCAGGA CCGCGACGGT CCGGCCGAGC CAGACCCCCA CCCAGAACGG CAAGGCGGCC
CAGTCCCTCA GCGAGCTCTG CGGGTTCTGA
 
Protein sequence
MTMHLGQDSG GSADDGGKRR TRRRGAGGEA RPGAGGAQDT PETQASGLPE TGRGREPEAA 
DNTVRVEKPG KARDAGQDRE PAQAADNTVR VSRPGPAPAA EPPPSGPPPK GRPQFAKPLT
TPSLIGWTLL SAILPGAAHL RAGRRRTGYI LLGVFGLLLV TALVGGLTLY GSGNTGVVTR
DGTLLAAVIV AALGALGWFL LVLSSYISLK PNRLTGRGQI VSGIVVGVLC VSVMAPFALT
ASTVLAAKET ANAIFPSVKD DTAATPIKHE DPWDGRERVN FLLIGGDGAG NREGVRTDSM
NVASVNVKTG NTVMFSLPRN LQHVHFRPGT PLAKHFPNGF MRELPNGGLL NEVWQYGEDH
PEIVPGKNDQ RGPRALMNAI GQTLNLQIDY YALVNMFGFA HLVDAIGGLK IRVDNDVKWG
GLYGTAGTIK AGYQTLSGEE ALWYGRSRVG SDDFSRMARQ RCVIGAFAQQ ATPSVVLTNF
VKVANVAKRM AKTNIPRELL EHVTDLALKV KDARITSLQF VPPEFYTGSP DWQKIRTATA
RALRQSSQPS RRALAAGVTA SPGASASPGA SGASPTPDPT PTRTATVRPS QTPTQNGKAA
QSLSELCGF
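
The correspondence between the gene sequence and the protein sequence above can be checked with a small translation routine. The following is a minimal Python sketch, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard code, that the initiator codon (GTG in this gene) is read as methionine, and that a trailing stop codon is dropped. The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; the first codon is read as Met (assumption)."""
    dna = clean(dna)
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if protein:
        protein[0] = "M"  # alternative start codons such as GTG initiate with Met
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminating stop codon
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    first_line = ("GTGACGATGC ACCTTGGGCA GGACTCGGGG "
                  "GGTTCCGCCG ATGACGGCGG GAAACGGCGC")
    print(translate_cds(first_line))
```

Applied to the first 60 bases of the gene sequence, the sketch yields the first 20 residues of the listed protein sequence.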