Gene Sros_8472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8472 
Symbol 
ID8671806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9349884 
End bp9352376 
Gene Length2493 bp 
Protein Length830 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003343859 
Protein GI271969663 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.547779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.957952 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGCG CCTCGCGGGC ACGCAACCGG CTCGCCGTCC GATACTTCGA CGACCGGATC 
CTGCTGACCG ACTCGGCGGT CTGGGCCTAC TTCCGGCTGC CGACCGTGAG CTACGAGTTC
ATCACCCCCG AGGAGCGGGA GGCGCTGGCC ACCAACATCA CGATCGCGCT GGCGGCGATC
CGGATGCCGG ACGCCGAGGT GCACCTGCGG GTGGCGCACC GCACCTACCC CGCGGCCGAG
TGGGCGATGG CGCTGAACGC CACCTCCGAC GAGGGCCCCG GCTGGCGCGA CTACCTGGAG
GAGATGTACC GGCACGTCTG GGCGAAGGAC TTCTGGAGCA AGGAGGTCTA CCTCGGCGTA
CGGCTCGGCC TGCGCGGCAG GCAGCTCGGC ACCGGCGTCC TGTCCCAGCT CTTCGGCTTC
TACCAGCGCA GCGAGAAGGT CCTGGGCCTC GAGGACGACC ACGTCCCCAA GGCGGAGATC
GCCAAGTGGT CCGAGCAGGC GGAGCGGCTG GGCCGGGCCC TGTCCGCGAG CGCCCTGTAC
GCCCGGCACG CCACCTCCAC CGAGATCGCC TGGCTCTTCC AGCACGCCGC GACAGGCTCC
CTCGGCGACC CGCCGCCCTC GGCCAGCCCC AAGCGGCGGT GGGGACAGGG CGAGATCGAG
TCCCTCGTCG AGGGGCAGAT CCACAACGGC AGGTCGCTGC TCCGGATCGA GCAGCCGCAG
GGCGACTCCT ACGTCGCGTA CCTGTCCTTC GCCCGGTTCC CGGATCTGAT GCCCTTCCCC
GACGGCGAGC CGTGGATGCA CTTCGCCGAC CAGCTCCCCT TCCCGGTGGA GATCAGTTCG
AGGATGCGGC TGATCTCGCC GGTCAAGGCC AGCAAGGACG TGGCGCGCAA GCTCGCCCAC
GCCCGTGACA TGGACATCCA CATCCGGGAG GCGGGCGCCG AGGCGCCGCT GGCGCTGGCC
GAGCAGATCG ACGCGGCCCG GATGCTGGAA CACGGCATCA CCAAGGAGCG CCTGCCGTTC
GTGTACGGCT GGCACCGGCT GATCGTCTCC GCCCCGACCG AGGAGATCTG CGTGCAGCGG
GTCGAGGCGG TCGTGGAGCA CTACCGCGAC ATGGGCATCG ACATCGTCAA CTCCACCGGC
GACCAGTTCT CCCTGTTCTG CGAGGCGCTG CCGGGCGAGC GGGTCCGGGT CAACGCCTAC
GCCCAGCGGC AGCCGCTGCG CACCATCGCG GGCGGCATGG CGACCGCCAC GGTGGACCTG
GGTGACCGGG CGGACGAGGG CAACGCCGGC TGGATGGGAC CGTACGTCGG GGAGACCCTG
GGCCGGGCCC GGAGCATCGT GCACTTCGAC CCGCTGGTGG CGGCCACGCG CAACCGGCCG
ACGGCCATCG CGATCACCGG TGAGCCGGGC GGCGGCAAGA CCACCCTGGC ACTGCTGATG
ATCTACCAGA TGGCGCTGCG CGGCGTGACG GTCGCGGTCA TCGACCCCAA GGGCGACGCC
GACTCCCTGG TCCAGCTGCT GCAGCGGCGG GGCAGGAAGG CGCGGGTCAT CCCGCTCGGC
TCGGCCGCGC CGGGGCTGCT CGACCCGTTC TCGTTCGGCG ACGACATCGC GGCCAAGAAG
ACGATGGCCA GCGAGACCCT CCGGCTGCTG CTGCCGCGCA TGTCGGAGGA GCGCGAGTCG
GCGATGATCC AGGCGGTCGC GGCGGTCTCC AACGGCGAGG ACCCCTCGCT GGGCAAGGTC
GTCGACTTCC TGGAGCAGAC CGAGGACGCC GCCTCCAAGA ACCTGGGCGC CGTGCTCCGC
TCGATGTCGG AGATGCACCT CGCCCGGCTC TGCTTCGACC CCTCCGGCGG CGACCAGATC
GACACCGAGG GGTGGACCAC GGTCTTCACC CTCGGCGGCC TGACCCTTCC GGACGCCTCC
ACCGGGCGCG ACGACTACTC CTACGAGCAG CGGCTGTCGG TGGCCCTGCT CTACCTGGTC
GCCCAGTTCG CCCGGGGGCT GATGAACGGC CTGGACCGGC GGACCCCCAA GGCGATCTTC
CTGGACGAGG CCTGGGCGAT CACCTCCACG CCGGAGGGCG CCAAGCTGGT GCCCGAGGTC
AGCCGGATGG GCCGCTCCCG CAACACGGCC CTGGTGCTGG TCTCGCAGAA CGCCGGCGAC
CTGCTGAACG AGCAGGTGAC GAACTGCCTG TCGTCGGTGT TCGCCTTCCG GTCCACCGAG
CGGGTCGAGG TGGAGCACGT GATGGCGCTG CTCGGGGTGG AACCCTCGGA GGAGCACAAG
GCGATCCTGC GCTCGCTGGG CAACGGCGAG TGCGTCTTCC GGGATCTGGA CGGTCGTGCG
GGCCGTATCG GGGTGGACCT CATCTCCGAC GAGCTGGTGC GCTGGCTCGA TACGAATCCG
ACCCACGACA AACCAGGCGA GAACGTGCAT GATCTTTCCG GGGGCGACAG AGTCAGTAGG
CCGGGGGCCG CGGCGCTGGA GGTAGGGTCA TGA
 
Protein sequence
MSRASRARNR LAVRYFDDRI LLTDSAVWAY FRLPTVSYEF ITPEEREALA TNITIALAAI 
RMPDAEVHLR VAHRTYPAAE WAMALNATSD EGPGWRDYLE EMYRHVWAKD FWSKEVYLGV
RLGLRGRQLG TGVLSQLFGF YQRSEKVLGL EDDHVPKAEI AKWSEQAERL GRALSASALY
ARHATSTEIA WLFQHAATGS LGDPPPSASP KRRWGQGEIE SLVEGQIHNG RSLLRIEQPQ
GDSYVAYLSF ARFPDLMPFP DGEPWMHFAD QLPFPVEISS RMRLISPVKA SKDVARKLAH
ARDMDIHIRE AGAEAPLALA EQIDAARMLE HGITKERLPF VYGWHRLIVS APTEEICVQR
VEAVVEHYRD MGIDIVNSTG DQFSLFCEAL PGERVRVNAY AQRQPLRTIA GGMATATVDL
GDRADEGNAG WMGPYVGETL GRARSIVHFD PLVAATRNRP TAIAITGEPG GGKTTLALLM
IYQMALRGVT VAVIDPKGDA DSLVQLLQRR GRKARVIPLG SAAPGLLDPF SFGDDIAAKK
TMASETLRLL LPRMSEERES AMIQAVAAVS NGEDPSLGKV VDFLEQTEDA ASKNLGAVLR
SMSEMHLARL CFDPSGGDQI DTEGWTTVFT LGGLTLPDAS TGRDDYSYEQ RLSVALLYLV
AQFARGLMNG LDRRTPKAIF LDEAWAITST PEGAKLVPEV SRMGRSRNTA LVLVSQNAGD
LLNEQVTNCL SSVFAFRSTE RVEVEHVMAL LGVEPSEEHK AILRSLGNGE CVFRDLDGRA
GRIGVDLISD ELVRWLDTNP THDKPGENVH DLSGGDRVSR PGAAALEVGS