Gene Sros_3568 details

Gene Information

Locus tag: Sros_3568
Symbol:
ID: 8666856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3954316
End bp: 3956409
Gene Length: 2094 bp
Protein Length: 697 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003339245
Protein GI: 271965049
COG category:
COG ID:
TIGRFAM ID:
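
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3954316, 3956409)
print(length)                           # 2094
print(expected_protein_length(length))  # 697
```

Both results agree with the Gene Length (2094 bp) and Protein Length (697 aa) fields of the record.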


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.44867
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACCTCT CCCCCCTCGC GCAGCGCCTG CACAATTTGA TCACTGGGAC CGCGAATGAC 
CCCACGTACT GGCCCGAGGC CTCCCAGTTG AGCGCCGGGC TGGGCGCGGC GGTCCGGACC
GTCGACGGCA GTTTGATATC GGCACCGTAC GTGGTCGACG AGCAGCGGCT GGCCGTGGCG
GCCCGCGAGC TGGTCGGATA CCTGCTGGAC GAACACCGAC GGTACGAACC CTTCTCCGTG
ATGGCGGCGG CGGGGCTGGC GGGCATGGAC GCCGAAGTGA GCGCCCAGTT CGCGGAGAAG
CGCGGGCCGA TCGCGGTCGC CGAGATGGAG GTGGTGTTCG GCTGGGACGT GGAGCTGCGG
GCGCTGCTCG CCGAGGCGAT GGCCAAGGCG ATGCGAGAAC GGCCAGACCA CGCCGCCCCG
CTGCCCGCCG TCATCGAATG GCTCGGCGGC CGGCCCGGCT ACCTGGACTT CGCCCGGACG
GTTCTGGAGG CGGCCGAGGC CCGCGTCACG GCGATCCACG CCGGCGAGAT TCCCTACAGC
GCGGAGAAGG CGTTCGACGA CGGAGAGAAG AGGACCATCG GCCGGGCCGT ACGGCTGGCG
CTCCTGCGGG ACGAGCCCTG GCTGCCCGAG CTGCTGGACA GGCTGCTGCG CGGGATCGCG
GTGGCTCCGA CCCAGGCCAA GACCCTGCCG TCGCAGGGGC TGCTGTTCGA GATCGCCCGC
GCGGTGGAAG AGCACCCGAC GCCCGAGGCG ATCTCCGCGC TGCGCGCCGC CCGCCAGGCC
ACCCGGCACG CCGGGGTGCC CAAGCAACTG GACCGCATGT TCAAGCGCAT CGAGGCCGCC
CTGGCCAACC GGCTGGAGGT GGCGTTCCGG ATACCCGACG GGCAGGTACG GCAGGCCGTC
GGCGAGCACA CGGCGGTGAT CTCAACCGAC GGCAAGGTCG AGCTGTCGTG GTGGCACGGG
GACAAGAAGC TGAAGACGGT CCCGGCGGCG GTCAGACGGG AGCACCCCGA GGAGGTCAAG
CGGCTGCGCG AGCTGGCCAA GCAGACCGCG CAGCAGCAGG CCACCCTGAG CAGGGCGCTG
GAAGCCGGAT GCGCCGGCGA GACGGCTCCG CCGTACCGGC AGCTGGAGGG CAACCCGGTC
ACCGATCGGC TGATCTGGGA GTTCGAGGTC TCGCCGGGCG TGTGGCGCAG CGAGCTGGGC
TTGACGGTGC CGGACGTGCC GGTGCGGCTG TGGCATCCGG CCCGCGCGTC CCTGGAGGAG
GTGCGCGCCT GGCGGGAGGT GGTGCAGGGC AAGGAGCTGC GCCAGCCGTA CAAGCAGGCC
TTCCGCGAGG TCTACCTGCT CACTCCCGCC GAGGAGGAGA CGCGCGACCA CTCGCGCCGG
TTCTCCGACC ACCTGCTCCG GTACGGCCAG GCCAAGGCGC TGCTCACCGA TCGCGGCTGG
ACCGGCATGA CGCTGGGCCA CTGGGACTGG TCCGGAGGGT CCGCTGAGTG CACGGCCACC
AAAAAGCTGC CCGGCGGCCT GACCGTCACC TGGGACTTCC ACCTGGACGA GGGGTCCGCC
GAGCGGGACA ACGTCGGCCC CGTCTCCATC TGCGTCAGCG GCGGCATCCG TTTCCTCGCC
GGCGTCTCGC CGGTCCCGCT GGCCGAGGTT CCTCCGCTCA TCCTGTCGGA GGCGCTGCGA
GACGCCGACC TGGTCGTCGG CGTCACCTCC ACCGGCCTGG ACCCCAATGG TCACGGGGAC
TACTGGCAGT CCTACAGCTT CGGCGACCTG GCCGAGAGCG CCCAGGTCCG GCGCGACGCG
CTCTCCCGGT TGATCGGGCG TACGGCCATC GCCGACCGGT GCGCCATGAC CGACCGCTTC
CTGGTGGTCC GGGGTGATCT ACGCACCTAC AAGATCCACC TGGGGTCCGC GAACATCCTG
ATGGAACCCA ACGACGCCTA CCTGTGCATC GTCTCCGCCC GCGACCGCCA CGCCGGACTG
TTCCTTCCGT TCGAGGAGGA CGGCCGGTTG GCGCTCATCC TCAGCAAGGC TTTCCTGCTG
GCGAACGACA CCGCCATCAC CGACCCCTCC ATCACCCGCC AGATCCGGGC TTGA
 
Protein sequence
MHLSPLAQRL HNLITGTAND PTYWPEASQL SAGLGAAVRT VDGSLISAPY VVDEQRLAVA 
ARELVGYLLD EHRRYEPFSV MAAAGLAGMD AEVSAQFAEK RGPIAVAEME VVFGWDVELR
ALLAEAMAKA MRERPDHAAP LPAVIEWLGG RPGYLDFART VLEAAEARVT AIHAGEIPYS
AEKAFDDGEK RTIGRAVRLA LLRDEPWLPE LLDRLLRGIA VAPTQAKTLP SQGLLFEIAR
AVEEHPTPEA ISALRAARQA TRHAGVPKQL DRMFKRIEAA LANRLEVAFR IPDGQVRQAV
GEHTAVISTD GKVELSWWHG DKKLKTVPAA VRREHPEEVK RLRELAKQTA QQQATLSRAL
EAGCAGETAP PYRQLEGNPV TDRLIWEFEV SPGVWRSELG LTVPDVPVRL WHPARASLEE
VRAWREVVQG KELRQPYKQA FREVYLLTPA EEETRDHSRR FSDHLLRYGQ AKALLTDRGW
TGMTLGHWDW SGGSAECTAT KKLPGGLTVT WDFHLDEGSA ERDNVGPVSI CVSGGIRFLA
GVSPVPLAEV PPLILSEALR DADLVVGVTS TGLDPNGHGD YWQSYSFGDL AESAQVRRDA
LSRLIGRTAI ADRCAMTDRF LVVRGDLRTY KIHLGSANIL MEPNDAYLCI VSARDRHAGL
FLPFEEDGRL ALILSKAFLL ANDTAITDPS ITRQIRA
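
The relationship between the gene sequence and the protein sequence, and the GC content field, can be reproduced with a short script. This is a minimal sketch and not the tool the database used: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the codon table is built from the amino acid string of NCBI translation table 11 (identical to the standard code for elongation), and the special handling of alternative start codons is ignored because this gene begins with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, last position
# varying fastest. The amino acid string is that of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCACCTCT CCCCCCTCGC GCAGCGCCTG"))  # MHLSPLAQRL
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the protein sequence and the GC content field listed in the record.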