Gene Sros_2372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2372 
Symbol 
ID8665655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2561974 
End bp2563929 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003338095 
Protein GI271963899 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.214126 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCCC GCCGTCGATT CATCCGGCTC GCCGCGGGCT CGGTCGCCAC GGGCGCCGTC 
GCCGCGGTCA TCCCGGCCGC ATCCGCCCAC GGAGCAACCG GGCAGCCCGA GGGAACGCTC
ACCGATCTCG GCCCCGCGAG CGTCACCAAC GCCCTGGGGA ACGCCGAGTT CGCCGGCGAC
GTCCTCTACG CCGCCACCCG GGGGCTGTCC CCCAACGTGG TGGGCTCCTA CGACCTGGCC
GCGGACGCGG TCACCGCCCA TTTCGACATC CCCACCGGCA TCGGCGTGTG GGCGATGTGC
GCCACCGGCA CCGACGTGTA CGTCGGCACC CACGGCGGCG ACTCCGACCT CTACCGGCTC
GACACCCGCA CCGGCGCCGT CACCAAAGCG GCCGGCTACT CCGACGACTA CATCTGGGCC
ATGGCGGCCT CGCCCGACGG CAAGGTCTAC ATGGGGCTGT CCCCGACCGG CCGGGTGGTC
GAGTACGACC CGGCCACCGG CACGAGCCGC GACCTGGGCG TCGCGACGCC GGGCGAGCAG
TACGTGCGCA GCGTCGCCGC CGACGCCACC ACCGTCTACG CGGGCGTCGG CGCGCACGCC
CATCTGGTGG CGATCGACCG CGCCACCGGC GCGAAGCGGG AGATCCTCCC GGCGGAGCTG
GCGAGCCGGG ACTTCGTGGC GAGCATGGCC ATCTCCGACA CCCACCTCGC GGCGGGCATC
TCGTCGTCCG GCGAGGTGCT CGTGCTGTCC ACCTCCGACC CCGCCGACCA CCGGATCCTC
AAGGCGACCG CGGCCGGGGA GAAGTACGTG ACCTCGGTGG TCATCCACGA CGGATACGTC
TACTTCGCCG GCAGGCCGTC GGGCGCCCTG TACCGGTGCC CGCTGTCCGG CGGCGAGGTC
GAGTCGCTCG GCGTGGCCTA CCCCGAGGCC GCCACCCACC GGCTCCTGGT CCACGACGGC
CGCGTCTACG GCGTCCAGGA CGGCGCGGTC TTCGTCTACG ATCCGGCGAC CGGCTCGCTG
GAGTACCGCA ATCTCGTCCA GCGCGGCTTC CGGGCCGCCC CCGAGCAGCC GATGTCCGTG
CACTCCGACG GCCGCCGCGT CTACGTGGGA GGGAAGGGCG GCGCGGACAT CCACGACGTG
GCCGCGGGGA CCCGTACCCG GCTGGGCGTC CCGGGCGAGC CGAAGACGGC GCTGACGCTC
AAGGACACCA CGTATCTCGG CGTCTACACC CAGGGCCTGC TCTACGCCCA CCGGGCCGGC
GAGAGCTCGG CACGGCTGCT GGCCCGCACC GGCAACCAGC AGGACCGGCC GCGCGACCTG
GCCTACGACG CCCTGACCGG GCTGATCGTG ATGCCCACCC AGCCCGAGCC CGGCCACATC
AACGGGGCGC TGTCGCTCTA CTCCCCGCGC ACCGGGAAGT TCGACACCTA CCGGCCGGTC
GTCGAGCGCC AGAGCGTGTA CTCGGTGGCC ACCCGGCGCG GCACGGCCTA CCTCGGCACG
AACGTCCAGG AGGGCCTCGG GCTGCCCCCG GTGACCACCA CGGCCCGCCT GGCGGCCTTC
GACCTGCGAG GACGCAAGCT CCTCTGGGAG CTCGAACCGG TGCCCGGCGC ACGGTACGTC
GCCGCCCTCG GGCAGACCCC CCTGGCCCTG TACGGCCTGA CCAACACCGG CGTGCTGTTC
GAGTACGACT TCCGGCGCCG CCGGGTCACC CGGACCGCCA AGGTCGCCGG CCGCGGAGGC
GAGCTCGTCG TCACCGGGGC GGTGGCCTAC GGCACCGACG GTGACAGCGT CTACAAGGTC
GACCTGCTCC GGCTCACCAC GACGACGATC GCCGACGGCC TGGCGGGCGA GTGGTTCGGC
GGGGAGCCGA AGCTCTCCCT CGACCCGTCG GGCCGCGCGC TGTACGGCCT GCGTGGCCGC
AACCTCGTCC GCATCGCGAT CTCCGGCCGG CGCTGA
 
Protein sequence
MIPRRRFIRL AAGSVATGAV AAVIPAASAH GATGQPEGTL TDLGPASVTN ALGNAEFAGD 
VLYAATRGLS PNVVGSYDLA ADAVTAHFDI PTGIGVWAMC ATGTDVYVGT HGGDSDLYRL
DTRTGAVTKA AGYSDDYIWA MAASPDGKVY MGLSPTGRVV EYDPATGTSR DLGVATPGEQ
YVRSVAADAT TVYAGVGAHA HLVAIDRATG AKREILPAEL ASRDFVASMA ISDTHLAAGI
SSSGEVLVLS TSDPADHRIL KATAAGEKYV TSVVIHDGYV YFAGRPSGAL YRCPLSGGEV
ESLGVAYPEA ATHRLLVHDG RVYGVQDGAV FVYDPATGSL EYRNLVQRGF RAAPEQPMSV
HSDGRRVYVG GKGGADIHDV AAGTRTRLGV PGEPKTALTL KDTTYLGVYT QGLLYAHRAG
ESSARLLART GNQQDRPRDL AYDALTGLIV MPTQPEPGHI NGALSLYSPR TGKFDTYRPV
VERQSVYSVA TRRGTAYLGT NVQEGLGLPP VTTTARLAAF DLRGRKLLWE LEPVPGARYV
AALGQTPLAL YGLTNTGVLF EYDFRRRRVT RTAKVAGRGG ELVVTGAVAY GTDGDSVYKV
DLLRLTTTTI ADGLAGEWFG GEPKLSLDPS GRALYGLRGR NLVRIAISGR R