Gene Sros_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1950 
Symbol 
ID8665232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2091615 
End bp2092841 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003337681 
Protein GI271963485 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.275408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATTC TTCTTGTAGG GGCCGGCGGG GTCGGCTCCG CTGTGGTTCC GATCGCCGCG 
CGCCGAGATT TCTTCGAACA CATCGTGGTG GCGGACTCCA AACAGAGCCG CGCCGCCGAC
GCCGTGGCCA AGATCGGTGA TCCGCGCTTC AGTGCCATCG GGCTGGACGC CTCCGACCAG
GCGGCGGTCG AGGCCGCCCT GGCCGAGCAC CGCTGTGACG TCCTCTTCAA CGCCGTGGAT
CCCCGTTTCA CCATGTCCCT TTTCCGGGCG GCGCTCAACG CCGGGGCGCA CTACCTCGAC
ATGGCGATGT CACTGTCGCG GCCCCACCCC CGCAGGCCGT ACGAGCTGAC CGGCGTGAAG
CTGGGCGACG AGCAGTTCGC GCTCGGCGAC GCCTGGCGCG ACAGGGGCAC GCTGGCGCTG
GTCGGCATGG GGGTGGAGCC CGGCCTCGCC GACGTGTTCG CGCGCTACGC GGCCGAGCAC
CTCTTCGGGA GCATCGAGGA GATCGGCATC CGCGACGGGT CGAACCTGGT GGTCGAGGGC
TACGACTTCG CGCCGACCTT CTCGATCTGG ACGACCATCG AGGAGTGCCT CAACCCGCCG
GTGATCTGGG AGAACGGCGG CTGGCACACC ACCGAGCCGT TCAGCGAGCC GGAGGTCTTC
GACTTCCCCG AGGGGATCGG CCCGGTCGAG TGCGTGAACG TCGAGCACGA GGAGGTGCTG
CTCGTGCCGC GCTGGATCGA CACCAAGCGG GTGACGTTCA AGTACGGCCT CGGCGAGGAG
TTCATCGACG TCCTCAAGAC CCTGCACAAG CTCGGCCTGG ACAACGCGGG CAAGATCCGG
GTCGGCGGTG TCGAGACCTC TCCCCGTGAC GTCGTCGCGG CCAGCTTGCC CGATCCGGCC
ACGCTCGGCG ACCGGATGCG CGGCAAGACC TGCGCCGGCA CCTGGGTGAA GGGCGTCGGC
AAGGACGGCG AGCCGCGCGA GGTCTACCTC TACCACGTGG TCGACAACGA GTGGTCGATG
CGGGAGTACG GTTGCCAGGC GGTGGTCTGG CAGACCGCCG TGCACCCGGT GGTCGCCCTG
GAACTGCTGG CCACCGGCGG CTGGTCGGGC ACCGGGGTGC TGGGGCCCGA GGCGTTCGAC
GCGGTGCCGT TCCTGGATCT GCTCAACGCC TACGGCTCGC CGTGGGGAAT GCGCGACCAG
GCCGGCCAGG TCCTGCAGCC CGCCTGA
 
Protein sequence
MRILLVGAGG VGSAVVPIAA RRDFFEHIVV ADSKQSRAAD AVAKIGDPRF SAIGLDASDQ 
AAVEAALAEH RCDVLFNAVD PRFTMSLFRA ALNAGAHYLD MAMSLSRPHP RRPYELTGVK
LGDEQFALGD AWRDRGTLAL VGMGVEPGLA DVFARYAAEH LFGSIEEIGI RDGSNLVVEG
YDFAPTFSIW TTIEECLNPP VIWENGGWHT TEPFSEPEVF DFPEGIGPVE CVNVEHEEVL
LVPRWIDTKR VTFKYGLGEE FIDVLKTLHK LGLDNAGKIR VGGVETSPRD VVAASLPDPA
TLGDRMRGKT CAGTWVKGVG KDGEPREVYL YHVVDNEWSM REYGCQAVVW QTAVHPVVAL
ELLATGGWSG TGVLGPEAFD AVPFLDLLNA YGSPWGMRDQ AGQVLQPA