Gene Sros_2402 details


Gene Information

Locus tag: Sros_2402
Symbol:
ID: 8665688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2596599
End bp: 2597894
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: McrBC 5-methylcytosine restriction system component-like protein
Protein accession: YP_003338124
Protein GI: 271963928
COG category:
COG ID:
TIGRFAM ID:
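
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, not stated in the record) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2596599
end_bp = 2597894
gene_length_bp = 1296
protein_length_aa = 431

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop.
codons, remainder = divmod(gene_length_bp, 3)

print(span == gene_length_bp)            # True
print(remainder == 0)                    # True
print(codons - 1 == protein_length_aa)   # True
```

Under these assumptions the three values agree: 1296 bp is 432 codons, and removing the stop codon leaves 431 residues.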


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.073965
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAACCG TGGTCCTGGA CGAGTACCAG AAGACCTGGG TCGGCGATTT GAGTCCCAGC 
GCTACCGACC GGGCAGCGGC CTACTCCGAG GACCTTGGGC AACGGCTGAG ACTGCGCTGG
CTGGCGAGCG GCGAGCTAGA GGTCGAAGCG ACCTCGTACA TAGGAGTCGT CGCCCTTGAC
TGCGTCACTA TCCACATCCG ACCCAAGCTG GTCGGCCGCG AGCTGGCCGT ATTGAGGATG
CTCGACTACG CAAGCGGGTT ACCTGCGCTC CGGCATATGG ACCGCCTGCG CAACCTCCCG
AACCAGGGGC ACGACCTTCG GGATCTGATC TGCCTGCTGC TGACCGTCGA ATGTGAGGCA
TTGGTACGGC ACGGGCTGCG GCGCGACTAC ATCCGGCGGC AGGAGACTCT GCCCGCGATA
CGCGGCCGGC TACTTGCCGA CCAGCAGGTG CTGCGCCGAT TCGGCCGACT GGACAGGTTG
GAATGCCGAT TCGACGAGTT CGACAGCGAC ATCCTCGACA ACCGGCTGTG CGCGGCAGCT
CTGCGAGTGG CCGCGCACAG CGCTCGCGAT GAAGCACTGC GCGCCCGGGC CCGCCGCGTG
GCGACCGACT TCTCCGAAGT CTGCACCACA GATGGCCTTG ACGTGCGCTG GGTCGCGCAA
CACCTGACCT ACCACCGCCC CAACGAGCAC TACCGCCAGG CACATCGGTG GGCACTACTC
CTGCTGCAGG CCCCCGGCTT TACCGATCTC CTCTCCACCG GCGGACCGTC CTCCCGCACA
TTCATGCTGG ACATGAACAG CTTGTTCGAA GCCTTTGCCA CGCAGCTGCT CCGCGAGGCC
ACGCACCGCA CCGGCATCGC CGTACGCGCC CAGGAAAGCT TGTCTCGCAG TATCAGCCGT
CCCGACGGCC GCAGCTACAC GTCGATCACC CCGGACATCC AGCTCGTCCA CGGACATGGC
CCAGGCGCCT GGCGACGTTC GGTCGATGTC AAGTACAAGC TCTACGCCGA CCGGACGATC
AAGCCTTCCG ACCTCTACCA GAGCTTCGCC TACGGCCAGG TCTTGAGCAG CGAAGAGACT
CCGACCGCGT ACATCCTTTT CGCCAGTGAC CGGGACGGCG AGCCAGACCA TGTCGTCCTG
CGCCGCCTCG ATGGTGTCCC TGTCGCGAAG CTCATCAGCG TCGGAGTGAA CGTGCCGCGC
GTCCTGGAGA CCCTCGGCAC ACCGAGGCTC CAGCCCTTGC TCAACAGGCT TCTCGATGAT
TTGGGCGGTC ACATCCAGTG GGCTTTTTCA ACCTGA
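
The GC content listed in the Gene Information table can be recomputed from the gene sequence. The following is a minimal sketch: `gc_fraction` is a hypothetical helper, not part of any tool behind this record. It strips the spaces and line breaks used in the block layout above, then counts G and C bases.

```python
def gc_fraction(text: str) -> float:
    """Return the fraction of G/C bases in a sequence given as blocked text."""
    # Remove the spaces and newlines used to group bases into blocks of ten.
    seq = "".join(text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Small illustrative input; paste the full gene sequence to check the record.
print(gc_fraction("GGCC ATAT\n"))  # 0.5
```

Rounding the result for the full sequence to a whole percentage gives a value that can be compared with the GC content field.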
 
Protein sequence
MRTVVLDEYQ KTWVGDLSPS ATDRAAAYSE DLGQRLRLRW LASGELEVEA TSYIGVVALD 
CVTIHIRPKL VGRELAVLRM LDYASGLPAL RHMDRLRNLP NQGHDLRDLI CLLLTVECEA
LVRHGLRRDY IRRQETLPAI RGRLLADQQV LRRFGRLDRL ECRFDEFDSD ILDNRLCAAA
LRVAAHSARD EALRARARRV ATDFSEVCTT DGLDVRWVAQ HLTYHRPNEH YRQAHRWALL
LLQAPGFTDL LSTGGPSSRT FMLDMNSLFE AFATQLLREA THRTGIAVRA QESLSRSISR
PDGRSYTSIT PDIQLVHGHG PGAWRRSVDV KYKLYADRTI KPSDLYQSFA YGQVLSSEET
PTAYILFASD RDGEPDHVVL RRLDGVPVAK LISVGVNVPR VLETLGTPRL QPLLNRLLDD
LGGHIQWAFS T
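
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG is an accepted start codon and an initiating codon is read as methionine. The sketch below illustrates this with a hand-built codon table; `translate_cds` is a hypothetical helper, and treating any table 11 start codon at position 0 as M is a simplifying assumption.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGAACCG TGGTCCTGGA CGAGTACCAG"))  # MRTVVLDEYQ
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence allows the same comparison against the whole protein.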