Gene Sros_4446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4446 
Symbol 
ID8667740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4960653 
End bp4961780 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content72% 
IMG OID 
ProductSite-specific recombinase XerD-like protein 
Protein accessionYP_003340059 
Protein GI271965863 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000279305 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00122928 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGCCAG CGGATCTGAA TCGTCTCACC GTCCAGGAGT CGGCCGATCG CTATGTGGAG 
CTGATCCGCG CCAAGACGCT GACGGGTGCG CTGTCGTCGG CGACGGCGGA GGTCTATGCG
CGGGATGTGG CCACGTTCGT ACGGCTGGCG GGTGAGGGGA AGGTTCTGGA TGACCTGACC
GGCGAGGATG TGGATGCGAT TTTGCTGGCT TTCGCGCGTA AGCCCGACGG GCGGCGGGCC
GAGCCGGGTG CCGGGCTGCA GTCGGCCTCC TCCCAGTCGC GTTTTCGCCG GTCGGTCTCG
GCGTTGTTCA AGCACGCGAC GCTGACGGGC TGGGTGCAGA TCAATCCGAT GGCGCTGGCG
ACGGTGACGG CCAAGGAGCG CGGGGGGTTG CGTCCGGAGC GGCGGGCGCT GACCCGTGAG
CAGGCGCAGG GGCTGATCGG CGCGGCGCGT GATCTCAGTG GTCACCGCGG CGACGGCGGC
GGCCGGGGTG GTGCGCGGGG TGAGGCGGGG GCTTTGGGCG GCACAGTGGC CGCGCCGGGG
GCCGCGCCGG GGGCCGACGG CTCGCCGGCG GGTGCGGGGC GTGGCCGACG GCGGGACCAG
CGCACCGAGC TGCGCGATGC GCTGATCGTG CTGCTGCTGT CGACGATGGG GCCGCGGGTC
TCGGAGCTGG TGCGGGCGAA TGTGGAGGAT TTCTACACCA ACGACGGGGT GCGTTACTGG
CGGATCTTCG GTAAGGGCGG CAAGACGCGT GATGTGCCGC TGCCGGGTGA TGTGGCCCGT
GTGCTGGAGG CCTATCTGCT GGAGCGGGGC CGGGCCGAGG TGCAGGACAA GGCGCTGCTG
CTGTCGTGGC GGGGCAGGCG CCTGGCGCGG GGGGATGTGC AGGCGGTGAT CGACCGGGTG
CAGCGGCGGG TGGATCCCGA TCGGCGGCGT TCGGTGACGC CGCATGGTCT GCGGCATACC
ACCGCCACCC ATCTGCTGGC CGACGCGGTG GACATGGACG CGGTGCGGCG GGTGCTGGGG
CACAGTGATC TGGCGACGTT GGGCCGCTAC CGCGACGAGC TGCCCGGCGA GCTGGAGGTG
GCGATGCGGG CTCATCCGCT GCTGCGCGGG CCGTCCGGCG GCGGGTGA
 
Protein sequence
MRPADLNRLT VQESADRYVE LIRAKTLTGA LSSATAEVYA RDVATFVRLA GEGKVLDDLT 
GEDVDAILLA FARKPDGRRA EPGAGLQSAS SQSRFRRSVS ALFKHATLTG WVQINPMALA
TVTAKERGGL RPERRALTRE QAQGLIGAAR DLSGHRGDGG GRGGARGEAG ALGGTVAAPG
AAPGADGSPA GAGRGRRRDQ RTELRDALIV LLLSTMGPRV SELVRANVED FYTNDGVRYW
RIFGKGGKTR DVPLPGDVAR VLEAYLLERG RAEVQDKALL LSWRGRRLAR GDVQAVIDRV
QRRVDPDRRR SVTPHGLRHT TATHLLADAV DMDAVRRVLG HSDLATLGRY RDELPGELEV
AMRAHPLLRG PSGGG