Gene Sros_3622 details


Gene Information

Locus tag: Sros_3622
Symbol:
ID: 8666910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4017557
End bp: 4018948
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: TPR repeat-containing protein
Protein accession: YP_003339296
Protein GI: 271965100
COG category:
COG ID:
TIGRFAM ID:
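
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, although it matches the listed gene length) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
gene_len = gene_length_bp(4017557, 4018948)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1392 463
```
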


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACAAGG ACGAGCACGG GCTGGCGATG AGCTCCGCGA GCGACGCCGC CGCCCGCCAC 
TACGACCGGG CCGTCCACGA ACTGCTGCAC TTCCGCGCGG AGGTGGACGC CGAGGCGGAG
GCGGCGCTGG CGGAGGACCC CGGCTTCCCG ATGGGCAACG TCCTGGCCGC CTACCTCGGC
CTGCTGACCA CCGAGCCCGC GGATGCGCGG CGCGCGCGAG AGAGGTTCGC GCGGTTCCGC
TCCGGCGTCG ACGTGACGGC CCTGCCGTCC CGCGAGCAAG CGCACGTGCA TGCCGTACAG
GCGCTCCTGG ACGGCGACCT CCTCACCTGC GGGACCCTGC TGGGACGGAT CACCGAGGAG
CATCCGAGGG ACGCGCTGGC GCTGATCGCC GGGCACCAGA TCGACTTCCT CACCGGCGAC
GCGCGGGCGC TGCGGGACCG GGTCGGGGGC GCGCTGTCGG CCTGGGGCGA GGACGACAGG
CACTACGGCC ACCTCCTCGG GATGTACGCC TTCGGGCTGG AGGAGGCGGG CCACTACGAC
AGGTCCGAGG AGGTGGGCCT GCGCGCGGTG GAGCTCAACC CCAAGGACGT GTGGGGCGTC
CACGCCGTCG CGCACACCTA CGAGATGCAG GGCCGCTTCG GCGAGGGCGT CCGCTACCTC
GACGACAGGC TGGCCGACTG GTCCACCGGC ACGTTCTTCA ACGTGCACAC CTGGTGGCAC
TACTCCCTCT ACGCCCTGGA GGCGGGCGCG ACCGGACGGG TGCTCGACAT CTACGACTCC
GTCCTGGCGG GCGGGGAGAC CGCGATGGAG ATGCTCGACG CCGCGGCCCT GCTCTGGCGC
CTCCACCTGG AGGGCGGCGA CCAGACGGAG CGGTGGAAGG TGCTCTCCGA CACCTGGGTG
CCCAGGATGG AGGAGCCGTT CTACGCCTTC AACGACATGC ACGCCGTCAT GTCCTACGTG
GGCGCGGGCC GGATCGCCGA GGCCGAGAGG CTGATCGCCG GCCGCGAGGA CTACGTGGCG
GGCGAGCACG CCACGACCAA CCACGCGATG ACCGCCCGGG TCGGCCTGCC CGTCTGCCGG
GCCCTCGTCG CGTTCGGACG GCGCGACTAC GGCGGGGTCG TCGACCTGCT CCACCCGATC
AGGCACCGGA TCAACGAGTT CGGCGGCAGC CACGCCCAGC GCGACGCGGT CCACAAGACC
CTCGTCGAGG CCGCGATCCG GGCGGGACGG AGCGAGGCCC GGGTGCTGGT GAGCGAGCGG
ATCAGCATCC GGCCGCGCAG CCCGTTCAAC TGGCTCAAGC AGAGCGCGGT GGCCGACGAC
CTCGGCGCGC GGGCCGCCGC CCGGGCACGG GCCGAGGAGC TGGTACGGCA GGCGGCCCTC
CCGTTCCGGT GA
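
The GC content listed in the gene information can be recomputed from a sequence laid out as above, in space-separated blocks of ten bases. The following is a minimal sketch; the helper name `gc_fraction` is illustrative, and rounding the result to a whole percent is an assumption about how the listed figure was produced.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Usage: paste the full gene sequence between the triple quotes.
# Only the first six blocks of the gene sequence above are shown here.
fragment = """
ATGCACAAGG ACGAGCACGG GCTGGCGATG AGCTCCGCGA GCGACGCCGC CGCCCGCCAC
"""
print(f"GC content of fragment: {gc_fraction(fragment):.0%}")
```
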
 
Protein sequence
MHKDEHGLAM SSASDAAARH YDRAVHELLH FRAEVDAEAE AALAEDPGFP MGNVLAAYLG 
LLTTEPADAR RARERFARFR SGVDVTALPS REQAHVHAVQ ALLDGDLLTC GTLLGRITEE
HPRDALALIA GHQIDFLTGD ARALRDRVGG ALSAWGEDDR HYGHLLGMYA FGLEEAGHYD
RSEEVGLRAV ELNPKDVWGV HAVAHTYEMQ GRFGEGVRYL DDRLADWSTG TFFNVHTWWH
YSLYALEAGA TGRVLDIYDS VLAGGETAME MLDAAALLWR LHLEGGDQTE RWKVLSDTWV
PRMEEPFYAF NDMHAVMSYV GAGRIAEAER LIAGREDYVA GEHATTNHAM TARVGLPVCR
ALVAFGRRDY GGVVDLLHPI RHRINEFGGS HAQRDAVHKT LVEAAIRAGR SEARVLVSER
ISIRPRSPFN WLKQSAVADD LGARAAARAR AEELVRQAAL PFR