Gene Sros_3476 details


Gene Information

Locus tag: Sros_3476
Symbol:
ID: 8666764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3846415
End bp: 3848043
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: peptide arylation protein
Protein accession: YP_003339155
Protein GI: 271964959
COG category:
COG ID:
TIGRFAM ID:
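
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(3846415, 3848043)
print(length, protein_length_aa(length))  # prints: 1629 542
```

Both results agree with the Gene Length and Protein Length fields of this record.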


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00809458
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00477913
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCCTGC ACCTGGACGA CCTGGTCCCC GCCACGTTGA GACGGGGGTG GGCCGCCGAC 
GGCACCTGTC CCGACCTCGA CCTGTACGCG CTGTTCCGCA CCCACCGCCT CGCGGCCCCC
GGCCGGGTCG CGGTGATCGA CGCGGCGGGC GAGCTCACCT ACGCCGAACT GGACCACCTC
GCCCGCGACG CGGCGGCCGG ACTGCGGAGA CTCGGCGTCT GCGAGGGCGA CGTCGTCGGC
GTCCAGCTGC CCAACGGCCG GGACGCGGTC GTCGCCGACC TGGCCCTGGC CGCGCTCGGC
GCCGTGGCGC TGCCCTTCCC CGTCGGGCGC GGCATCACCG AGGCGGTCTC CCTGCTGGGG
CGCTCCGGCG CGCGGGCCGT CATCGCGGCC ACCGAGCACC GGGGGACGGC GCACGCGAGC
GAGCTGGTGG CCGCCGCCGA GCGACTGCCC GGCCTGCGGG CCGTCGTGGC CGCCGGTCCG
CAGGAGCCTC CGGCGGGCAG TGCGGCATGG AGCGAGGTCC TCTCCGCCGA CGGCCGCGCC
TTCGTCCCGG CGCGGCCCGA CCCGGACGGC GCGGCCCGCA TCCTGGTCTC CTCCGGTTCG
GAGTCCGAGC CCAAGATGGT CGCCTACTCC CACAACGCGC TGGCGGGCGG ACGGGGCAAC
TTCATGGCCA CGCTGATCGC CGGCGCGGAA CCGCCCCGCT GCCTGTTCCT GGTCCCGCTG
GCCTCGGCCT TCGGCAGCAA CGGCACCGCC GTCACCCTGG CCAGGCACGG AGGCTCGCTG
GTCCTGCTCG ACCACTTCTC GCCCCGGGGC GCGCTCGCCG CGATCGGCGA GCACCGGCCC
ACCCACGTGC TGGCCGTACC GACCATGATC CGCATGATGC TCGACCAGCC GAGGCCCGGA
CCGATGCCGC CGATGACCGC GCTGGTGCTG GGCGGCGCGG AGCTGGACGC GGCCACGGCC
GCCGAGGCGG GCGGGGTGTT CGGCTGCCCG GTCGTCAACC TGTACGGCTC GGCCGACGGG
GTGAACTGCC ACAGCGGGTT CCGTCCGCCC CCGGTGGGCG ATCGCGGTCC CGGGGTCGTG
GTGGGCCTGC CGGACCCCCG GGTGGCGGAG ATCCGCATCG CCCCCGCCCC GGACGGGAAT
GAGTTCGGCG AGATCATCGC ACGCGGCCCG ATGACCCCGA TGTGCTACGT CGGCGCGCCG
GAACTGAACC GGCGCTACCG CACCGCGGAC GGCTGGGTCC GCACCGGCGA CCTGGGGGTG
ATCGACGCCG ACGGGCGGCT GCGCCTGGTC GGCAGGCTCA AGCGGGTCGT CATCCGCGGC
GGCGCCAACA TCAGCCTGGC CGAGGTGGAG CACGCGCTGG CGACCCACCC CGGGGTGCGC
GAGGCGGTGT GCCTGGGCGT GCCCGACCGG GTGATGGGAG AGCGGCTGGC GGCCTGCGTG
GTGCCGCGCC CCGGCCACGC CCCCGATCTC GCCGTCCTCA CCGCCCACCT GCTCCGGCAG
GGGCTGGAGC GGAGCAAGCA CCCCGAGCAC CTGCTGCTGG TGGAGGAGCT GCCGCTGACC
CCGGCGGGCA AGCCGGACCG GGACGCGCTC CGCGACCTGC TGCTCGGCGG GCGGCGCGGA
TCGGCGTGA
 
Protein sequence
MTLHLDDLVP ATLRRGWAAD GTCPDLDLYA LFRTHRLAAP GRVAVIDAAG ELTYAELDHL 
ARDAAAGLRR LGVCEGDVVG VQLPNGRDAV VADLALAALG AVALPFPVGR GITEAVSLLG
RSGARAVIAA TEHRGTAHAS ELVAAAERLP GLRAVVAAGP QEPPAGSAAW SEVLSADGRA
FVPARPDPDG AARILVSSGS ESEPKMVAYS HNALAGGRGN FMATLIAGAE PPRCLFLVPL
ASAFGSNGTA VTLARHGGSL VLLDHFSPRG ALAAIGEHRP THVLAVPTMI RMMLDQPRPG
PMPPMTALVL GGAELDAATA AEAGGVFGCP VVNLYGSADG VNCHSGFRPP PVGDRGPGVV
VGLPDPRVAE IRIAPAPDGN EFGEIIARGP MTPMCYVGAP ELNRRYRTAD GWVRTGDLGV
IDADGRLRLV GRLKRVVIRG GANISLAEVE HALATHPGVR EAVCLGVPDR VMGERLAACV
VPRPGHAPDL AVLTAHLLRQ GLERSKHPEH LLLVEELPLT PAGKPDRDAL RDLLLGGRRG
SA
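
The sequences above are laid out in space-separated blocks of 10 characters, which must be stripped before any analysis. The following is a minimal sketch of that cleanup, with a GC fraction calculation and a stop codon check. It assumes the stop codons of translation table 11 (TAA, TAG, TGA). All function names are hypothetical, and the input strings are short excerpts copied from the gene sequence, not the full 1629 bp.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence (excerpt only).
head = clean_sequence("ATGACCCTGC ACCTGGACGA")
print(len(head), gc_fraction(head))  # prints: 20 0.6

# Final line of the gene sequence.
print(ends_with_stop(clean_sequence("TCGGCGTGA")))  # prints: True
```

Running `gc_fraction` on the full cleaned gene sequence gives a value to compare with the GC content field of this record.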