Gene Sros_5953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5953 
Symbol 
ID8669247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6524713 
End bp6526551 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content71% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003341431 
Protein GI271967235 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.664785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0418717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGTG TCCTTGAAGA GCGTGCGGCG ATCGAGCGGG AGATCGCGGG CCGTACCGTT 
TGCGAGCAGT TGAGGGAGAC GGCGGAGCGC AATCCCGACG CTCCGGCCTA CTCCGACCCG
GTCGAGGGGG GCTGGGCCAC CCTCACCTAC GCCGAGGCCC GGCAGCGGAT CCTGGAGATC
GCCGCCGGCT TCGTCGCCCT CGGGCTGCGG CCGGGCGAGG CGGTCGCGCT GATGATGGTC
AACCGCAGCG AGCACGTCCT GGCCGACCTG GGCGCCGTGC ACGCGGGCGG CGTGCCGTGC
TCGGTATACG CCACCTTCAC CCCGGACCAG GTGGCCTTCG TCGCCGGCGA CGTCGGCGCG
AGGATCGTCG TGCTGGGCGG GCCCGCCGAC CTGGCCAGGT GGGAGCCGGT GCTGGACGGG
CTTCCCGGGA TCTCCAAGGT CGTCATGCTG GAGGGCGCCC CGTCCGGCGA CCGCTTCCTC
GGCTGGGAGG AGTTCCTGGC CCTGGGCCGC GCGCGGCTCG CCGAGGACCC CGCCTCGATC
GAGGACCGCT GGCGGGCCGT GACCGCCGAC GACACCCTGA CGGTCCTGTA CACCTCGGGC
ACCACCGGCA ACCCCAAGGG CGTGCCGCTG ACCCACGCCA ACGTGTTCTT CGAGGTGGCG
GCGACCAGCC GGATGGTCGC GCTGCCCGAC AGGGGCACCC AGATCTCCTA CCTCACCTAC
GCCCACATCG CCGAGCGGGT GCTCAGCCTC TACCTGCCGC TGTTCAAGAT CTCCCACACC
CACTTCTGCA CCGACCTCGC CCAGCTCGGC GCCACCCTCG GCCAGGTCAA GCCGGTGCTG
TTCTTCGGTG TCCCGCGCGT CTGGGAGAAG ATGATGGCCC GCCTGCAGGC GCTGCTGGCC
ACCCAGCCGG AGGAGCAGCA GGAGAACGTG CGCAACGCCA TGGCCGCCGG TCTCGCCCAC
GTCGAGGCCA GCCAGTACGG GCGGACCCCC TCGCCCGAGG TCCAGGCCGC CTACGAGAAG
GCCGACGCCG CGCTGCTGTC GATCATCCGC TCGATGATCG GCCTCGACAA CGCGGCCTGG
CTGGCCAGCG CCGCCGCCCC GATGCCGCTG GAGGTCCAGC GCTTCTTCGC CGGCCTCGGC
ATGCGCGTGA TCGACGTGTA CGGCATGACC GAGACGACGG GCGCGTTCAC CGCCAACGCC
CCCGACCGCT TCAAGCTCGG CACGGTCGGC CAGGCGGGCC CGGGCGTCGA GGTCCGCATA
GCCGAGGACG GGGAGATCGT CACCCGCAGC CCCGCCAACG CCCGCGGCTA CCTGAACCGG
CCCGAGGCGA CCGCCGAGCT GCTGGACGAG GACGGCTGGC TGCACACCGG CGACGTGGGC
TCCATCGACG AGGACGGCTT CGTCCGCATC GTCGACCGCA AGAAGGAGCT CATCATCACC
TCCGGCGGCG AGAACATCTC ACCGGCCAAC ATCGAGAACT ACCTCAAGGA GCACCCCCTC
GTCGGACAGG CCCTGGCCTA CGGCGACGGC AGGCCGTACC CGGTGGCGGT CCTGACCCTC
GACGGCGAGG TCGCCCCCGG CTGGGCGCAG GGCCGCGGCA TCGAGTTCAC CACCCTGGCC
GACCTCGCCG AGCACCCCGA CGTCCTCAAG GTCGTCGAGG CCGCGGTCGC CACGGCCAAC
GACAAGCTCG CCCGCGTCCA GCAGGTCAAG CGCTGGCGTC TGCTGCCGGT GGAGTGGACG
GCCGAGACCG AGGAGCTCAC GCCGAGCCTG AAGCTCAAGC GCCGCGTCAT CCACGCCAAG
TACGCCGAGA TCATCGATGG AATGTACGAG AGCAGCTAG
 
Protein sequence
MAGVLEERAA IEREIAGRTV CEQLRETAER NPDAPAYSDP VEGGWATLTY AEARQRILEI 
AAGFVALGLR PGEAVALMMV NRSEHVLADL GAVHAGGVPC SVYATFTPDQ VAFVAGDVGA
RIVVLGGPAD LARWEPVLDG LPGISKVVML EGAPSGDRFL GWEEFLALGR ARLAEDPASI
EDRWRAVTAD DTLTVLYTSG TTGNPKGVPL THANVFFEVA ATSRMVALPD RGTQISYLTY
AHIAERVLSL YLPLFKISHT HFCTDLAQLG ATLGQVKPVL FFGVPRVWEK MMARLQALLA
TQPEEQQENV RNAMAAGLAH VEASQYGRTP SPEVQAAYEK ADAALLSIIR SMIGLDNAAW
LASAAAPMPL EVQRFFAGLG MRVIDVYGMT ETTGAFTANA PDRFKLGTVG QAGPGVEVRI
AEDGEIVTRS PANARGYLNR PEATAELLDE DGWLHTGDVG SIDEDGFVRI VDRKKELIIT
SGGENISPAN IENYLKEHPL VGQALAYGDG RPYPVAVLTL DGEVAPGWAQ GRGIEFTTLA
DLAEHPDVLK VVEAAVATAN DKLARVQQVK RWRLLPVEWT AETEELTPSL KLKRRVIHAK
YAEIIDGMYE SS