Gene Sros_5079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5079 
Symbol 
ID8668373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5598875 
End bp5600410 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content68% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003340611 
Protein GI271966415 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.11462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAATC TGTCGATCAT TCTGGAAGAC AGCGCCCGTA ATACTCCAGA CCGCACCGCC 
CTGGTCTTCG GCGACCTGCG CCTGCCGTAC TCCATGGTGG ACTCCGTCGC GAACCAGGTG
GCCAACCTCC TGGTGGCCCG GGGGATCGGC AAGGGCGACA AGGTCGCGCT GCTCTGCCCG
AACCTGCCCT ACTTCCCCTT CGTCTACTTC GGCATCCTCA AGGCCGGGGC GACCGTCGTC
CCGCTCAACG TGCTGCTGCA GCCGCGCGAG ATCACCTACC ACCTGACCGA CAGCGACACC
AAGGCGCTCT TCTGCTTCGA GGGCTCCCCT GAGCTGCCCA TGGGCGCGCG CGGTCGGGAG
GCCTTCGACG CCGCCGAGGG CTGCGAGCTC TTCTTCGTCC TGCCCGCCAC GCCGCTGGCC
ACCGAGTCGG AGTACGGTGA GTCGTTCTGG GCGGCGCTGG ACGGCATGTC CGGGGAGTTC
GAGACAGTAC AGACCGAGCC GGACGACACC GCAGCCATCC TCTACACCTC CGGCACCACC
GGCCGGCCCA AGGGCGCCGT GCTGACCCAC ATGAACATGC TGACCAACAC CATCGTCAGC
GACGAGATGT TCCCCGCCGA TCCCCGCGGC GACGTCTCAC TCGCGGTGCT GCCGCTCTTC
CACTCCTTCG GCCAGACCGC GGTCATGAAC GTGAGCGTGC GCCGCCGCGC CACCCTCGTG
CTCCAGCCAC GCTTCGAGTC CGGCGAGACG CTGAAGCTCA TGCGCGAGGA GAAGGTGACC
ATGTTCGCCG GGGTGCCGAC GATGTTCTGG GCGCTGCTGT CGAAGATCCA CGCCGACGGG
GACGAGGCGC CCTCGACGCT GCGGGTGGCG GTGGCGGGTG GGGCCGCCTG CCCGGTCGAG
GTGCTCAAGG ACTTCGAGGG CACCTTCGGC ATCCCGATCC TGGAGGGCTA CGGGCTGTCG
GAGACCTCCC CGGTGGCCAG CTTCAACCAG CTCGGCAGGC CCACCAAGCC CGGCACCATC
GGCTTCCCCG TCTGGGGCGT GCAGATGCGG CTGGTGGACG ACGGCTGGAA CACCATTGAG
GGTGAGGGTT CCGGCGAGAT CGCCATCCGC GGGCACAACG TCATGAAGGG CTACTACGGG
CGGCCCGAGG CCACCGAGGA GGTCATGCGG GACGGCTGGT TCCGCACCGG TGACATCGCC
ACCTGCGACG AGGACGGCTA CTACACCATC ATCGACCGCA CGAAGGACAT GATCATCCGG
GGCGGCTTCA ACGTCTACCC GCGCGAGCTG GAGGAGGTGC TGATGACGCA CCCGGCGGTC
TCGCTGGTGG CGGTGGTCGG CGTGCCCCAC GACTCGCACG GCGAGGAGAT CAAGGCCTAC
GTCATCCCTG CGCCCGGCGC GACGGCGAGC GAGAGCGAGC TCATCGCCTG GTGCAAGGCG
AACATGGCGG CCTACAAGTA CCCGCGGATC GTCGAGTTCC GCGAGAACCT GCCGATGACG
GCGACCGGCA AGATCCTCAA ACGCGAGCTG CGCTAG
 
Protein sequence
MLNLSIILED SARNTPDRTA LVFGDLRLPY SMVDSVANQV ANLLVARGIG KGDKVALLCP 
NLPYFPFVYF GILKAGATVV PLNVLLQPRE ITYHLTDSDT KALFCFEGSP ELPMGARGRE
AFDAAEGCEL FFVLPATPLA TESEYGESFW AALDGMSGEF ETVQTEPDDT AAILYTSGTT
GRPKGAVLTH MNMLTNTIVS DEMFPADPRG DVSLAVLPLF HSFGQTAVMN VSVRRRATLV
LQPRFESGET LKLMREEKVT MFAGVPTMFW ALLSKIHADG DEAPSTLRVA VAGGAACPVE
VLKDFEGTFG IPILEGYGLS ETSPVASFNQ LGRPTKPGTI GFPVWGVQMR LVDDGWNTIE
GEGSGEIAIR GHNVMKGYYG RPEATEEVMR DGWFRTGDIA TCDEDGYYTI IDRTKDMIIR
GGFNVYPREL EEVLMTHPAV SLVAVVGVPH DSHGEEIKAY VIPAPGATAS ESELIAWCKA
NMAAYKYPRI VEFRENLPMT ATGKILKREL R