Gene Sros_3569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3569 
Symbol 
ID8666857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3956593 
End bp3958683 
Gene Length2091 bp 
Protein Length696 aa 
Translation table11 
GC content72% 
IMG OID 
ProductDNA ligase (NAD(+)) 
Protein accessionYP_003339246 
Protein GI271965050 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.228058 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACG CACCCCCCAG CACCCTTTTC GCCGATCAGA CCGCCTATGC CGAAGCCGTT 
CAGCTCGCCC TGGACGCCGC CGCGGCATAC TACGGAGACG GCACGTCCAC GCTGGACGAC
GACGCCTACG ACCGGCTGGT CCGGGGCATC CAGGCGTATG AGGCCGAGCA TCCCGACGAG
GTGCTGCCGA CCTCGCCGAC GGGCAAGGTG GCCGGTGGCG CGGTGGTCGG CGACGTGCCA
CACACGGTGC CGATGCTGAG CCTGGACAAC GTCTTCGGCG CCGGGCAGCT GGCCGACTGG
GCCGCGTCGC TGGAGCGCAG GCTGGGCCGG CCGGTCACGG AGTGGAGTGT GGAGCCGAAG
CTCGACGGGC TGGCGATCGC CGCGCGATAC CGCGGCGGGC GGCTGGTGCA GCTGGTCACC
CGGGGCGACG GCACCGCGGG TGAGGACGTG AGCCACGCGA TCGGCACCAT CGTGGGACTG
CCCGCCAGGC TGGCCGAGCC CGTCACGGTG GAGCTGCGCG GTGAGGTGAT GATGACCGCG
TCGCAGTTCG AGGATGCCTG TGTCAAGCGC CAGGCGCACG ACGGCACCAC GTTCGCCAAC
CCGCGCAGCG CCGCCGCCGG CACGCTGCGC GCCCAGGACC GCCCCTACGT GTGCGAGCTG
ACGTTCTTCG GCTACGGCGC GCTGCCGTAC CCCGACGACA CCTCCGAGCA GGCCGTGCAG
CTGCGCGAGC TGCCGCACAG CGAGGTCATG ACGTGGGTCG CCGGCCAGGG CGTGCAGACG
CCCGCCGTCA CCGACGTCGG CGGGATCGTC GCCACCACCC TGGAGCAGAT CCAGGAGCGG
GTCGAGCAGA TCGCCGCCAA GCGGACCGAG CTGCCCTTCG GCATCGACGG CATCGTGATC
AAGTGCGATC TCGCCGCCGA CCAGGCGCAG GCCGGCTTCA GCTCGCGCGC TCCCCGCTGG
GCGATCGCCT ACAAGCTGCC CGCCACCGAG AAGATCACCA AGCTGCTCGG CGTCGAGTGG
AACACCGGCC GCAGCGGCAT CATCGCCCCG CGCGCCGTGC TGGAGCCGGT CGAGCTCGAC
GGCAGCATCG TCACCTACGC CACCCTGCAC AATGTCGCCG ACATCACCCG CAGGGGCCTG
ATGCTCGGCG ACAGCGTCGT CGTCTACAAG GCCGGCGACG TGATCCCCCG CGTCGAGGCT
CCGGTCGTGC ACCTGCGCAC CGGCGACGAG CAGCCGATCG CGATCCCGCA GGTCTGCCCG
ACCTGCGGCG ACGCGATCGA CGCCTCCCAG GAACGGTGGC GGTGCGTTCG CGGGCGCGCC
TGCCGGGTGA TCGCCTCCAT CGGCTACGCC GCCGGCCGCG ACCAGCTCGA CATCGAGGGC
CTGGCCGAGA ACCGCATCCA GCAACTGCTG GACGCCGGGC TGATCGTCGA CTTCGCCGAC
CTGTTCTACC TGACCCGCGA GCAGGTGCTG GGCCTGGAGC GGATGGGCGC CACCAGCACC
GACAACCTGC TGGCCGCCAT CGAGCGGGCC AAGGCGCAGC CGCTCAGCAG GGTGTTCTGC
GCGCTGGGCG TGCGCGGCAC CGGCCGTTCC ATGAGCCGCC GCATCGCCCG GCACTTCGGC
AGCATGGACG CCATCCGCGC GGCCGACGCC GAGGCGATCG AGGCGGTGGA GGGCATCGGC
CCGAAGAAGG CGCCGGGGGT GGTGGCCGAG CTGGTCGAGC TCGCCGACCT CATCGACAAG
CTGGTCAAGG CCGGGGTGAC CATGACCGAG CCGGGCTGGA CGCCACCCGC CGAGCCCGGC
GCCGACGCGC AGGCGGACCC GGAGGCGGGC GGCACGTCCG GGCTGCCGCT GGCCGGGATG
TCCGTCGTGG TGACCGGCGC GATGAGCGGA GCGCTGGAGG CGCTGACCCG CAACGAGATG
AACGAGCTGA TCGAACGCGC CGGAGGCAAG GCGTCCTCCA GCGTCTCGGC GAAGACGTCG
CTGCTGGTGG CCGGGGAGAA GGCGGGTTCC AAGCGGGCCA AGGCCGAGAG CCTGGGCGTG
CGGATCGTCG GCCCGGAGGA GTTCGCCGGC CTCGTCGGCC CGTTCATCTG A
 
Protein sequence
MNDAPPSTLF ADQTAYAEAV QLALDAAAAY YGDGTSTLDD DAYDRLVRGI QAYEAEHPDE 
VLPTSPTGKV AGGAVVGDVP HTVPMLSLDN VFGAGQLADW AASLERRLGR PVTEWSVEPK
LDGLAIAARY RGGRLVQLVT RGDGTAGEDV SHAIGTIVGL PARLAEPVTV ELRGEVMMTA
SQFEDACVKR QAHDGTTFAN PRSAAAGTLR AQDRPYVCEL TFFGYGALPY PDDTSEQAVQ
LRELPHSEVM TWVAGQGVQT PAVTDVGGIV ATTLEQIQER VEQIAAKRTE LPFGIDGIVI
KCDLAADQAQ AGFSSRAPRW AIAYKLPATE KITKLLGVEW NTGRSGIIAP RAVLEPVELD
GSIVTYATLH NVADITRRGL MLGDSVVVYK AGDVIPRVEA PVVHLRTGDE QPIAIPQVCP
TCGDAIDASQ ERWRCVRGRA CRVIASIGYA AGRDQLDIEG LAENRIQQLL DAGLIVDFAD
LFYLTREQVL GLERMGATST DNLLAAIERA KAQPLSRVFC ALGVRGTGRS MSRRIARHFG
SMDAIRAADA EAIEAVEGIG PKKAPGVVAE LVELADLIDK LVKAGVTMTE PGWTPPAEPG
ADAQADPEAG GTSGLPLAGM SVVVTGAMSG ALEALTRNEM NELIERAGGK ASSSVSAKTS
LLVAGEKAGS KRAKAESLGV RIVGPEEFAG LVGPFI