Gene Sros_9220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_9220 
Symbol 
ID8672567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp10166089 
End bp10167270 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content72% 
IMG OID 
ProductDNA-directed DNA polymerase 
Protein accessionYP_003344581 
Protein GI271970385 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGAGTCT TCGATGACCT TGTGGGGCAG GAGCGGGCTG CGGTGGCGCT CCGCCGGGCC 
GCCGAAGGGG CCGCCGAGAT GCTGGCCGGA GGCTCCGGCG CGGGGATGAC CCACGCGTGG
CTGTTCACCG GGCCGCCCGG GTCCGGCCGG GAGGAGGCGG CACGGGCGTT CGCGGCCGCG
CTGTTCTGCC CCGACCAGGG GTGTGGCCAC TGCGACATGT GCCATCAGGT GGCGATCGGC
TCCCACCCGG ACCTGGAGGT CGTCCGCACC GAGGGTCTCT CCTACGGCAT CAAGGAGACC
CGTCAGCTCA TCCTCCGGGC GGCCGGGGCT CCGACGCTGG GGCGCTGGAG GGTCGTTCTG
TTCGAGGGCG CCGACCGGAT GCCGGAGCGC GCCTCCAACG CGCTGCTGAA GGCGATCGAG
GAGCCGCCGC CCAAGACGGT CTGGCTGCTC TGCACGCCCT CGCCCGCGGA CCTGGTCATC
ACCATCCGGT CGCGCTGCCG GGTGGTCACC CTGGTCACCC CGCCCACGGC GGCGGTCGCC
CACGCACTGG TGACCCGCGA CAACATTCCG CCGGACATGG CGGAGTTCGT CGCGCGGGCC
ACCCAGGGGC ATCTGGCGCG GGCGCGGCGG CTGGCGCTGG ACCCGGAGAT GCGCGCGCGC
CGCGAGGCCG TGCTCTCCAT CCCCCGCTCG CTCATCGGGG TCGGGGAGTG CGTCATCGCT
GCGGAGCGGT TGGTGGACAC CGCCAAGAAG GAGGCCGACG CGGTCTCCTC GGCGTTGGAC
GAGGGGGAGA CCGCCGAGCT CCGCAAGATC TACGGCGAGG GCTCCTCGGG GAAAGGGCTG
AACAAGGGCC TGATCCGGGG TGGGGCAGGG GCGATCAAGG ATCTGGAGAA GCTCCAGAAG
TCCCGGGCCA CCCGGACCCA GCGTGACGTC ATCGACGCGG CACTGCTCGA CCTGGTGGCG
TTCTACCGCG ACGTGCTGGC CATGCAGTTC GGCGCGCACG TGGAGCTGGC CAACGAGGAC
CGCCGGGCCG ACCTGGAAGG CCTGGCCCGT TCCTCCGGCC CGGAGGACAC GCTGCGCAGG
ATCGACGCGA TCATGCGCTG CCGCGAGCGG CTGGCCGCCA ACGTCAACCC GCAGATGGCC
GTCGAGGCGA TGACCATCTC GCTGCACCGG CCCCGACTCT GA
 
Protein sequence
MGVFDDLVGQ ERAAVALRRA AEGAAEMLAG GSGAGMTHAW LFTGPPGSGR EEAARAFAAA 
LFCPDQGCGH CDMCHQVAIG SHPDLEVVRT EGLSYGIKET RQLILRAAGA PTLGRWRVVL
FEGADRMPER ASNALLKAIE EPPPKTVWLL CTPSPADLVI TIRSRCRVVT LVTPPTAAVA
HALVTRDNIP PDMAEFVARA TQGHLARARR LALDPEMRAR REAVLSIPRS LIGVGECVIA
AERLVDTAKK EADAVSSALD EGETAELRKI YGEGSSGKGL NKGLIRGGAG AIKDLEKLQK
SRATRTQRDV IDAALLDLVA FYRDVLAMQF GAHVELANED RRADLEGLAR SSGPEDTLRR
IDAIMRCRER LAANVNPQMA VEAMTISLHR PRL