Gene Svir_21970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSvir_21970 
Symbol 
ID8387521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharomonospora viridis DSM 43017 
KingdomBacteria 
Replicon accessionNC_013159 
Strand
Start bp2365637 
End bp2366617 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content69% 
IMG OID644976250 
Productpredicted transcriptional regulator 
Protein accessionYP_003134032 
Protein GI257056200 
COG category[K] Transcription 
COG ID[COG2378] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.530492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.375543 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCT CGAGCGAGCG GCTGCCCCGG CTGCTGGCGT TGGTGCCGTA CCTGCTGGCG 
CGTCCGGGTA TCCGGATCGA CGAGGTGGCA CGGGATTTCG ACGTCACGCC CAAGCAGTTG
CGCAAGGACC TGGAACTGCT GTGGATGTGC GGCCTGCCCG GCTACGGGCC GGGGGATCTG
ATCGACCTGT CGTTCGACGG TGACACGGTG TCGGTGATCT ACGACGCCGG GATGCGCAGG
CCGCTGCGGC TGACAGGGGC GGAAGCGAGC GCGTTGCTGA TCGCGCTGCG AACGTTGGCG
GAGACGCCGG GGGTGGTGGA CACCGACGCG GTGCGGCGGG CGATCGCCAA GATCGAGGCG
GCCTCCGGGG ACGTCCGTCC TGCCGACGTG GTGGTGGGCC ACGGTATGCG GGAAGCGGAA
AGCACCGTCG AGACCCGTGG GCGGGTACAG GAGGCCGTAC GCCAGCGGCG AGCGCTGTGG
CTGCGCTACT ACACGGCGTC GAAGGACGAG CTCAGCGAAC GCACCGTGGA TCCGATGCGG
TTGCTGATCG TGCAGGGCAT CAGCTACCTG GAGGCGTGGT GTCGCAAGGC GGAGGCGGTG
CGGCTGTTCC GGCTGGATCG GATCGACGCG TTGACGGTGC TGGACGAACG GGCGGCCCCG
CCCGAGACCG CGATTCCGAA GGACCTTTCG GAGGGGGTGT TCCCCCAGCG GGCGGAGTAC
CCGGCGGCCG AACTGGTGTT GGAGCCTGAC GCGCGATGGG TCGCCGAGTA CTACCCGTGT
GAGGAGCTGG CCGAGCTGGA GGGCGGTCGA CTGCGGGTAC GGATGCGGTA CGGCGACGAG
TCCTGGCTGA TACGGCTGGT TCTGCGACTC GGGGGTGAGG CGACGGTGGA GCGACCCCGG
CACGTGGCGC AAGCTGTGCG ACAGCGGGCG GCCGAGGCAT TGGCCCGAAC TCGTCATCTC
GCCGCAACCC TGGCTGGTTA G
 
Protein sequence
MSGSSERLPR LLALVPYLLA RPGIRIDEVA RDFDVTPKQL RKDLELLWMC GLPGYGPGDL 
IDLSFDGDTV SVIYDAGMRR PLRLTGAEAS ALLIALRTLA ETPGVVDTDA VRRAIAKIEA
ASGDVRPADV VVGHGMREAE STVETRGRVQ EAVRQRRALW LRYYTASKDE LSERTVDPMR
LLIVQGISYL EAWCRKAEAV RLFRLDRIDA LTVLDERAAP PETAIPKDLS EGVFPQRAEY
PAAELVLEPD ARWVAEYYPC EELAELEGGR LRVRMRYGDE SWLIRLVLRL GGEATVERPR
HVAQAVRQRA AEALARTRHL AATLAG