Gene Sros_6078 details

Gene Information

Locus tag: Sros_6078
Symbol:
ID: 8669376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 6667567
End bp: 6669228
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: AMP-dependent synthetase and ligase
Protein accession: YP_003341553
Protein GI: 271967357
COG category:
COG ID:
TIGRFAM ID:
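
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers agree with that convention) and that the gene length includes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 6667567
END_BP = 6669228
GENE_LENGTH_BP = 1662
PROTEIN_LENGTH_AA = 553


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1662
    print(protein_length(GENE_LENGTH_BP))  # 553
```

Both results match the Gene Length and Protein Length fields listed in the record.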


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.225727
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGTCA CGCATGACCA GGTCCAGGCC CAGCTCACCG GTCCAGGACA GCTTTTCGAG 
ATGGAGGAGG TCGCCGGCCA CGGCGGCACC GTCAGGACGT GGAAACACGC TCCCGGCCAC
TTCCGCGCGC TCCTCGAGAT GAGCCGGTTC CACGGCGACA AGGTCTTCCT GACGTACGAG
GACGAGCACA TCACCTACGA GGAGCACTTC CGGCGGGCGG CGACGCTGGC ACGGCGCCTG
GTGGAGGACT ACGGCGTGGT CAAGGGCGAC CGGGTCGCCA TCGCCATGCG CAACTACCCC
GAGTGGGTGG TCGCGTTCTC CGCCACGCTG GCCGCCGGGG CGGTGGCCGT ACCGCTCAAC
GCCTGGTGGA CCGCCCAGGA GCTGGAGTTC GGGCTGTCGG ACTCCGGGGC CAAGGTGCTG
ATCGCCGACG GCGAGCGCGC CGCCAGGCTC GGCGGCCTGG CGCAGTCGCT GATCGTGGCC
CGGGGCGAGG CTCCCGAGGG TGCCCGCTCG TTCGAGGACG TGCTGGGCGC CGTCGAGGCC
GACGTGACGC TGCCGACGGT CGAGCTGTCC CCGGAGGACC CGGCCACGAT CTTCTACACC
TCGGGCACCA CCGGCCGCCC CAAGGGCGCC CTCGGCAGCC ACCGCAACCT CGGCCAGTCC
CCCATGACCG TGGCCTACGG CCTGCTGCGC AGCGTGGTCC GGGCGGGCAA GGACCCGGCG
GAGTCGGCCG GGCAGCGCCG CGTCACCCTG CTCACCGTGC CGCTCTTCCA CGTCACCGGC
TGCTTCGCGG TGATGACCAC CACCATGTTC ACCGGCGGCG GCCTGGTGCT GATGTACAAG
TGGGACGCCG GGCGGGCCCT GGAGCTGATC GAGCGCGAGA AGGTCACCAC GTTCAGCGGC
GTGCCCACCA ACGTGTGGCA GCTCCTGTCC CACCCCGGCC TGGACAAGCA CGACATCTCC
AGCCTCAACT CCCTCGGGTA CGGCGGCGCC CCGGCCCCGC CGAAGCTGCT GGAGCGCATC
ACCGAGCTGC TGCCCAGCCG CTCCCCCTCC AACGGCTACG GCATGACCGA GACCACCGCG
CTCACCATCA ACAACGGCGG CGTGGACTAC CTGGCCAAGC CCGACAGCAT CGGCCTGCCG
ATGCCCGTGG TCGAGGTGAA GATCGCCGAC CCGCTCGGCG ACGAGCTCCC GGCCGGCGAG
GTCGGCGAGC TCTGCCTGCG CGGCCCGAAC GTGATCCTCG GCTACTGGAA CCGCCCCGAG
GCCACCGCCG AGACCTTCAT CGGCGGCTGG CTGCACACCG GCGACCTGGC GCGGGTGGAC
GAGGAGGGCT TCGTGTTCAT CGTGGACCGG GCCAAGGACA TGGTCATCCG CGGCGGCGAG
AACGTCTACT GCGCAGAGGT CGAGGCCGCG CTGTTCGAGC ATCCGGCGGT GGACGACGTC
GCGGTGATCG GCATCCCCCA CGACGAGCTC GGCGAGGAGG TCGGCGCGGT GGTACGGCTG
GCGGCCCCGG CGAGCGCCGA GGAGCTGCAG GCCTTCCTGC GCGAGCGGAT CGCGGCGTTC
AAGATCCCGG TCCGGTTCTG GGTCCGCGAG ACCGAGCTCC CCCGCAACCC CGGCGGCAAG
ATCCTCAAGA CCCACCTCCG CAAGGAGGTC CTGGGCTCCT GA
 
Protein sequence
MTVTHDQVQA QLTGPGQLFE MEEVAGHGGT VRTWKHAPGH FRALLEMSRF HGDKVFLTYE 
DEHITYEEHF RRAATLARRL VEDYGVVKGD RVAIAMRNYP EWVVAFSATL AAGAVAVPLN
AWWTAQELEF GLSDSGAKVL IADGERAARL GGLAQSLIVA RGEAPEGARS FEDVLGAVEA
DVTLPTVELS PEDPATIFYT SGTTGRPKGA LGSHRNLGQS PMTVAYGLLR SVVRAGKDPA
ESAGQRRVTL LTVPLFHVTG CFAVMTTTMF TGGGLVLMYK WDAGRALELI EREKVTTFSG
VPTNVWQLLS HPGLDKHDIS SLNSLGYGGA PAPPKLLERI TELLPSRSPS NGYGMTETTA
LTINNGGVDY LAKPDSIGLP MPVVEVKIAD PLGDELPAGE VGELCLRGPN VILGYWNRPE
ATAETFIGGW LHTGDLARVD EEGFVFIVDR AKDMVIRGGE NVYCAEVEAA LFEHPAVDDV
AVIGIPHDEL GEEVGAVVRL AAPASAEELQ AFLRERIAAF KIPVRFWVRE TELPRNPGGK
ILKTHLRKEV LGS
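
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the pipeline that produced this record. It assumes the sequence is given on the coding strand and in frame. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; the two differ only in which codons may act as start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG ordering.
# Table 11 uses the same assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record above.
FIRST_LINE = "ATGACTGTCA CGCATGACCA GGTCCAGGCC CAGCTCACCG GTCCAGGACA GCTTTTCGAG"
LAST_LINE = "ATCCTCAAGA CCCACCTCCG CAAGGAGGTC CTGGGCTCCT GA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MTVTHDQVQAQLTGPGQLFE
    print(translate(LAST_LINE))   # ILKTHLRKEVLGS
```

The two outputs correspond to the first 20 and last 13 residues of the protein sequence shown above. Translating the last line on its own works because the preceding 27 lines contain 1620 bases, a multiple of 3, so that line begins on a codon boundary.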