Gene Sros_1992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1992 
Symbol 
ID8665274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2145245 
End bp2146819 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content74% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003337723 
Protein GI271963527 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.933097 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGAT CCCCCTACGG CAACTACGCC TACGCCATCC TCACCGCCAA CGCCCTGCGC 
ACCCCGGACA GGGTCGCGCT GACCTACTGC GGCGAGCGGA GCTTCACCTA CTCCGAGCTC
GACCGGATCG TCGACAGGCG CGCCCACGCC CTGACGGCGG CCGGGATCGA GCCGCGGCGG
CGGGTGGCGG CGCTGCTCAA CGAGACGCTG CACGTGGCCG AGGTCTACCT GGCGGCGGCC
AAGCTGGGCG CGGTCACCGC GGCGCTCAAC CCGTACTGGC CGGTGGACAC CCTGCGGGAG
GTCGTCGCGC ACTCCGGCGC CACCGCCTTC GTCTACGACT CCACCGTCGA GCAGGTCGTG
GCGGAGATCC GCCCCCGGCT GCCCGACGTG ACCACCTGGA TCAAGGTCGG CGGCCCGGGT
GAGGACGTCG TGGACCTGGA CGCGCTGACC GCCGCCGCCC CGGACGGGGA GTTCGCGCCG
GACGGCTCCG GGGACGACCC GCTCGCCCTC TACTACACCT CGGGGACCAC CGGGCTGCCC
AAGGCCGCGA TCCACACGCA CGCCTCGTCC CTGGCGACCG CGCGGATCTG GCTGGACGTG
CCGCGCGCCG AGGACTCGGT GCTCGGCACC GGGGCGATCA TCTGGGGCAT CGGCTTCCCC
GCCCTGGTCG GGCCGGCGCT GTACGCGGGC ATGCGGCTGG TGCTGGAGCA GGACTGGGGA
CCGGCGAACT TCCTGAGGGT GGTCCCCCGC GAGAGGGTCA CCCACGTCTC GCAGATCCCG
TCGTTCTACG CGGCCCTGCT CGGCTCCGAC GACCACGAGG GGGTGGACCT GTCGTCCCTG
CGGGTCATCA TGCTGGGCGG CGAGGCGCTC CCCGCGACGC TGCTGGGCAG GATGAGGGAG
CGGCTGCCCG GCGCGGGCGT CTACTGCTAC TACGGCCAGA CCGAGGCTCC CTACACCTGT
TTCGGCCGGG TGGACGACGG CAGCACGCCG CTCGGCTCCT CCGGCAGGGC GCGGACCGGC
AACGCCGTAC GGATCACCGG GCCGTCCGGC GAACGGGTGG TCGGCGAGGT CGGTGAGATC
AACCTGGCCG GGCCGCACCG GATGACCGGC TACGACCGGC TCCCCGAGAA GACGGCCGAG
GTGCTCCGCG GCGAGTGGTA CGTGGGCGGT GACCTGGGTG TCCTCGCCGA GGACGGCAAC
CTGACCGTGC TCGGCCGCCG CGAGGACGCG ATCCTGAAGG GCGGCGCCTG GACGCAGCCC
TCGGCGGTCG AGGAGGCCGC GGTGGCGCTG GACGACGTGG CCGAGGCAGG CGCGGTGGGC
GTGCCCGAGC ACGTCGAGGG CCGGGACGCG GCCGAGCAGA GGATCCTGCT CGCGGTGGTC
GCCCGCTCCG GGCACTCGCT CGACCCGTCG AAGCTCGCGA TCGCGCTCGC CGAGTCGCTG
CCCGAGCACC GGCGGCCCGA CCGCATCGTG GTCGCCGACG AGCTCCCCCA CTTCCAGGAC
GCCTCCGGCG GCCCCGGCAA GCTGCTGCGC CGGGAGATCC GGGAGAGGTA CCGGCACCTC
CTGGAGGAGG CGTGA
 
Protein sequence
MARSPYGNYA YAILTANALR TPDRVALTYC GERSFTYSEL DRIVDRRAHA LTAAGIEPRR 
RVAALLNETL HVAEVYLAAA KLGAVTAALN PYWPVDTLRE VVAHSGATAF VYDSTVEQVV
AEIRPRLPDV TTWIKVGGPG EDVVDLDALT AAAPDGEFAP DGSGDDPLAL YYTSGTTGLP
KAAIHTHASS LATARIWLDV PRAEDSVLGT GAIIWGIGFP ALVGPALYAG MRLVLEQDWG
PANFLRVVPR ERVTHVSQIP SFYAALLGSD DHEGVDLSSL RVIMLGGEAL PATLLGRMRE
RLPGAGVYCY YGQTEAPYTC FGRVDDGSTP LGSSGRARTG NAVRITGPSG ERVVGEVGEI
NLAGPHRMTG YDRLPEKTAE VLRGEWYVGG DLGVLAEDGN LTVLGRREDA ILKGGAWTQP
SAVEEAAVAL DDVAEAGAVG VPEHVEGRDA AEQRILLAVV ARSGHSLDPS KLAIALAESL
PEHRRPDRIV VADELPHFQD ASGGPGKLLR REIRERYRHL LEEA