Gene Sros_0001 details

Gene Information

Locus tag: Sros_0001
Symbol:
ID: 8672741
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 361
End bp: 2109
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: DNA replication initiation ATPase-like protein
Protein accession: YP_003335806
Protein GI: 271961610
COG category:
COG ID:
TIGRFAM ID:
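
The coordinate and length fields above are mutually consistent, and a short sketch can check this. The function names are illustrative and not part of the source database. The check assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(361, 2109)
print(length)                  # 1749
print(protein_length(length))  # 582
```

Both results match the Gene Length and Protein Length fields of the record.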


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000523076
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCAA TGGACCTCGG CGCGGTCTGG GCGCGGGCGT TGGAGAACTC CCTCAACGAG 
AGCGTCCCGT CCCAGCAGCG CGTCTGGCTC AGCATGACCC GGCCCTTCGG CCTCATGAAC
GACACGGTGG TGCTCGCGGC CCCCAACGAT TTCGCCAGGG ATGTCCTGGA GGACAAGCTC
CGTCCCCTGA TCAGCCACGC GCTCTCCCAG GAGTTCGGCC GCCCGATGCG GGTCGCGGTC
ATGGTGGACC CCAGCGCCAC GGGCTCCGAC TCCGGCGTCG CCTCCCGCCC GGAGTCCTAT
CCACAGGCGC CGCCGACGAG TTATCCACAG AGTGGCGGGC CGCAGCAAAG TTATGCACAG
CCGGTGACCT CCCAGCACCA CAGTGCTTCC GGGGCTCAGC ATCCCTACCC GGCTGCCGGG
ACCGCGGCCC CGCCGTCCAC CGAATATCCA CAGCATCAGC CACAGGCCCA GCCGTTCTCC
TATTCCTATC AACACAGGGA TGAGCAGGCG CAGCCGTACC TCCCGGCGGA GCCCTCTCCC
GCCCCGCCGC CGGCGGAGCA GGGCGGCGGA TACGGCCGGG GCGGATACAC CCCGCGGCCC
TCTCCTCCCT CGCGCTCGGA ACCGGACACC TTCGAGCGTC CCGCGCAGAG CCCGGGCGCC
GGCGGAGTGC ACAACCGATG GGACAGCCGC GGCAGCCGCA CCCAGGGGGA GCCCGCCCGG
CTGAACCCGA AGTACACCTT CGAGACCTTC GTCATCGGCT CCAGCAACCG CTTCGCCCAC
GCGGCCGCCG TCGCGGTGGC CGAGGCGCCG GCCAAGGCCT ACAACCCGCT GTTCATCTAC
GGCGACTCGG GCCTGGGGAA AACCCACCTG CTGCACGCGA TCGGCCACTA CGCGCAGAGC
CTCTACGACG GCGCGCGGGT GAGATACGTC AGCTCGGAGG AGTTCACCAA CGACTTCATC
AACTCCATCC GCGACCACAA GGCCGACGGC TTCCGCAGCC GCTACCGCGC GGTCGACATC
CTGCTCGTGG ACGACATCCA GTTCCTGGAG GGCAAGGAGC AGACGCAGGA GGAGTTCTTC
CACACCTTCA ACACCCTGCA CAACGCCAAC AAGCAGATCG TCATCTCCAG CGACCGGGCG
CCCAAGCAGC TCGTCACCCT GGAGGACCGG CTCCGCAACC GCTTCGAGTG GGGCCTGATC
ACCGACGTCC AGCCGCCCGA GCTGGAGACC CGCATCGCGA TCCTGCGCAA GAAGGCGATC
CAGGAGGGCC TGGCCGCCCC GCCCGAGGTG CTGGAGTACA TCGCCAGCCG CATCTCCACC
AACATCCGCG AGCTCGAAGG CGCCCTGATC AGGGTCACCG CGTTCGCCAG CCTCAACCGG
CAGTCGGTCG ACCTGCAGCT CACCGAGGTC GTGCTCAAAG ACCTGATCAC CGAGGACGCC
GGCTCCGAGA TAACGGTCGC CACGATCATG GCGTCCACCG CCGCCTACTT CGGCCTGTCG
ATCGACGACC TGTGCGGCGG GTCGCGCTCG CGCGTCCTGG TCACCGCCCG GCAGATCGCC
ATGTATCTGT GCAGGGAGCT CACCGACATG TCGCTGCCGA AGATCGGCCA GCAGTTCGGC
GGACGCGACC ACACCACGGT CATGCACGCC GACCGGAAGA TCCGCTCGCT CATGGCGGAG
CGTCGTTCGA TCTACAACCA GGTCAACGAA CTCACCACGA GGATCAAGCA GCAGTCGCGC
AATGGATGA
 
Protein sequence
MDAMDLGAVW ARALENSLNE SVPSQQRVWL SMTRPFGLMN DTVVLAAPND FARDVLEDKL 
RPLISHALSQ EFGRPMRVAV MVDPSATGSD SGVASRPESY PQAPPTSYPQ SGGPQQSYAQ
PVTSQHHSAS GAQHPYPAAG TAAPPSTEYP QHQPQAQPFS YSYQHRDEQA QPYLPAEPSP
APPPAEQGGG YGRGGYTPRP SPPSRSEPDT FERPAQSPGA GGVHNRWDSR GSRTQGEPAR
LNPKYTFETF VIGSSNRFAH AAAVAVAEAP AKAYNPLFIY GDSGLGKTHL LHAIGHYAQS
LYDGARVRYV SSEEFTNDFI NSIRDHKADG FRSRYRAVDI LLVDDIQFLE GKEQTQEEFF
HTFNTLHNAN KQIVISSDRA PKQLVTLEDR LRNRFEWGLI TDVQPPELET RIAILRKKAI
QEGLAAPPEV LEYIASRIST NIRELEGALI RVTAFASLNR QSVDLQLTEV VLKDLITEDA
GSEITVATIM ASTAAYFGLS IDDLCGGSRS RVLVTARQIA MYLCRELTDM SLPKIGQQFG
GRDHTTVMHA DRKIRSLMAE RRSIYNQVNE LTTRIKQQSR NG
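
Both sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the coding sequence. It assumes the standard NCBI base ordering (T, C, A, G) for the amino-acid string of translation table 11. All function names are illustrative, and alternative start codons are not modeled.

```python
from itertools import product

# Codon table in NCBI ordering: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGGATGCAA TGGACCTCGG CGCGGTCTGG")
print(translate(first_blocks))  # MDAMDLGAVW
```

The translated prefix matches the first block of the protein sequence. Applying `gc_percent` to the full cleaned gene sequence allows a comparison with the GC content field of the record.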