Gene Sros_8724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8724 
Symbol 
ID8672062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9619673 
End bp9622852 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content71% 
IMG OID 
ProductProtein related to penicillin acylase-like protein 
Protein accessionYP_003344103 
Protein GI271969907 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGCAC GTCTCCGGCG GAAGATCACG GCGGCCTCCG CGCTCGCGAT CCTCACGCTG 
GCATCCCTCG TCGTGCCCGC ATCGGCAGCC ACTTTCACCA CCGACGACTA CTGCCTCGGC
GAATGCGCGG ACATCCTGCC ACCCGGCCAG AACGGCAACG CCACCCTGGT CGAGATCCTC
GCCAACCAGT CACTGGGCAC GATGCCGCGC CACAGCGACG ACCAGCTCGG CAAGTACGCC
AACCTCCTCT CCGGCTACAC CGGGCTGACC GACGACCAGA TCACCACGTT CTTCAACGAC
AGCTCCTTCG GCGTGCCGGC GGGGCAGGTG CAGAGCACCA CGAGCCCCCG GCCGGACGTC
ACGATCGTCC GGGACAAGGC GACCGGTGTC CCGCACATCA CCGGGACGAC CAGGGAAGGC
ACGATGTTCG GCGCCGGGTA CGCGGGCGCG CAGGACCGCC TGTGGGTGAT GGACCTGATG
CGGCACGTCG GCCGTGGCGA GCTGACGCCG TTCGCCGGCG GCGCGCCCGG CAACCGCGCG
CTGGAACAGA GCGTCTGGCG GAACTCCCCC TACACCGAGG CGGACTTCCA GGCGCAGATC
AACCGGCTCA GGACCTCCGG CACCCGGGGC GCGCAGCTCT ACGCCGACGT GCAGAACTAC
GTGAGCGGGG TCAACGCCTA CATCGACCGC TGCATGGCGA ACCGGAACTG CCCCGGCGAG
TACGTCCTGA CCGGGCACCT CGACGCGGTC ACCAACGCGG GCGGCCCCGA GGACTTCACC
ATGACCGACC TGATCGCGGT CTCCGGGGTG GTCGGCGGGC TGTTCGGTGG AGGCGGCGGG
GCGGAGATGC AGTCCGCGCT GGTACGGGTG GCGGCGCGGG CCAGGTACGG CACGGCGGCC
GGCGACCAGG TCTGGGCGGC GTTCCGGGCG CAGAACGACC CGGAGGCCAC GCTCACCCTG
CACAACGGCC AGAGCTTCCC CTCCGGGAAC GCGACCGGCG CCACCGGGAC CGTCCTGCCC
GACCCGGCGA CGGCACCCGT GGACATCACC GAGAACGAGA CCGGGTCGGC CACCACCTCG
GCCTCACCCT CCCGGGGCCT GCTGGACGGG CTGATCGTCG ACAACTCCAA GCCCGGCATG
TCCAACGCCG TGGTGGTGTC GGCGGCCAAG TCGGCCTCCG GGCATCCGAT CGCGGTGTTC
GGCCCGCAGA CGGGCTACTT CGCCCCTCAG CTGCTCATGC TGGAGGAGCT CTCCGGACCC
GGCATCCGGG CGCGCGGGGC GGCGTTCGCC GGGCTCAACC TCTACGTGCT GCTCGGCCGG
GGCACGGACT ACGCCTGGAG CGCGACCTCC AGCGGCCAGG ACATCACCGA CACCTACGCC
CTCCAGCTCT GCGAGCCCGG CGGCGGCACG GTGACCACGG CGTCGAACCA CTACCTCTAC
CGGGGCACCT GCACGGCGAT GGAGACGCTG AAGAAGACCA ACGCGTGGAA GCCCACCACC
GCGGACTCCA CCGCCGCCGG TTCCTACGAC CTCGTGATGA AGCGCACGAA GTACGGCCTG
GTCAGCTGGC GTGGCACCGT CAACGGGCAG CCCACCGCCT TCGCCACCCT GCGGTCGACG
TACCAGCACG AGGCCGACTC GGCGATCGGG TTCCAGATGT TCAACGAACC CGCCCAGATG
GGCGACGCGG CGGCGTTCGC GGCCTCGGCC TCCAAGATCG GGTTCGCGTT CAACTGGTTC
TACGTGAACT CCTCCGACGC GGCCTACTTC ATGTCGGGCA ACACCCCGGT CAGATCCGCG
GTGTCCGACC CGAACCTCCC GATGACGGCC GACGCGGCCC ACGAGTGGGC GGGCTTCGAC
CCGGCGACCA ACACCGCGAC GTACACGGCT CCCGCCGCCC ACCCGCAGAC GGCCGACCAG
GACTACCTGG TCAGCTGGAA CAACAAGCAG GCCAAGGACT ACGGCGCCGC GGACGGCAAC
TTCAGCTTCG GGCCGGTTCA CCGGGCCGAC CTGCTGGACG CCCCGGTCAA GGCGGCCCTG
GCAGGATCCG GCAAGCTCGA CCGGGCGGGC ACCGTCAAGA TCATGGCTGA GGCCGCCACA
ACCGACCTGC GCGGCAGGAA GGTCCTGCCC GACCTGCTCC GAGTGATCAA CAGCGCGACC
GTGACCGACC CGGCCCTCGC CTCGGCGGTC TCCAAGCTGT CCGCCTGGGC GTCGTCGGGA
GCCAGGAGGC TGGAGGCCTC GCCCGGCGGC AAGGCCTACG CGCACGCCGA CGCGATCCGC
GTGTTCGACG CGTGGTGGCC CAAGCTCGTC CAGGCGGCGT TCAAGCCCAG CCTGGGAGAC
GGCCTGTACC AGTCGCTGGT CAACGCCCTC CAGATCAACG AATCACCCTC GGGCCACCAG
CAGGGCGACG TCTCCAACCT GCCCACCTCC GCGAACGAGG CCCAGACGCA CAAGGGCTCG
GCGTTCCAGT ACGGCTGGTG GGGCTACGTG AGCAAGGACG TCCGGGCCGT GCTCGGCGAC
CCGGTCTCCG GCCCGCTGCC CGGCAGGCAC TGCGGGGGCG GCACGCCGGC GGGCTGCCGT
ACGGTCCTGC TGAACAGCCT GTCGGCGGCG CTCGCGGAGC CCGCGACGAC GACCTACCCG
GCGGACGGCG TGTGCGCGGC GGGTGACCAG TGGTGCGCCG ACGCCGTCCA GCAGTCGCCG
CTCGGCGGGA TCAAGCAGTC GCTGATCTCC TGGCAGAACC GGCCGACCTA CCAGCAGGTC
GTGTCCTTCC CCGCCCACCG GGGCGACAGC GTCACCAACC TGGCGGGCGG GAAGAAGGCG
AGCGCGTCCA GCGTCCAGTC CTTCCTCTAT CCGGCGGGCA AGGCCGTCGA CGGCGACCCG
ACGACCCGCT GGTCGAGCGC GGCCGGCGAC GACCAGTACC TCCAGGTCGA CCTCGGATCC
GCCATGACCG TGGCCCGGAC GGTGCTGCGC TGGGAGTCGG CCTACGGGAC CGGCTACTCG
ATCCAGACAT CGTCCGACGG CTCCACCTGG ACCACCGTCC ACTCCACCAC CACCGGGAAC
GGCGGCGTGG ACAACGTGAC GTTCAGCCCG ACCACCGCCC GTTACGTCCG CATGCGGGGC
GTCACGCGCG CGACGTCGTA CGGCTACTCC CTCTACGAGC TGGAGGTCTA CTCCCGCTGA
 
Protein sequence
MHARLRRKIT AASALAILTL ASLVVPASAA TFTTDDYCLG ECADILPPGQ NGNATLVEIL 
ANQSLGTMPR HSDDQLGKYA NLLSGYTGLT DDQITTFFND SSFGVPAGQV QSTTSPRPDV
TIVRDKATGV PHITGTTREG TMFGAGYAGA QDRLWVMDLM RHVGRGELTP FAGGAPGNRA
LEQSVWRNSP YTEADFQAQI NRLRTSGTRG AQLYADVQNY VSGVNAYIDR CMANRNCPGE
YVLTGHLDAV TNAGGPEDFT MTDLIAVSGV VGGLFGGGGG AEMQSALVRV AARARYGTAA
GDQVWAAFRA QNDPEATLTL HNGQSFPSGN ATGATGTVLP DPATAPVDIT ENETGSATTS
ASPSRGLLDG LIVDNSKPGM SNAVVVSAAK SASGHPIAVF GPQTGYFAPQ LLMLEELSGP
GIRARGAAFA GLNLYVLLGR GTDYAWSATS SGQDITDTYA LQLCEPGGGT VTTASNHYLY
RGTCTAMETL KKTNAWKPTT ADSTAAGSYD LVMKRTKYGL VSWRGTVNGQ PTAFATLRST
YQHEADSAIG FQMFNEPAQM GDAAAFAASA SKIGFAFNWF YVNSSDAAYF MSGNTPVRSA
VSDPNLPMTA DAAHEWAGFD PATNTATYTA PAAHPQTADQ DYLVSWNNKQ AKDYGAADGN
FSFGPVHRAD LLDAPVKAAL AGSGKLDRAG TVKIMAEAAT TDLRGRKVLP DLLRVINSAT
VTDPALASAV SKLSAWASSG ARRLEASPGG KAYAHADAIR VFDAWWPKLV QAAFKPSLGD
GLYQSLVNAL QINESPSGHQ QGDVSNLPTS ANEAQTHKGS AFQYGWWGYV SKDVRAVLGD
PVSGPLPGRH CGGGTPAGCR TVLLNSLSAA LAEPATTTYP ADGVCAAGDQ WCADAVQQSP
LGGIKQSLIS WQNRPTYQQV VSFPAHRGDS VTNLAGGKKA SASSVQSFLY PAGKAVDGDP
TTRWSSAAGD DQYLQVDLGS AMTVARTVLR WESAYGTGYS IQTSSDGSTW TTVHSTTTGN
GGVDNVTFSP TTARYVRMRG VTRATSYGYS LYELEVYSR