Gene Sros_5254 details


Gene Information

Locus tag: Sros_5254
Symbol:
ID: 8668548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5766591
End bp: 5768384
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: Asparagine synthase (glutamine-hydrolyzing)
Protein accession: YP_003340766
Protein GI: 271966570
COG category:
COG ID:
TIGRFAM ID:
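
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines of Python. This is a minimal sketch, not part of the record: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon (the gene sequence in this record does end in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(5766591, 5768384)
    print(length, "bp")                      # 1794 bp
    print(protein_length_aa(length), "aa")   # 597 aa
```

With the values from this record, 5768384 - 5766591 + 1 gives 1794 bp, and 1794 / 3 - 1 gives 597 aa, matching the Gene Length and Protein Length fields.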


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.381893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.232411
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGAGA TCGCCGGTTG GGTGGACTTC GAACGCCAGC CCGCCACGGC AGGCGCCGTG 
GTCACCGAGA TGACCCGGGC CCTGGCCGCG GGGGAGGGCG GCGCCCCGCG GCTGTGGGCG
GGGCCCGGGG GCGCCCTCGG CCTGGGCGGC GCCCGTTCGC ACGGCATGGC CGGCCGGGCA
CCGCTCGACG CGCGTGGCGC GCGCGGCCCG GCGGTCATCG CGTTCGGGGG TGCCTGCGAC
AACCGTGCGG AGCTGCGCGG CCTGCCAGGC CGCCCGGACG GCGAGGGCGA CGCGGCGGCG
GTGCTGCACG CCTACCGGGT GCTCGGCGTC CGCTTCACCG AGCACCTGCG CGGCTCGTAC
GCCTTCGCCC TCTGGGAGCC TCAGGACGCC GCGCTGACGC TGGTCCGGGA CCGGCTGGGC
ACCCGACCGC TCTACTATCT GGAACTCGAC ACCGGCGTGG TCTTCGCCTC GCGGCCGGAG
GCCGTCCTGG CACATCCGGC GGCACGCCCG GCGCTCGACG AGGACGGCCT GCGCATCGTG
CTCTCCGGGA TCACCGTCCC AGGCCGGACC GTCTACCGGG ACGTCCGCGA GGTGCGTCCG
GGCCACGCGG TCCGGTTCTC CTCCGGAGGC AGGACCGAAC GCCGCTACTG GGCGCTGTCG
GCGGCCGAGC ACCGCGACGA CACCGCGACC ACCGTCGCAC GGGTCCGTGA GCTGCTGGCG
GACGCCGTCG GCGAGCAGAC GGCCTCCGCC GGCCGGGCCG GCAGCCTGAT GTCGGGCGGG
CTGGACTCCA GCACGCTCGC GGCGCTGCTC GCCGGGCGGC GGGAAGAGCG CCTGGCCACG
TTCTCGGTCG ACTACCAGGG CTACGAGGAG AACTTCCGGC CGCACATCGT CCGGCCGGCA
CCGGACAGCC CCTATGTCCG TGAGATGACA GCCCATCTCG GATCCGACCA TACCGACGTG
GTGCTGACCA CCGGCGACCT CACCGCCCCC GACGTCTGGA ACGCCCTGGT GGCCGCGCTG
GACCAGCCCC GGCTGTTCGC CGACATCGAA CCGTCGATGA TCCTGCTCTA CCGGGCGGTC
CAGGGCAGGC TGGACACGGT GCTCAGCGGC GAGGGAGCGG ACGAACTCTT CGGGGGGTTC
CCCTGGTTCC ACCACCCCCG GTGGGCCGAC GCGCCCGACT TCCCCTGGAC GCCGACGACC
GACGAGCTGG TCGGCACCCT CTTCGCGCCG GCGATGAAGG ACCTGGCGGT GCCGGACTTC
CGCGCCGAGC ACTACCGCGA GGCCCTGGCG GAGCTGCCCG CCCCGCCGGA TGAGGACCCC
CGGGAGCGGC GGATGAGGGA GGTCGGCTAC CTTTTCGTCA CCCGCTTCCT GCCCGAGCAG
CTCGACCGCG CCCACCGGCT GAGCGCCGGC TGCGGATTCG ACGTCAGGAT GCCCTTCTGC
GACCACCGGC TGGTGGAGTA CGCGCTGAAC ATCCCGTGGG GCGTCAAGAA CTTCGACGGC
TCGGAGAAGA GCGTGCTGCG CGCGGCCGCG GCCGGCCTGC TCCCGGCTTC CGTGCTGGAG
CGCCGCAAGT CCGGGTACCC GATGACGCAC GACCGGGGAT ACGACCGGAT CCTGCGGGCG
AAGGTCGGCG AGCTGGCTCC CGGCGGGCCG GTTCTGCCGC TGCTCGACGC GTCCGTGGTG
GACCGCCTGC GCGAGGACCC GTCCCAGGGG CCGGCGCTCA GCCGTACCGA GCTGGAACTG
GCCCTGAAGC TCGACGCCTG GCTCACCCGG TGGCGGCTCA CCCTGCCGGG CTGA
 
Protein sequence
MSEIAGWVDF ERQPATAGAV VTEMTRALAA GEGGAPRLWA GPGGALGLGG ARSHGMAGRA 
PLDARGARGP AVIAFGGACD NRAELRGLPG RPDGEGDAAA VLHAYRVLGV RFTEHLRGSY
AFALWEPQDA ALTLVRDRLG TRPLYYLELD TGVVFASRPE AVLAHPAARP ALDEDGLRIV
LSGITVPGRT VYRDVREVRP GHAVRFSSGG RTERRYWALS AAEHRDDTAT TVARVRELLA
DAVGEQTASA GRAGSLMSGG LDSSTLAALL AGRREERLAT FSVDYQGYEE NFRPHIVRPA
PDSPYVREMT AHLGSDHTDV VLTTGDLTAP DVWNALVAAL DQPRLFADIE PSMILLYRAV
QGRLDTVLSG EGADELFGGF PWFHHPRWAD APDFPWTPTT DELVGTLFAP AMKDLAVPDF
RAEHYREALA ELPAPPDEDP RERRMREVGY LFVTRFLPEQ LDRAHRLSAG CGFDVRMPFC
DHRLVEYALN IPWGVKNFDG SEKSVLRAAA AGLLPASVLE RRKSGYPMTH DRGYDRILRA
KVGELAPGGP VLPLLDASVV DRLREDPSQG PALSRTELEL ALKLDAWLTR WRLTLPG
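
Both sequences above are printed in blocks of ten characters, so they need whitespace stripped before use. The following sketch shows one way to do that and then to compute GC content and translate the gene. It is illustrative only: all names are hypothetical, and it assumes the amino-acid assignments of translation table 11 match the standard code (the two tables differ in permitted start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest,
# paired with the amino acids of the standard code (assumed valid for table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    print(translate("ATGTCGGAGA TCGCCGGTTG GGTGGACTTC"))  # MSEIAGWVDF
```

Passing the full gene sequence to gc_fraction and translate should allow a check against the GC content and protein sequence reported in this record.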