Gene Sros_9344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_9344 
Symbol 
ID8672695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp10293242 
End bp10294690 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content70% 
IMG OID 
ProductArgininosuccinate synthase 
Protein accessionYP_003344705 
Protein GI271970509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.34089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAGG TTCTCACCTC CCTGCCGACC GGCGAACGCG TCGGCATCGC CTTCTCGGGC 
GGCCTCGACA CCTCGGTCGC CGTCGCGTGG ATGCGCGACA AGGGTGCCGT CCCGTGCACC
TACACCGCCG ACATCGGCCA GTACGACGAG CCCGACATCG CCTCGGTGCC CGGCCGCGCG
CTGGCGTACG GTGCCGAGGT CGCGCGCCTG GTCGACTGCC GCGCGGCACT GGTCGAGGAG
GGCCTGGCCG CGCTCACCTG CGGCGCGTTC CACATCCGCT CGGCCGGGCG CACCTACTTC
AACACCACGC CGCTCGGCCG TGCCGTCACC GGGACCCTGC TGGTCCGGGC GATGATCGAG
GACGACGTAC AGATCTGGGG CGACGGCTCG ACGTTCAAGG GCAACGACAT CGAGCGGTTC
TACCGGTACG GCCTGCTCGC AAACCCCCAC CTGCGCATCT ACAAGCCCTG GCTGGACGCG
GACTTCGTGT CCGAGCTCGG CGGCCGCAAG GAGATGTCGG AGTGGCTGCT CGCCCACGAC
CTGCCCTACC GTGACAGCAC CGAGAAGGCC TACTCGACCG ACGCCAACAT CTGGGGCGCC
ACCCACGAGG CCAAGACCCT GGAGCACCTC GACACCGGTA TCGAGACCGT GGACCCGATC
ATGGGCGTGC GGTTCTGGGA CCCCGCGGTC GAGATCGCGA CCGAGGACGT GACCATCGGC
TTCGACCAGG GCCGCCCGGT GACGATCAAC GGCAAGGAGT TCGCCACCCC GGTCGACCTG
GTGATGGAGG CGAACACGAT CGGCGGACGG CATGGCATGG GCATGTCGGA CCAGATCGAG
AACCGGGTGA TCGAGGCCAA GAGCCGCGGC ATCTACGAGG CCCCCGGGAT GGCGTTGCTG
CACGCGGCGT ACGAACGGCT GGTCAACGCG ATCCACAACG AGGACACCCT GGCGAGCTAC
CACAACGAGG GACGGCGGCT CGGCCGGCTG ATGTACGAGG GCCGCTGGCT CGACCCGCAG
GCGCTGATGC TGCGCGAGTC GCTGCAGCGC TGGGTCGGCA CGGCGGTCAT CGGCGAGGTG
ACGCTGCGGC TGCGGCGCGG TGAGGACTAC TCGATCCTGG ACACCTCCGG CCCGGCCTTC
AGCTACCACC CGGACAAGCT GTCGATGGAG CGCACCGAGG ACTCGGCGTT CGGTCCGGTG
GACCGGATCG GCCAGCTCAC CATGCGCAAC CTCGACATCG CCGACTCGCG CGCCAAGCTT
GAGCAGTACG CCAGCCTCGG CATGGTCGGC ACCACCCACC CCGCGCTCAT CGGTGCCGCC
CAGGCGGCCT CGACCGGACT GATCGGCGCG ATGCCGCAGG GCGGCGCCGA GGCCATCGCC
TCACGCGGCA CGGTCTCCGA TGAAGACGCG ATGCTCGACC GCGCCGCGAT GGAGTCCGGC
ACCGACTGA
 
Protein sequence
MSKVLTSLPT GERVGIAFSG GLDTSVAVAW MRDKGAVPCT YTADIGQYDE PDIASVPGRA 
LAYGAEVARL VDCRAALVEE GLAALTCGAF HIRSAGRTYF NTTPLGRAVT GTLLVRAMIE
DDVQIWGDGS TFKGNDIERF YRYGLLANPH LRIYKPWLDA DFVSELGGRK EMSEWLLAHD
LPYRDSTEKA YSTDANIWGA THEAKTLEHL DTGIETVDPI MGVRFWDPAV EIATEDVTIG
FDQGRPVTIN GKEFATPVDL VMEANTIGGR HGMGMSDQIE NRVIEAKSRG IYEAPGMALL
HAAYERLVNA IHNEDTLASY HNEGRRLGRL MYEGRWLDPQ ALMLRESLQR WVGTAVIGEV
TLRLRRGEDY SILDTSGPAF SYHPDKLSME RTEDSAFGPV DRIGQLTMRN LDIADSRAKL
EQYASLGMVG TTHPALIGAA QAASTGLIGA MPQGGAEAIA SRGTVSDEDA MLDRAAMESG
TD