Gene Sros_6071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6071 
Symbol 
ID8669369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6660178 
End bp6661377 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content69% 
IMG OID 
ProductArgininosuccinate synthase 
Protein accessionYP_003341546 
Protein GI271967350 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGACC GGGTCGTACT CGCCTTCTCA GGCGGCCTCG ACACCTCTGT CGCCATTCCC 
TTCCTCGCCG AGAAGACCGG CGCCGAAGTC ATCGCCGTGG CCGTCGACGT CGGCCAGGGC
GGCGAGGACA TGGAGGTCAT CCGGAAGCGG GCCATCGACT GCGGCGCCGT CGAGTCCGTC
GTCGTGGACG CCCGCGAGGA GTTCGCCTCC GACTTCTGCG TGCCCGCCCT GCAGGCCAAC
GCCCTCTACA TGGACCGCTA CCCGCTGGTC TCCGCGCTGT CGCGGCCGCT GATCGTCAAG
CACCTGGCCG CCGCGGCCAA GGAGTTCGGC GGCACCCACG TCTCCCACGG CTGCACCGGC
AAGGGCAACG ACCAGGTCCG GTTCGAGGCC GGCCTGGCCG CGCTCTTCCC CGAGCTCAAG
GTCATCGCCC CCGCCCGCGA CTACGCGTGG ACCCGCGACA AGGCGATCGC CTACGCCGAG
GAGAAGAACC TCCCGATCGA GACCAGCAAG AAGAACCCCT ACTCGATCGA CCAGAACATC
TGGGGCCGGG CCGTCGAGAC CGGCTTCCTG GAGGACATCT GGAACGGCCC CGTCGAGGAC
GTCTACTCCT ACACCGCCGA CCCGGCCGAG CCGCGCGAGG CCGACGAGGT CATCGTCAGC
TTCGTCAAGG GCGTCCCGGT CGCGCTGGAC GGGCGTCACC TGACCCCGTT CCAGGTCATC
GCAGAGCTCA ACCGGCGCGC CGGCGCCCAG GGCGTCGGCC GGCTCGACAT GGTCGAGGAC
CGGCTCGTCG GCATCAAGTC CCGCGAGGTC TACGAGGCGC CCGGCGCCAT CGCGCTGATC
ACCGCGCACA TGGAGCTGGA GAACGTCACC GTCGAGCGCG ACCTCGCCCG GTTCAAGCGG
TCGGTGGACC AGCGCTGGGG CGAGCTCGTC TACGACGGCC TCTGGTTCTC CCCGCTGAAG
AAGGCCCTGG ACGTCTTCAT CGCCGAGGCC CAGCAGCATG TCACCGGTGA GATCCGGATG
ACCCTGCACG GCGGCCGGGC CACGGTCACC GGCCGGCGCT CCGAGGCCTC GCTGTACGAC
TTCAACCTCG CCACCTACGA CACCGGCGAC ACCTTCGACC AGTCGCTCGC CAAGGGTTTC
GTCGAGCTGT GGAGCCTGCC CTCCAAGATC GCGTCCGCCC GGGACGCCCG ACTGGTCTGA
 
Protein sequence
MNDRVVLAFS GGLDTSVAIP FLAEKTGAEV IAVAVDVGQG GEDMEVIRKR AIDCGAVESV 
VVDAREEFAS DFCVPALQAN ALYMDRYPLV SALSRPLIVK HLAAAAKEFG GTHVSHGCTG
KGNDQVRFEA GLAALFPELK VIAPARDYAW TRDKAIAYAE EKNLPIETSK KNPYSIDQNI
WGRAVETGFL EDIWNGPVED VYSYTADPAE PREADEVIVS FVKGVPVALD GRHLTPFQVI
AELNRRAGAQ GVGRLDMVED RLVGIKSREV YEAPGAIALI TAHMELENVT VERDLARFKR
SVDQRWGELV YDGLWFSPLK KALDVFIAEA QQHVTGEIRM TLHGGRATVT GRRSEASLYD
FNLATYDTGD TFDQSLAKGF VELWSLPSKI ASARDARLV