Gene Sare_1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1888 
Symbol 
ID5704190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2174208 
End bp2175671 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content72% 
IMG OID641271389 
Productargininosuccinate lyase 
Protein accessionYP_001536764 
Protein GI159037511 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0738965 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0427555 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGGGG TGGACGACAA GAGCCTGACC GAGAACAGCG CGGCCACCAA TCGGACGAGC 
CTGTGGGGAG CCCGCTTCGC CGGCGGCCCC GCCGAGGCCC TCGCGCGGCT GTCGGTGAGC
GTGCAGTTCG ACTGGCGGCT GGCCCCGTAC GACATCGCCG GATCCCGGGC ACACGCCCGG
GTCCTGGCCG GCGCCGGCCT GCTCGACCCG GAGGAACTGG GGCAGATCCT GGCGGCATTG
GACGACCTGG AGGCCGCCTG CGCCGCCGGC ACGTTCCGGC CTACCGTCGA CGACGAGGAC
GTACACACCG CCCTGGAGCG GGGCCTGCTG GAACGGCTCG GCCGTCTCGG CGGCAAGTTG
CGCGCCGGCC GTTCCCGCAA CGACCAGGTC GCCACGGACC TGCGGCTCTA CCTGCGCGAC
CACGCCCGTG GCGTGGCCGC TCGGCTGGTC GAACTGGCCG AGGCGTTGGT CGATCAGGCC
GGACGGCACG TGGAAACCGC GACACCGGGC ATGACGCATC TGCAACCCGC CCAGCCGGTC
ACCTTCGGAC ACTGGTTGCT CGCCCACGTG CAGCCGCTGC TGCGTGACCT GCAGCGACTG
CGGGACTGGG ACCACCGCAC CGCGGTCAGC CCGCTCGGAG CGGGTGCCCT CGCGGGCTCC
GGCCTGCCGC TGGACCCGGT GGCGGTCGCC CGGGAGCTGG GCTTCCGCAC GTCCTTCGCC
AACTCGATGG ACGCCGTCGC CGACCGGGAC TTCGTCGCCG AGTTCCTGTT CACCACGGCC
CTGATCGGCG TGCACCTGTC CCGCCTCGGC GAGGAGGTGG TGCTGTGGAC GTCGCCGGAG
TTCGGCTGGG TGGAGTTGGA TGACGCCTTC GCCACCGGTT CGTCGATCAT GCCGCAGAAG
AAGAACGCGG ACATCGCCGA GTTGGCCCGA GGCAAGTCCG GCCGGCTCGT CGGCGGGCTG
GTGAGCGTGC TCACCATGCT CAAGGGCCTG CCGATGGCGT ACGACCGGGA CATGCAGGAG
GACAAGGAGC CCGCCTTCGA CGCGGTGGAC ACGCTGGAGC TGCTGCTACC AGCCCTGGCG
GGGATGATCT CCACGATGAC GGTACGAGTC GACCGGCTGG TCGCCGCCGC GCCGGAGGGA
TTTTCGCTCG CCACCGAGGT GGCCGACTGG CTGGTCCGTC GCAGCGTGCC GTTCCGTGAG
GCACACGAGA TCACCGGACG GTTGGTGGCG CTCTGCGTGG CCCGGGGCTG TGCACTCGAC
GAGGTGTCGG ACGCCGACCT CGCCGCGGTC AGCGAGCACC TCGACCCCGC GGTGCGGGAC
GTGCTCTCGG TCCGCAGCGC CCTCGCCGCC CGCACCACCC CCGGCTCCAC GGGCCCCGGG
CCGGTCACCG ACCAACTCGC CACCGCCTCC GACCAGCTCA CCGGTTGGCG GGAGTGGGCC
GCCGAACAGG TCGTTCCCCG CTGA
 
Protein sequence
MGGVDDKSLT ENSAATNRTS LWGARFAGGP AEALARLSVS VQFDWRLAPY DIAGSRAHAR 
VLAGAGLLDP EELGQILAAL DDLEAACAAG TFRPTVDDED VHTALERGLL ERLGRLGGKL
RAGRSRNDQV ATDLRLYLRD HARGVAARLV ELAEALVDQA GRHVETATPG MTHLQPAQPV
TFGHWLLAHV QPLLRDLQRL RDWDHRTAVS PLGAGALAGS GLPLDPVAVA RELGFRTSFA
NSMDAVADRD FVAEFLFTTA LIGVHLSRLG EEVVLWTSPE FGWVELDDAF ATGSSIMPQK
KNADIAELAR GKSGRLVGGL VSVLTMLKGL PMAYDRDMQE DKEPAFDAVD TLELLLPALA
GMISTMTVRV DRLVAAAPEG FSLATEVADW LVRRSVPFRE AHEITGRLVA LCVARGCALD
EVSDADLAAV SEHLDPAVRD VLSVRSALAA RTTPGSTGPG PVTDQLATAS DQLTGWREWA
AEQVVPR