Gene Ndas_3055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3055 
Symbol 
ID9246911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3650825 
End bp3652243 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content74% 
IMG OID 
Productargininosuccinate lyase 
Protein accessionYP_003680971 
Protein GI297561997 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.097869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.940422 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACC AGCCCGAGGC GCTGCGCCTG TGGGGAGGCC GTTTCGAGGG CGGCCCCGAC 
CAGGCCCTGG CGCGCCTGTC GCTGAGCACC CACTTCGACT GGCGCCTGGC CCGCCACGAC
ATCGCCGGTT CGCGCGCCCA CGCCCGCGCC CTGCACCGGG CGGGCCTGCT CACCGCCGAC
GAGCTGGACC GCATGATCGA GGGCCTGGAC CGGCTGGAGG CCGACGTGGC CTCCGGGGCG
TTCACCCCCG TCCTGGAGGA CGAGGACGTG CACACCGCCC TGGAGCGCGG CTTCATCGAG
CGGGTCGGCA CCGAACTGGG CGGGCGGCTG CGCGCCGGCC GCTCCCGCAA CGACCAGATC
GCCACCCTCG TGCGCATGTA CCTGCGCGAG GAGGCCCGGC AGATCGCCGA CCAGATCCTG
GACCTGGTGC GGGCCCTGGC CGACCAGGCC GCGGCCCACC CGGACGCCGC CATGCCCGGC
CGCACCCACC TCCAGCACGC CCAGCCCGTG CTGCTCGCCC ACCAGCTCAT GGCGCACGCC
TGGCCCTTGG TGCGCGACGT CGAGCGGCTG CGCGACTGGG ACCGCCGCGC CGCGGTGTCC
GCCTACGGCT CCGGCGCGCT GGCCGGGTCC TCCCTGGGAC TGGACCCCCG CGCGGTCGCC
GCCGAGCTGG GCTTCCCCGA CTCGGTGGAC AACTCCATCG ACGGCACCGC CGCCCGCGAC
GTCGTCGCCG AGTTCGCCTT CGTCACCGCC ATGATCGGCG TGGACCTGTC CCGGCTCTCG
GAGGAGGTCA TCCTGTGGGC GACGAAGGAG TTCTCCTTCG TCACACTCGA CGACGCCTTC
TCCACCGGCT CCTCGATCAT GCCCCAGAAG AAGAACCCCG ACGTCGCCGA ACTCGCCCGC
GGCAAGGCCG GGCGCCTCGT CGGCGACCTC ACCGGGCTGC TCACCACCCT CAAGGGGCTC
CCGCTGGCCT ACAACCGGGA CCTGCAGGAG GACAAGGAAC CGGTCTTCGA CGCGGTGGAC
ACCCTCCACC TGCTGCTGCC CGCGATGACC GGCATGGTCG CCACCCTCAC CTTCCACACC
GACCGGATGG CCGAACTCGC CCCGCAGGGC TTCTCCCTGG CCACCGACAT CGCCGAGTGG
CTGGTGCGCG AGCGCGTGCC CTTCCGGGAG GCGCACGAGA TCGCGGGCGC CTGCGTGCGC
GTGTGCGAGG AGCGCGGCAT CGACCTGCCC GACCTCGGCG ACGACGACCT CGCCGCCGTC
TCAGAGCACC TGACCCCGGC CGTGCGCGAG GTCCTCAGCG TGTCCGGCTC GCTGGCCTCG
CGCTCCGACA AGGGCGGCAC CGCCCCCGTG CGCGTGGCCG AGCAGCTGGA GAAGCTGCGC
TCGGTCGTCG AGGAGCACCG CAAGGCCTTC ACCGCCTGA
 
Protein sequence
MADQPEALRL WGGRFEGGPD QALARLSLST HFDWRLARHD IAGSRAHARA LHRAGLLTAD 
ELDRMIEGLD RLEADVASGA FTPVLEDEDV HTALERGFIE RVGTELGGRL RAGRSRNDQI
ATLVRMYLRE EARQIADQIL DLVRALADQA AAHPDAAMPG RTHLQHAQPV LLAHQLMAHA
WPLVRDVERL RDWDRRAAVS AYGSGALAGS SLGLDPRAVA AELGFPDSVD NSIDGTAARD
VVAEFAFVTA MIGVDLSRLS EEVILWATKE FSFVTLDDAF STGSSIMPQK KNPDVAELAR
GKAGRLVGDL TGLLTTLKGL PLAYNRDLQE DKEPVFDAVD TLHLLLPAMT GMVATLTFHT
DRMAELAPQG FSLATDIAEW LVRERVPFRE AHEIAGACVR VCEERGIDLP DLGDDDLAAV
SEHLTPAVRE VLSVSGSLAS RSDKGGTAPV RVAEQLEKLR SVVEEHRKAF TA