Gene Ndas_3484 details

Gene Information

Locus tag: Ndas_3484
Symbol:
ID: 9247353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4176153
End bp: 4177601
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: argininosuccinate synthase
Protein accession: YP_003681391
Protein GI: 297562417
COG category:
COG ID:
TIGRFAM ID:
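
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4176153, 4177601)
print(length)                  # 1449
print(protein_length(length))  # 482
```

Both values agree with the Gene Length and Protein Length fields in the record.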


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.627119
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAGG TACTCACTTC CCTCCCCGTC GGCGAGCGCG TCGGTATCGC CTTCTCCGGT 
GGCCTCGACA CCTCGGTGGC GGTCGCGTGG ATGCGCGACA AGGGCGCCGT CCCGTACGCC
TACACCGCCG ACATCGGCCA GTACGACGAA CCCGACATCG CCTCGGTCCC CGGCCGCGCC
ACCGCCTACG GCGCCGAGGG CGCCCGCCTG GTGGACGGCC GGGAGGCGCT GGTGGAGGAG
GGTTTCGCGG CGCTGGCCTG CGGAGCCTTC CACATCCGCT CCGGCGGCCG CACCTACTTC
AACACCACCC CCCTCGGGCG GGCCGTCACG GGCACCCTGC TGGTGCGCGC GATGCTCGAG
GACGGCGTGC AGATCTGGGG CGACGGGTCC ACCTTCAAGG GCAACGACAT CGAGCGGTTC
TACCGCTACG GCCTGCTCGC CAACCCCTCC CTGCGGATCT ACAAGCCGTG GCTGGACGCC
GACTTCGTCA ACGAGCTCGG CGGCCGCAAG GAGATGTCGG AGTGGCTGCT GGCCCACGGC
CTGCCCTACC GGGACAGCAC CGAGAAGGCC TACTCCACCG ACGCCAACAT CTGGGGCGCC
ACGCACGAGG CCAAGGCGCT CGAACACCTC GACACCGGCA TCGAGATCGT CGAGCCCATC
ATGGGCGTGC GGTTCTGGGA CCCCGAGGTC GAGATCACCC CCGAGGACGT CACGATCGGC
TTCGAGCAGG GCCGCCCGGT GACCGTCAAC GGCAAGACCT TCGCCACCGC CGTCGACCTG
GTCAACGAGG TCAACGCCAT CGGCGGCCGG CACGGCCTGG GCATGTCGGA CCAGATCGAG
AACCGCGTCA TCGAGGCCAA GAGCCGCGGC ATCTACGAGG CCCCGGGCAT GGCGCTGCTG
CACGCGGCCT ACGAACGGCT GGTCAACGCG GTCCACAACG AGGACACCCT CGCCAGCTAC
CACAACGACG GCCGACGGCT CGGCAGGCTG CTCTACGAGG GCCGCTGGCT GGAGCCCCAG
GCGCTGATGC TGCGCGAGGC CCTCCAGCGC TGGGTGGGCA CGGCGGTCAC CGGCGAGGTG
ACCCTGCGGC TGCGGCGCGG CGAGGACTAC TCCCTCATGG ACACCACCGG GGCGGCGTTC
AGCTACCACC CGGACAAGCT GTCCATGGAG CGGACCGAGG ACTCCGCGTT CGGCCCGGTC
GACCGCATCG GCCAGCTGAC CATGCGCAAC CTCGACATCG CGGACTCCCG CGCCAAGCTG
GAGGAGTACT CCAGGGTCGG CATGGTCGGC ACCTCGCACC CGACGTCGAT CGGCGCCGCC
CAGGCGGCCT CGACCGGGCT CATCGGCGCG ATGCCCGAGG GCGGCGCCGA GGCGATCGCC
TCCCGCGGCC AGGCCCCCGA GAGCGACGAC CTGCTCGACC ACGCCGCGAT GGAGTCCGGC
AACGACTGA
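
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks need to be removed before it can be used. The following is a minimal Python sketch of that cleanup together with a GC-fraction calculation of the kind that would produce the GC content field; the helper names are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence (six blocks of ten bases).
first_line = "ATGTCCAAGG TACTCACTTC CCTCCCCGTC GGCGAGCGCG TCGGTATCGC CTTCTCCGGT"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this 60-base sample only
```

Passing the full sequence text through `clean_sequence` should give a string of 1449 bases, matching the Gene Length field.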
 
Protein sequence
MSKVLTSLPV GERVGIAFSG GLDTSVAVAW MRDKGAVPYA YTADIGQYDE PDIASVPGRA 
TAYGAEGARL VDGREALVEE GFAALACGAF HIRSGGRTYF NTTPLGRAVT GTLLVRAMLE
DGVQIWGDGS TFKGNDIERF YRYGLLANPS LRIYKPWLDA DFVNELGGRK EMSEWLLAHG
LPYRDSTEKA YSTDANIWGA THEAKALEHL DTGIEIVEPI MGVRFWDPEV EITPEDVTIG
FEQGRPVTVN GKTFATAVDL VNEVNAIGGR HGLGMSDQIE NRVIEAKSRG IYEAPGMALL
HAAYERLVNA VHNEDTLASY HNDGRRLGRL LYEGRWLEPQ ALMLREALQR WVGTAVTGEV
TLRLRRGEDY SLMDTTGAAF SYHPDKLSME RTEDSAFGPV DRIGQLTMRN LDIADSRAKL
EEYSRVGMVG TSHPTSIGAA QAASTGLIGA MPEGGAEAIA SRGQAPESDD LLDHAAMESG
ND