Gene Ndas_2434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2434 
Symbol 
ID9246284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2887454 
End bp2888884 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content72% 
IMG OID 
Productadenylosuccinate lyase 
Protein accessionYP_003680360 
Protein GI297561386 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.114262 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.241079 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGCGA AACCGCTGAT CCCTGATGTC CTGGCTGCCC GCTACGCGTC CGCGGAGCTG 
ACCCGGCTGT GGTCCCCCGA GTACAAGGTG GTCGCCGAGC GGCGGCTGTG GCTGGCCGTG
CTGCGCGCCC AGGCGCGTCT GGGCGTGGAC GTCCCGGCCG GTGTGGTCGA GGACTACGAG
AAGGTCCTGG AGCAGGTGGA CCTGGCCTCC ATCGCCGAAC GCGAGCGGGT CACCCGCCAC
GACGTCAAGG CCCGCATCGA GGAGTTCAAC GCCCTCGCCG GGCACGAGCA CGTGCACAAG
GGCATGACCT CGCGCGACCT GACCGAGAAC GTCGAGCAGC TCCAGGTGCG CGACAGCCTG
CTGCTGGCCC GCGACCGGGT GGTGGCGCTG CTGGCGCGGC TGGGGAGGCT CTCGGCCGAG
TACGGCGAGA CGGTGATGGC GGGCCGCTCC CACAACGTGG CGGCGCAGGC CACGACGCTC
GGCAAGCGGT TCGCGTCGAT CGCCGACGAG GTGCTGGTCG CCCACGGGCG GCTGGAGGAG
CTGATCTCCC GCTACCCGCT GCGCGGGATC AAGGGCCCGG TGGGCACCGC GCAGGACATG
CTGGACCTGC TGGGCGGGGA CCGCGCCGCG CTCAGCGCGC TGGAGGACGG GGTGGCCTCC
CACCTCGGGT TCGAGCGGCG CTTCACCAGC GTGGGCCAGG TCTACCCGCG CTCGCTGGAC
TTCGAGGTGC TGACCGCGCT GGTGCAGCTG GCCGCCGGTC CCTCCTCGCT GGCCAAGACG
ATCCGCCTGA TGGCCGGGCA CGAGCTGGTC ACCGAGGGGT TCGCCGAGGG TCAGGTGGGC
TCCTCGGCCA TGCCGCACAA GATGAACACG CGCTCGTGCG AGCGGGTCAA CGGGCTGACC
GTGATCCTGC GCGGGTACGC GTCCATGGCC GGTGAACTGG CCGGGGACCA GTGGAACGAG
GGCGACGTGT CCTGCTCGGT GGTGCGCCGG GTGGCCCTGC CGGACGCGTT CTTCGCCTTC
GACGGGCTGG TGGAGACGAT GCTGACGGTG CTGGACGAGT TCGGGGCCTT CCCCGCGGTG
GTCTCCGCCG AGCTCGACCG CTACCTGCCG TTCCTGGCCA CGACCAAGAT GCTCATGGCC
GCGGTGCGCG CCGGGGTGGG CCGCGAGACC GCCCACGAGC TGATCAAGGA GCACGCGGTG
GGCTCGGCCC TGGCCATGCG CCAGGAGGGT GCGGGCAACC GGCTGCTGGA CCGCCTGGCC
GAGGACGGGC GCTTCCCGCT CGGCAGGGAG GAGCTGGACG CGCTGCTGGC CGACCGGATC
ACCTTCACCG GCGCCGCCGC CGACCAGGTG GCCGCCGTGG TGGGGCGGAT CGACGCGATC
GTGGCCGCGC ACCCGGAGGC GGCCGCCTAC TCCCCCGGTT CGATCCTGTA G
 
Protein sequence
MSAKPLIPDV LAARYASAEL TRLWSPEYKV VAERRLWLAV LRAQARLGVD VPAGVVEDYE 
KVLEQVDLAS IAERERVTRH DVKARIEEFN ALAGHEHVHK GMTSRDLTEN VEQLQVRDSL
LLARDRVVAL LARLGRLSAE YGETVMAGRS HNVAAQATTL GKRFASIADE VLVAHGRLEE
LISRYPLRGI KGPVGTAQDM LDLLGGDRAA LSALEDGVAS HLGFERRFTS VGQVYPRSLD
FEVLTALVQL AAGPSSLAKT IRLMAGHELV TEGFAEGQVG SSAMPHKMNT RSCERVNGLT
VILRGYASMA GELAGDQWNE GDVSCSVVRR VALPDAFFAF DGLVETMLTV LDEFGAFPAV
VSAELDRYLP FLATTKMLMA AVRAGVGRET AHELIKEHAV GSALAMRQEG AGNRLLDRLA
EDGRFPLGRE ELDALLADRI TFTGAAADQV AAVVGRIDAI VAAHPEAAAY SPGSIL