Gene Ndas_1429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1429 
Symbol 
ID9245279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1749655 
End bp1750953 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content70% 
IMG OID 
Productisocitrate lyase 
Protein accessionYP_003679367 
Protein GI297560393 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.307912 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCACCA GCGCTCGACG ACAGGAAGCC AGCGCCGCAC TCCAGCGCGA CTGGGAGACC 
AACCCCCGGT GGGCCGGTAT CGAGCGCACC TACTCCGCCG ACGACGTGGT CAAGCTCCGC
GGTTCGGTCA CCGAGGAGCA CACCCTGGCC CGCCGCGGCG CGGAGCGCCT GTGGGACCTG
CTCCGCACCG AGGACTACGT CAACACCCTC GGCGCCCTCA CCGGCAACCA GGCGGTCCAG
CAGGTCCGTG CCGGGCTGAA GGCCATCTAC CTCTCCGGCT GGCAGGTCGC GGCCGACGCC
AACCTCTCGG GCAACACCTA CCCCGACCAG AGCCTGTACC CGGCCAACTC GGTCCCGGCC
GTGGTCCGCC GCATCAACAA CGCGCTGATG CGCGCCGACC AGATCACCTG GGCCGAGGGC
GACGACAGCG CGCCCGAGTG GCTCGCCCCG ATCGTGGCCG ACGCCGAGGC CGGGTTCGGC
GGCGTCCTCA ACGCCTACGA GCTGATGCGG GGCATGATCG CCTCCGGCGC GGCGGGCGTG
CACTGGGAGG ACCAGCTGGC CTCCGAGAAG AAGTGCGGCC ACCTGGGCGG CAAGGTCCTC
ATCCCCACCG GGCAGCACGT CAAGACGCTC AACGCCGCCC GCCTGGCCGC GGACGTCGCC
GGTGTCCCGT CCCTGGTCAT CGCGCGGACC GACGCCCAGG CCGCGACCCT GATCACCAGC
GACGTCGACG AGCGCGACCA GCCCTTCATC ACGGGCGAGC GCACCGCCGA GGGCTTCTAC
CGCTTCCGCA ACGGCGTCGA GGCCTGCGTG GCGCGCGGCC TGGCCTACGC CCCGCACTCG
GACCTGCTGT GGATGGAGAC CGCCACCCCC GACCTGGAGG TGGCCCGCGA CTTCGCCGAG
CGGATCAAGG CCGAGTACCC GGACCAGATG CTGGCCTACA ACTGCTCGCC GTCCTTCAAC
TGGAAGCGCC ACCTGGACGA CGCGACCATC GCCAAGTTCC AGCGCGAGCT GGGCCACATG
GGCTACAAGT TCCAGTTCAT CACCCTGGCG GGCTTCCACT CGCTCAACTA CTCGATGTTC
AACCTGGCCA AGGGCTACGC CCAGAACGGG ATGACCTCGT ACGTCGAGCT CCAGGAGGCC
GAGTTCGCGG CCGAGTCGCA GGGCTACACC GCCACCCGCC ACCAGCGCGA GGTCGGCACC
GGCTACTTCG ACTCCATCAG CACGACCATC TCCCCCGAGG CCAGCACCAC GGCTCTCACC
GGCTCCACCG AGGAAGACCA GTTCTCAGCG GCTCACTGA
 
Protein sequence
MGTSARRQEA SAALQRDWET NPRWAGIERT YSADDVVKLR GSVTEEHTLA RRGAERLWDL 
LRTEDYVNTL GALTGNQAVQ QVRAGLKAIY LSGWQVAADA NLSGNTYPDQ SLYPANSVPA
VVRRINNALM RADQITWAEG DDSAPEWLAP IVADAEAGFG GVLNAYELMR GMIASGAAGV
HWEDQLASEK KCGHLGGKVL IPTGQHVKTL NAARLAADVA GVPSLVIART DAQAATLITS
DVDERDQPFI TGERTAEGFY RFRNGVEACV ARGLAYAPHS DLLWMETATP DLEVARDFAE
RIKAEYPDQM LAYNCSPSFN WKRHLDDATI AKFQRELGHM GYKFQFITLA GFHSLNYSMF
NLAKGYAQNG MTSYVELQEA EFAAESQGYT ATRHQREVGT GYFDSISTTI SPEASTTALT
GSTEEDQFSA AH