Gene Ndas_0177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0177 
Symbol 
ID9244008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp226317 
End bp227927 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content71% 
IMG OID 
Product2-isopropylmalate synthase/homocitrate synthase family protein 
Protein accessionYP_003678133 
Protein GI297559159 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.342023 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGACG ACAGTTTCCA CGTCTTCGAC ACCACGCTGC GCGACGGGGC CCAGCGCGAG 
GGGATCAACC TGCGGGTCTC CGACAAGCTG GCCATCGCCA AGCTGCTGGA CGACTTCGGG
GTGGCGTTCA TCGAGGGAGG GTGGCCGGGG GCCAACCCCA AGGACACCGA GTTCTTCCAG
CGGGCTTCAC GAGAGCTGAC GCTGGAGCAC GCGCAACTCA CCGCGTTCGG CGCGACCCGC
CGTGCCGGTG TGCGGGCCGC CGACGACCCA CAGGTGGCCG CACTGCGCGA CAGCGGCGCA
CCGGTCGTCA CCCTTGTCGC CAAGAGTGAC GACCGGCACG TCGAGCGCGC GCTGCGCACG
ACCCTCGACG AGAACCTCGC CATGATCGCC GACACGGTGT CCCACCTGAG CGAACACGGC
CAGCGCGTAT TCGTGGACTG CGAGCACTTC TTCGACGGAT ACCTCCACAA CCCCGACCAC
GCGCTCGACG TGGTCCGCGC CGCCGCCGGG GCCGGTGCCG ACGTCGTCGT CCTGTGCGAC
ACCAACGGCG GCATGCTCCC CACCGACGTC ACCCGTATCG TCACCGAGGT CCGCGAGGCC
ACCGGCGCAC GCCTGGGCAT CCACGCCCAG GACGACACCG GCTGCGCCGT CGCCAACACC
CTCGCCGCCG TGGACGCGGG CGCCACCCAC GTACAGTGCA CCGCCAACGG CTACGGCGAG
CGGGTCGGCA ACGCCAACCT CTTCTCCGTG GTCGGCGCGC TCACGCTCAA GCGCGGCCAG
GAGGTCCTCC CTGAGGGCTG CCTGGCCGAG ATGACCCGCG TGGCCACCGC CATCGCCGAG
ATCGTCAACC TCACCCCCGA CACGCACCAG CCCTACGTGG GGGTGTCGGC CTTCGCGCAC
AAGGCGGGGC TGCACGCCTC CGCGATCAAG GTCGACCCCG ACCTGTACCA GCACACGGAC
CCCGCGCTGG TCGGCAACGC CATGCGCATG CTCGTCTCCG ACATGGCCGG GCGGGCCTCC
ATCGAACTCA AGGCCAAGGA GTTGGGCCTG GACCTGTCCG GAGACCGCGC CCTGTCGGGG
CGGGCCGTGG AGCGGGTCAA GGGCCTGGAG CTGTCGGGCT ACAGCTTCGA GGCCGCCGAC
GCCTCCCTGG ACCTGCTGCT GCGCGAGGAA CTGGGGCAGC CGGTCCGCTA CTTCGACACC
GAGTCCTGGC GCGTCATCAC CGAACGCCGA CCCCGGGCCG GGTCCAGCCC CCTGGCCAGC
GACTACGAGA GCCTCACCGA GGCCACCGTC AAACTGCGGG TCAAGGGCGA ACGCGTGATC
GCCACCGCGG AGGGCAACGG CCCCGTCAAC GCCCTGGACC GGGCGCTGCG CAGCGCCATG
GAGGGCGTGT ACACCGCGCT GGCCGGGCTG GAGCTGACCG ACTACAAGGT CCGCATCCTG
GAGGGCAGCT CCGGCACCAA CGCCATCACC CGCATCCTCA TCACCTTCAG CGACGGGGTG
GGGGAGTGGA CCACGGTGGG CGTGGGCCCC AACGTCGTCG ACGCGTCCTG GGTCGCCCTC
GAACAGGCCG TCACCTACGG GCTCCTGCGC CAGGGCTACC CGCAGGGCTG A
 
Protein sequence
MRDDSFHVFD TTLRDGAQRE GINLRVSDKL AIAKLLDDFG VAFIEGGWPG ANPKDTEFFQ 
RASRELTLEH AQLTAFGATR RAGVRAADDP QVAALRDSGA PVVTLVAKSD DRHVERALRT
TLDENLAMIA DTVSHLSEHG QRVFVDCEHF FDGYLHNPDH ALDVVRAAAG AGADVVVLCD
TNGGMLPTDV TRIVTEVREA TGARLGIHAQ DDTGCAVANT LAAVDAGATH VQCTANGYGE
RVGNANLFSV VGALTLKRGQ EVLPEGCLAE MTRVATAIAE IVNLTPDTHQ PYVGVSAFAH
KAGLHASAIK VDPDLYQHTD PALVGNAMRM LVSDMAGRAS IELKAKELGL DLSGDRALSG
RAVERVKGLE LSGYSFEAAD ASLDLLLREE LGQPVRYFDT ESWRVITERR PRAGSSPLAS
DYESLTEATV KLRVKGERVI ATAEGNGPVN ALDRALRSAM EGVYTALAGL ELTDYKVRIL
EGSSGTNAIT RILITFSDGV GEWTTVGVGP NVVDASWVAL EQAVTYGLLR QGYPQG