Gene Ndas_0685 details

Gene Information

Locus tag: Ndas_0685
Symbol:
ID: 9244527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 841443
End bp: 842651
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: imidazolonepropionase
Protein accession: YP_003678636
Protein GI: 297559662
COG category:
COG ID:
TIGRFAM ID:
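
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive on both ends.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(841443, 842651)
print(gene_len, protein_len)  # 1209 402
```

Under these assumptions the result agrees with the listed Gene Length (1209 bp) and Protein Length (402 aa).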


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.298595
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.285434
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGG TACTCCTCAG CAACATCGGT CGGCTGTGGA CCGGGAGCGA GCTGATCACC 
AAGGCCGCGC TCCTGATGGA CGACGACCGG GTGGCCTGGG TCGGGCCCGC CGCCGAGCTC
CCCCAGAGCA TTCCGGGCGT GGTGGACGAC CTCACCGACG TCGACGACGT CGTCAACATC
GGCGGCGGCA TGATCACCCC GGGGCTGATC GACGCCCACT GCCACCCGGT GTACGCGGGG
GACCGCTACG CCGAGGTCAA CATGTGGGCC AAGGGCGCCT CCAAGGGGGA CATCTTCGCG
GCCGGGGGCG GTGTCTCCAC CACCGTCACC ATCACCCGGG GTACCGATCC CTGGACCCTG
TGCAACGCGG TCCGCGAACG GCTCCGGCAC TGGGTGCTGA CCGGCACCAC CACCGTGGAG
GCCAAGACCG GCTACCACCT GACCAGGGAC GGCGAACTGG CCGACGTGCG CCTGCTGCGC
TCCCTGGAGG AGGAGCCGGG CATGCCGCGC CTGCACGTCA CCTTCTTCCC CGCGCACGGG
GTGCCGCCCG AGTTCTTCGG CAGACCCAGG GAGTACGCGG CCACCGCCGC CTCCTGGCTC
AGCGACGCCG CGCTGGCCGG TGCCGACGGG GTGGACGTGT ACTGCGACAA CCAGCAGTTC
ACCACCGAGG ACGCCCGCAT GCTGCTGGGG GTGGGCCAGT CCGCGGGGCT GCGCACCACC
CTGCACGCCT GCTCGCGCCC CCGGCACGGC GCCGTGCGCA TGGCCGCCGA GATCGGCTGC
TCCTCGGTGG ACCTGCTGCA CGAGACCGAC GAGCAGGACG TCCTGGCGCT GGCCGCCACC
CGCACACCGG TGGTGGCCTG CCCGACGACC TCGCTGCACG AGCGCCGCAC CCCGCCGGTG
CGGGCGCTGC TCGACCACGG CGTGCCCATC GGGCTGGGCA CCGACCACAA CCCCGGCCAG
TCGGGCACGA TGTCGATGCC GCTGGTGATC TCGCTGGCCA TCTCCATGTT CGAGATGACC
GTGCAGGAGG CCCTGTACGC CGCCACGGTG GGCAGCGCAC GCGCCCTGGG CCTGACCGAC
CGGGGCGTGC TCGCGCCCGG GAGCCTGGCC GACCTCGTCC AGTGGGACGC CGACCACGAG
GGCGCCTTCG CCTGGTCGAT GGGCCTCAAC ACCCTGCGGG TGTGGCAGGG CGGCAGGACC
ATCCGCTGA
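
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks might be joined and a GC fraction computed for comparison with the GC content field; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACGGCGG TACTCCTCAG CAACATCGGT CGGCTGTGGA CCGGGAGCGA GCTGATCACC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))  # GC fraction of this 60-base window only
```

Applying the same two functions to all lines of the gene sequence would give the whole-gene value to compare against the reported GC content.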
 
Protein sequence
MTAVLLSNIG RLWTGSELIT KAALLMDDDR VAWVGPAAEL PQSIPGVVDD LTDVDDVVNI 
GGGMITPGLI DAHCHPVYAG DRYAEVNMWA KGASKGDIFA AGGGVSTTVT ITRGTDPWTL
CNAVRERLRH WVLTGTTTVE AKTGYHLTRD GELADVRLLR SLEEEPGMPR LHVTFFPAHG
VPPEFFGRPR EYAATAASWL SDAALAGADG VDVYCDNQQF TTEDARMLLG VGQSAGLRTT
LHACSRPRHG AVRMAAEIGC SSVDLLHETD EQDVLALAAT RTPVVACPTT SLHERRTPPV
RALLDHGVPI GLGTDHNPGQ SGTMSMPLVI SLAISMFEMT VQEALYAATV GSARALGLTD
RGVLAPGSLA DLVQWDADHE GAFAWSMGLN TLRVWQGGRT IR